v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019

Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,208 papers shown

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric

Yaowei Wang

Yonghong Tian

414

20 Nov 2022

Decoding Attention from Gaze: A Benchmark Dataset and End-to-End Models

Karan Uppal

Jaeah Kim

Shashank Singh

142

20 Nov 2022

DiffusionDet: Diffusion Model for Object DetectionIEEE International Conference on Computer Vision (ICCV), 2022

Shoufa Chen

Pei Sun

Yibing Song

Ping Luo

566

692

17 Nov 2022

^3

ETR: Decoder Distillation for Detection Transformer

210

17 Nov 2022

Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association ApproachPattern Recognition (Pattern Recogn.), 2022

Ngan Le

301

17 Nov 2022

Towards 3D Object Detection with 2D Supervision

Xiangyu Zhang

236

15 Nov 2022

3D Cascade RCNN: High Quality Object Detection in Point CloudsIEEE Transactions on Image Processing (IEEE TIP), 2022

Qi Cai

Yingwei Pan

Ting Yao

Tao Mei

3DPC

232

15 Nov 2022

YORO -- Lightweight End to End Visual Grounding

257

15 Nov 2022

PatchRefineNet: Improving Binary Segmentation by Incorporating Signals from Optimal Patch-wise BinarizationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

S. Nagendra

Chaopeng Shen

Daniel Kifer

276

12 Nov 2022

Prior-enhanced Temporal Action Localization using Subject-aware Spatial AttentionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

207

10 Nov 2022

Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer

172

09 Nov 2022

Are Face Detection Models Biased?IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2022

191

07 Nov 2022

Large Scale Radio Frequency Wideband Signal Detection & Recognition

266

04 Nov 2022

Deep Learning based Defect classification and detection in SEM images: A Mask R-CNN approach

152

03 Nov 2022

Translated Skip Connections -- Expanding the Receptive Fields of Fully Convolutional Neural NetworksInternational Conference on Information Photonics (ICIP), 2022

Joshua Bruton

Hairong Wang

SSeg

03 Nov 2022

PolyBuilding: Polygon Transformer for End-to-End Building Extraction

210

03 Nov 2022

Pair DETR: Contrastive Learning Speeds Up DETR Training

289

29 Oct 2022

ProContEXT: Exploring Progressive Context Transformer for TrackingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

382

27 Oct 2022

Refining Action Boundaries for One-stage DetectionAdvanced Video and Signal Based Surveillance (AVSS), 2022

Dima Damen

173

25 Oct 2022

Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations

261

24 Oct 2022

Towards Unifying Reference Expression Generation and ComprehensionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

246

24 Oct 2022

Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse Geo-Annotations (Full Version)

Maximilian Bernhard

Matthias Schubert

ObjD

256

24 Oct 2022

RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing DataIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022

Yangfan Zhan

Zhitong Xiong

Yuan. Yuan

276

207

23 Oct 2022

Transformers For Recognition In Overhead Imagery: A Reality CheckIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

301

23 Oct 2022

YOWO-Plus: An Incremental Improvement

Jianhua Yang

ViT

155

20 Oct 2022

JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and TrackingComputer Vision and Pattern Recognition (CVPR), 2022

258

20 Oct 2022

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationNeural Information Processing Systems (NeurIPS), 2022

Hao Zhao

293

19 Oct 2022

Understanding Embodied Reference with Touch-Line TransformerInternational Conference on Learning Representations (ICLR), 2022

Hao Zhao

359

11 Oct 2022

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-trainingIEEE International Conference on Computer Vision (ICCV), 2022

Adrian Bulat

Ricardo Guerrero

Brais Martínez

Georgios Tzimiropoulos

316

10 Oct 2022

Video Referring Expression Comprehension via Transformer with Content-aware Query

319

06 Oct 2022

Spatio-Temporal Learnable Proposals for End-to-End Video Object DetectionBritish Machine Vision Conference (BMVC), 2022

K. Hashmi

D. Stricker

Muhammamd Zeshan Afzal

270

05 Oct 2022

FQDet: Fast-converging Query-based Detector

Cédric Picron

Punarjay Chakravarty

Tinne Tuytelaars

ObjD

356

05 Oct 2022

DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle AdjustmentIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022

Laura Leal-Taixé

192

29 Sep 2022

Access Control with Encrypted Feature Maps for Object Detection Models

Teru Nagamori

Hiroki Ito

AprilPyone Maungmaung

Hitoshi Kiya

243

29 Sep 2022

Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual GroundingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

294

28 Sep 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual TasksNeural Information Processing Systems (NeurIPS), 2022

Fan Yang

...

Ming Tang

242

28 Sep 2022

Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video GroundingNeural Information Processing Systems (NeurIPS), 2022

295

27 Sep 2022

$D$^{\bf{3}}$: Duplicate Detection Decontaminator for Multi-Athlete Tracking in Sports Videos$

^{\bf{3}}

: Duplicate Detection Decontaminator for Multi-Athlete Tracking in Sports VideosAsian Conference on Computer Vision (ACCV), 2022

Rui He

Zehua Fu

Qingjie Liu

Yunhong Wang

Xunxun Chen

287

25 Sep 2022

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance FieldsIEEE Robotics and Automation Letters (RA-L), 2022

Mingyu Ding

Jingdong Wang

261

24 Sep 2022

MGTR: End-to-End Mutual Gaze Detection with TransformerAsian Conference on Computer Vision (ACCV), 2022

134

22 Sep 2022

Detecting Rotated Objects as Gaussian Distributions and Its 3-D GeneralizationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Xue Yang

273

120

22 Sep 2022

IoU-Enhanced Attention for End-to-End Task Specific Object DetectionAsian Conference on Computer Vision (ACCV), 2022

259

21 Sep 2022

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world DetectionNeural Information Processing Systems (NeurIPS), 2022

Lewei Yao

Jianhua Han

Youpeng Wen

Xiaodan Liang

Dan Xu

Wei Zhang

Zhenguo Li

Chunjing Xu

Hang Xu

CLIP VLM

382

235

20 Sep 2022

Differentiable Topology-Preserved Distance Transform for Pulmonary Airway Segmentation

Minghui Zhang

Guangyao Yang

Yun Gu

297

17 Sep 2022

ScreenQA: Large-Scale Question-Answer Pairs over Mobile App ScreenshotsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Victor Carbune

Jason Lin

Maria Wang

Yun Zhu

Jindong Chen

RALM

1.1K

16 Sep 2022

Towards Improving Calibration in Object Detection Under Domain ShiftNeural Information Processing Systems (NeurIPS), 2022

Muhammad Akhtar Munir

M. H. Khan

M. Sarfraz

Mohsen Ali

310

15 Sep 2022

ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers

Achin Jain

Kibok Lee

Gurumurthy Swaminathan

Bernt Schiele

320

13 Sep 2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

...

464

2,923

07 Sep 2022

Multi-Grained Angle Representation for Remote Sensing Object DetectionIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022

230

07 Sep 2022

CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion

Lei Yang

Jun Li

311

110

06 Sep 2022