ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression
v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXiv (abs)PDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,204 papers shown
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global
  Association Approach
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association ApproachPattern Recognition (Pattern Recogn.), 2022
Pha Nguyen
Kha Gia Quach
C. Duong
Son Lam Phung
Ngan Le
Khoa Luu
255
20
0
17 Nov 2022
Towards 3D Object Detection with 2D Supervision
Towards 3D Object Detection with 2D Supervision
Jinrong Yang
Tiancai Wang
Zheng Ge
Weixin Mao
Xiaoping Li
Xiangyu Zhang
142
4
0
15 Nov 2022
3D Cascade RCNN: High Quality Object Detection in Point Clouds
3D Cascade RCNN: High Quality Object Detection in Point CloudsIEEE Transactions on Image Processing (IEEE TIP), 2022
Qi Cai
Yingwei Pan
Ting Yao
Tao Mei
3DPC
176
33
0
15 Nov 2022
YORO -- Lightweight End to End Visual Grounding
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
173
27
0
15 Nov 2022
PatchRefineNet: Improving Binary Segmentation by Incorporating Signals
  from Optimal Patch-wise Binarization
PatchRefineNet: Improving Binary Segmentation by Incorporating Signals from Optimal Patch-wise BinarizationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
S. Nagendra
Chaopeng Shen
Daniel Kifer
MQ
232
8
0
12 Nov 2022
Prior-enhanced Temporal Action Localization using Subject-aware Spatial
  Attention
Prior-enhanced Temporal Action Localization using Subject-aware Spatial AttentionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yifan Liu
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Haoqian Wang
178
0
0
10 Nov 2022
Efficient Joint Detection and Multiple Object Tracking with Spatially
  Aware Transformer
Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer
S. S. Nijhawan
Leo Hoshikawa
Atsushi Irie
Masakazu Yoshimura
Junji Otsuka
Takeshi Ohashi
VOTViT
147
3
0
09 Nov 2022
Are Face Detection Models Biased?
Are Face Detection Models Biased?IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2022
S. Mittal
K. Thakral
P. Majumdar
Mayank Vatsa
Richa Singh
CVBM
149
8
0
07 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
226
9
0
04 Nov 2022
Deep Learning based Defect classification and detection in SEM images: A
  Mask R-CNN approach
Deep Learning based Defect classification and detection in SEM images: A Mask R-CNN approach
Bappaditya Dey
Enrique Dehaerne
Kasem Khalil
S. Halder
Philippe Leray
Magdy A. Bayoumi
117
22
0
03 Nov 2022
Translated Skip Connections -- Expanding the Receptive Fields of Fully
  Convolutional Neural Networks
Translated Skip Connections -- Expanding the Receptive Fields of Fully Convolutional Neural NetworksInternational Conference on Information Photonics (ICIP), 2022
Joshua Bruton
Hairong Wang
SSeg
85
4
0
03 Nov 2022
PolyBuilding: Polygon Transformer for End-to-End Building Extraction
PolyBuilding: Polygon Transformer for End-to-End Building Extraction
Yuan Hu
Zhibin Wang
Zhou Huang
Yu Liu
3DVViT
182
10
0
03 Nov 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training
Pair DETR: Contrastive Learning Speeds Up DETR Training
M. Iranmanesh
Xiaotong Chen
Kuo-Chin Lien
ViT
197
0
0
29 Oct 2022
ProContEXT: Exploring Progressive Context Transformer for Tracking
ProContEXT: Exploring Progressive Context Transformer for TrackingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jinpeng Lan
Zhi-Qi Cheng
Ju He
Chenyang Li
Bin Luo
Xueting Bao
Wangmeng Xiang
Yifeng Geng
Xuansong Xie
336
50
0
27 Oct 2022
Refining Action Boundaries for One-stage Detection
Refining Action Boundaries for One-stage DetectionAdvanced Video and Signal Based Surveillance (AVSS), 2022
Hanyuan Wang
Majid Mirmehdi
Dima Damen
Toby Perrett
ObjD
153
1
0
25 Oct 2022
Strong-TransCenter: Improved Multi-Object Tracking based on Transformers
  with Dense Representations
Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations
Amit Galor
Roy Orfaig
B. Bobrovsky
VOT
212
7
0
24 Oct 2022
Towards Unifying Reference Expression Generation and Comprehension
Towards Unifying Reference Expression Generation and ComprehensionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Duo Zheng
Tao Kong
Ya Jing
Jiaan Wang
Xiaojie Wang
ObjD
177
9
0
24 Oct 2022
Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse
  Geo-Annotations (Full Version)
Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse Geo-Annotations (Full Version)
Maximilian Bernhard
Matthias Schubert
ObjD
224
3
0
24 Oct 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing
  Data
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing DataIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
242
188
0
23 Oct 2022
Transformers For Recognition In Overhead Imagery: A Reality Check
Transformers For Recognition In Overhead Imagery: A Reality CheckIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Francesco Luzi
Aneesh Gupta
L. Collins
Kyle Bradbury
Jordan M. Malof
ViT
201
4
0
23 Oct 2022
YOWO-Plus: An Incremental Improvement
YOWO-Plus: An Incremental Improvement
Jianhua Yang
ViT
131
5
0
20 Oct 2022
JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and
  Tracking
JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and TrackingComputer Vision and Pattern Recognition (CVPR), 2022
Edward Vendrow
Duy-Tho Le
Jianfei Cai
Hamid Rezatofighi
207
38
0
20 Oct 2022
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun
  Distillation
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationNeural Information Processing Systems (NeurIPS), 2022
Pengfei Li
Beiwen Tian
Yongliang Shi
Xiaoxue Chen
Hao Zhao
Guyue Zhou
Ya Zhang
262
29
0
19 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
Understanding Embodied Reference with Touch-Line TransformerInternational Conference on Learning Representations (ICLR), 2022
Yongqian Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
288
20
0
11 Oct 2022
FS-DETR: Few-Shot DEtection TRansformer with prompting and without
  re-training
FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-trainingIEEE International Conference on Computer Vision (ICCV), 2022
Adrian Bulat
Ricardo Guerrero
Brais Martínez
Georgios Tzimiropoulos
255
50
0
10 Oct 2022
Video Referring Expression Comprehension via Transformer with
  Content-aware Query
Video Referring Expression Comprehension via Transformer with Content-aware Query
Ji Jiang
Meng Cao
Tengtao Song
Yuexian Zou
279
5
0
06 Oct 2022
Spatio-Temporal Learnable Proposals for End-to-End Video Object
  Detection
Spatio-Temporal Learnable Proposals for End-to-End Video Object DetectionBritish Machine Vision Conference (BMVC), 2022
K. Hashmi
D. Stricker
Muhammamd Zeshan Afzal
233
8
0
05 Oct 2022
FQDet: Fast-converging Query-based Detector
FQDet: Fast-converging Query-based Detector
Cédric Picron
Punarjay Chakravarty
Tinne Tuytelaars
ObjD
303
2
0
05 Oct 2022
DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and
  Photometric Bundle Adjustment
DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle AdjustmentIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
M. Gladkova
N. Korobov
Nikolaus Demmel
Aljovsa Ovsep
Laura Leal-Taixé
Zorah Lähner
3DPC
177
8
0
29 Sep 2022
Access Control with Encrypted Feature Maps for Object Detection Models
Access Control with Encrypted Feature Maps for Object Detection Models
Teru Nagamori
Hiroki Ito
AprilPyone Maungmaung
Hitoshi Kiya
149
2
0
29 Sep 2022
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual
  Grounding
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual GroundingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Fengyuan Shi
Ruopeng Gao
Weilin Huang
Limin Wang
230
49
0
28 Sep 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual
  Tasks
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual TasksNeural Information Processing Systems (NeurIPS), 2022
Zhiyang Chen
Yousong Zhu
Zhaowen Li
Fan Yang
Wei Li
...
Honghui Dong
Liwei Wu
Rui Zhao
Jinqiao Wang
Ming Tang
VLMVOS
221
17
0
28 Sep 2022
Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video
  Grounding
Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video GroundingNeural Information Processing Systems (NeurIPS), 2022
Yang Jin
Yongzhi Li
Zehuan Yuan
Yadong Mu
246
48
0
27 Sep 2022
D$^{\bf{3}}$: Duplicate Detection Decontaminator for Multi-Athlete
  Tracking in Sports Videos
D3^{\bf{3}}3: Duplicate Detection Decontaminator for Multi-Athlete Tracking in Sports VideosAsian Conference on Computer Vision (ACCV), 2022
Rui He
Zehua Fu
Qingjie Liu
Yunhong Wang
Xunxun Chen
191
0
0
25 Sep 2022
NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance
  Fields
NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance FieldsIEEE Robotics and Automation Letters (RA-L), 2022
Jiankai Sun
Yan Xu
Mingyu Ding
Hongwei Yi
Chen Wang
Jingdong Wang
Liangjun Zhang
Mac Schwager
236
13
0
24 Sep 2022
MGTR: End-to-End Mutual Gaze Detection with Transformer
MGTR: End-to-End Mutual Gaze Detection with TransformerAsian Conference on Computer Vision (ACCV), 2022
Han Guo
Zhengxi Hu
Jingtai Liu
ViT
116
11
0
22 Sep 2022
Detecting Rotated Objects as Gaussian Distributions and Its 3-D
  Generalization
Detecting Rotated Objects as Gaussian Distributions and Its 3-D GeneralizationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xue Yang
Gefan Zhang
Xiaojiang Yang
Yue Zhou
Wentao Wang
Jin Tang
Tao He
Junchi Yan
237
117
0
22 Sep 2022
IoU-Enhanced Attention for End-to-End Task Specific Object Detection
IoU-Enhanced Attention for End-to-End Task Specific Object DetectionAsian Conference on Computer Vision (ACCV), 2022
Jing Zhao
Shengjian Wu
Li Sun
Qingli Li
229
9
0
21 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for
  Open-world Detection
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world DetectionNeural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIPVLM
349
220
0
20 Sep 2022
Differentiable Topology-Preserved Distance Transform for Pulmonary
  Airway Segmentation
Differentiable Topology-Preserved Distance Transform for Pulmonary Airway Segmentation
Minghui Zhang
Guangyao Yang
Yun Gu
249
6
0
17 Sep 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App ScreenshotsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yu-Chung Hsiao
Fedir Zubach
Maria Wang
Jindong Chen
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
989
48
0
16 Sep 2022
Towards Improving Calibration in Object Detection Under Domain Shift
Towards Improving Calibration in Object Detection Under Domain ShiftNeural Information Processing Systems (NeurIPS), 2022
Muhammad Akhtar Munir
M. H. Khan
M. Sarfraz
Mohsen Ali
208
27
0
15 Sep 2022
ComplETR: Reducing the cost of annotations for object detection in dense
  scenes with vision transformers
ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers
Achin Jain
Kibok Lee
Gurumurthy Swaminathan
Han Yang
Bernt Schiele
Avinash Ravichandran
Onkar Dabeer
ViT
294
1
0
13 Sep 2022
YOLOv6: A Single-Stage Object Detection Framework for Industrial
  Applications
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
Chuyin Li
Lu Li
Hongliang Jiang
Kaiheng Weng
Yifei Geng
...
Linyuan Zhou
Xiaoming Xu
Xiangxiang Chu
Xiaoming Wei
Xiaolin K. Wei
ObjD
404
2,763
0
07 Sep 2022
Multi-Grained Angle Representation for Remote Sensing Object Detection
Multi-Grained Angle Representation for Remote Sensing Object DetectionIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Hao Wang
Zhanchao Huang
Zhengchao Chen
Ying Song
Wei Li
206
18
0
07 Sep 2022
CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object
  Tracking with Camera-LiDAR Fusion
CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion
Li-e Wang
Wei Wei
Wenyuan Qin
Xiaoyu Li
Lei Yang
Zhiwei Li
Lei Zhu
Hong Wang
Jun Li
Hua Liu
3DPCVOT
268
100
0
06 Sep 2022
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards
  Video Object Detection
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object DetectionEuropean Conference on Computer Vision (ECCV), 2022
Han Wang
Jun Tang
Xiaodong Liu
Shanyan Guan
Rong Xie
Li Song
ViT
140
35
0
06 Sep 2022
Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection
  in Aerial Images
Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial ImagesIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Zhanchao Huang
Wei Li
X. Xia
Hao Wang
Ran Tao
336
18
0
06 Sep 2022
RLIP: Relational Language-Image Pre-training for Human-Object
  Interaction Detection
RLIP: Relational Language-Image Pre-training for Human-Object Interaction DetectionNeural Information Processing Systems (NeurIPS), 2022
Hangjie Yuan
Jianwen Jiang
Samuel Albanie
Tao Feng
Ziyuan Huang
Dong Ni
Mingqian Tang
VLM
374
76
0
05 Sep 2022
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in
  Semi-supervised Object Detection
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Xinjiang Wang
Xingyi Yang
Shilong Zhang
Yijiang Li
Xue Jiang
Shijie Fang
Chengqi Lyu
Kaibing Chen
Wayne Zhang
316
90
0
04 Sep 2022
Previous
123...151617...232425
Next
Page 16 of 25
Pageof 25