1902.09630
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Papers citing
"Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,204 papers shown
Single-Stage Visual Query Localization in Egocentric Videos
Neural Information Processing Systems (NeurIPS), 2023
Hanwen Jiang
Santhosh Kumar Ramakrishnan
Kristen Grauman
269
21
0
15 Jun 2023
Relation-Aware Diffusion Model for Controllable Poster Layout Generation
International Conference on Information and Knowledge Management (CIKM), 2023
Fengheng Li
An Liu
Wei Feng
Honghe Zhu
Yaoyu Li
...
Jingjing Lv
Xin Zhu
Jun-Jun Shen
Zhangang Lin
Jingping Shao
187
41
0
15 Jun 2023
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ziqiao Ma
Jiayi Pan
J. Chai
ObjD
VLM
218
12
0
14 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Johannes Czech
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
504
25
0
14 Jun 2023
Single-Stage Visual Relationship Learning using Conditional Queries
Neural Information Processing Systems (NeurIPS), 2023
Alakh Desai
Tz-Ying Wu
Subarna Tripathi
Nuno Vasconcelos
224
9
0
09 Jun 2023
SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth
Zelin Liu
Xinggang Wang
Cheng Wang
Wenyu Liu
X. Bai
VOS
VOT
378
97
0
08 Jun 2023
Object Detection with Transformers: A Review
Italian National Conference on Sensors (INS), 2023
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
423
58
0
07 Jun 2023
Language Adaptive Weight Generation for Multi-task Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2023
Wei Su
Peihan Miao
Huanzhang Dou
Gaoang Wang
Liang Qiao
Zheyang Li
Xi Li
ObjD
297
50
0
06 Jun 2023
Student Classroom Behavior Detection based on Improved YOLOv7
Fan Yang
149
12
0
06 Jun 2023
Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism
Image and Vision Computing (IVC), 2023
Haoxuan Xu
Songning Lai
Xianyang Li
Y. Yang
ViT
263
17
0
31 May 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
203
9
0
31 May 2023
Table Detection for Visually Rich Document Images
Knowledge-Based Systems (KBS), 2023
Bin Xiao
Murat Simsek
B. Kantarci
Ala Abu Alkheir
197
11
0
30 May 2023
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yi Tu
Ya Guo
Huan Chen
Jinyang Tang
209
22
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
International Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
328
142
0
29 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Guoying Gu
Yu Qiao
Hao Dong
Zhongjiang He
Shiyang Feng
VOS
337
58
0
25 May 2023
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
International Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Kaiyang Zhou
Chen Huang
Chen Change Loy
238
14
0
24 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
330
92
0
23 May 2023
A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics
Optimization Letters (Optim. Lett.), 2023
Francesco Marchetti
Sabrina Guastavino
C. Campi
F. Benvenuto
Michele Piana
127
2
0
22 May 2023
nnDetection for Intracranial Aneurysms Detection and Localization
Zehan Zhang
Negar Firoozeh
Shaojun Xia
Mahmud Mossa-Basha
Chengcheng Zhu
73
3
0
22 May 2023
Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail Data
IEEE International Conference on Computer Vision (ICCV), 2023
Na Dong
Yongqiang Zhang
Mingli Ding
G. Lee
203
6
0
22 May 2023
UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model
Pattern Recognition (Pattern Recogn.), 2023
Zhenghao Zhang
Shengfan Zhang
Zhichao Wei
Zuozhuo Dai
Siyu Zhu
VOS
VLM
304
21
0
22 May 2023
Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
IEEE International Conference on Robotics and Automation (ICRA), 2023
Long Bai
Mobarakol Islam
Lalithkumar Seenivasan
Hongliang Ren
185
52
0
19 May 2023
Selecting Learnable Training Samples is All DETRs Need in Crowded Pedestrian Detection
ACM Multimedia (ACM MM), 2023
Feng Gao
Jiaxu Leng
Ji Gan
Xinbo Gao
ViT
221
7
0
18 May 2023
Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Hang Xu
Xinyuan Liu
Haonan Xu
Yike Ma
Zunjie Zhu
C. Yan
Feng Dai
175
25
0
17 May 2023
Two-Stream Regression Network for Dental Implant Position Prediction
Expert Systems with Applications (ESWA), 2023
Xinquan Yang
Xuguang Li
Xuechen Li
Wenting Chen
Linlin Shen
Xuzhao Li
Yongqiang Deng
260
15
0
17 May 2023
Understanding 3D Object Interaction from a Single Image
IEEE International Conference on Computer Vision (ICCV), 2023
Shengyi Qian
David Fouhey
364
32
0
16 May 2023
CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
IEEE Transactions on Multimedia (IEEE TMM), 2023
Linhui Xiao
Xiaoshan Yang
Fang Peng
Ming Yan
Yaowei Wang
Changsheng Xu
ObjD
VLM
472
62
0
15 May 2023
CLRerNet: Improving Confidence of Lane Detection with LaneIoU
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Hiroto Honda
Yusuke Uchida
286
46
0
15 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
560
15
0
12 May 2023
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
Ming Wang
Jin Yuan
Zhiyong Li
ViT
342
22
0
12 May 2023
Undercover Deepfakes: Detecting Fake Segments in Videos
Sanjay Saha
Rashindrie Perera
Sachith Seneviratne
T. Malepathirana
Sanka Rasnayaka
Deshani Geethika
Terence Sim
Saman K. Halgamuge
DiffM
415
15
0
11 May 2023
Real-time instance segmentation with polygons using an Intersection-over-Union loss
Canadian Conference on Computer and Robot Vision (CRV), 2023
Katia Jodogne-Del Litto
Guillaume-Alexandre Bilodeau
179
4
0
09 May 2023
A Cross-direction Task Decoupling Network for Small Logo Detection
IEEE International Conference on Multimedia and Expo (ICME), 2023
Sujuan Hou
Xingzhuo Li
...
Yuanjie Zheng
Shuqiang Jiang
149
5
0
04 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
271
48
0
29 Apr 2023
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
Xin Chen
Houwen Peng
Jiawen Zhu
Dong Wang
Han Hu
Huchuan Lu
376
27
0
27 Apr 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
258
22
0
24 Apr 2023
Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions
IEEE Reviews in Biomedical Engineering (RBME), 2023
Luyang Luo
Xi Wang
Yi Lin
Xiaoqi Ma
Andong Tan
R. Chan
V. Vardhanabhuti
W. C. Chu
Kwang-Ting Cheng
Hao Chen
438
95
0
13 Apr 2023
Real-time Trajectory-based Social Group Detection
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Simindokht Jahangard
Munawar Hayat
Hamid Rezatofighi
149
7
0
12 Apr 2023
Overload: Latency Attacks on Object Detection for Edge Devices
Computer Vision and Pattern Recognition (CVPR), 2023
Erh-Chung Chen
Pin-Yu Chen
I-Hsin Chung
Che-Rung Lee
AAML
341
19
0
11 Apr 2023
Relational Context Learning for Human-Object Interaction Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Sanghyun Kim
Deunsol Jung
Minsu Cho
278
66
0
11 Apr 2023
StageInteractor: Query-based Object Detector with Cross-stage Interaction
IEEE International Conference on Computer Vision (ICCV), 2023
Yao Teng
Haisong Liu
Sheng Guo
Limin Wang
ObjD
312
12
0
11 Apr 2023
PlantDet: A benchmark for Plant Detection in the Three-Rivers-Source Region
International Conference on Artificial Neural Networks (ICANN), 2023
Huanhuan Li
Xuechao Zou
Yu-an Zhang
Jiangcai Zhaba
Guomei Li
Lamao Yongga
250
0
0
11 Apr 2023
Detection Transformer with Stable Matching
IEEE International Conference on Computer Vision (ICCV), 2023
Siyi Liu
Tianhe Ren
Jia-Yu Chen
Zhaoyang Zeng
Hao Zhang
...
Hongyang Li
Jun Huang
Hang Su
Jun Zhu
Lei Zhang
233
56
0
10 Apr 2023
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Computer Vision and Pattern Recognition (CVPR), 2023
Lewei Yao
Jianhua Han
Xiaodan Liang
Danqian Xu
Wei Zhang
Zhenguo Li
Hang Xu
VLM
ObjD
CLIP
312
104
0
10 Apr 2023
Sky-GVINS: a Sky-segmentation Aided GNSS-Visual-Inertial System for Robust Navigation in Urban Canyons
Geo-Spatial Information Science (GSIS), 2023
Jie Yin
Tao Li
Hao Yin
Wenxian Yu
Danping Zou
172
14
0
08 Apr 2023
DATE: Domain Adaptive Product Seeker for E-commerce
Computer Vision and Pattern Recognition (CVPR), 2023
Haoyuan Li
Haojie Jiang
Tao Jin
Meng-Juan Li
Yan Chen
Zhijie Lin
Yang Zhao
Zhou Zhao
316
6
0
07 Apr 2023
Boundary-Denoising for Video Activity Localization
International Conference on Learning Representations (ICLR), 2023
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Guohao Li
191
15
0
06 Apr 2023
Detecting and Grounding Multi-Modal Media Manipulation
Computer Vision and Pattern Recognition (CVPR), 2023
Rui Shao
Tianxing Wu
Ziwei Liu
272
111
0
05 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
172
9
0
04 Apr 2023
A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS
Machine Learning and Knowledge Extraction (MLKE), 2023
Juan R. Terven
Diana-Margarita Córdova-Esparza
722
2,109
0
02 Apr 2023