Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1902.09630
Cited By
v1
v2 (latest)
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,203 papers shown
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
Haolin Qin
Tingfa Xu
Tianhao Li
Zhenxiang Chen
Tao Feng
Jia-Nan Li
245
8
0
22 Mar 2025
Temporal Action Detection Model Compression by Progressive Block Drop
Computer Vision and Pattern Recognition (CVPR), 2025
Xiaoyong Chen
Yong Guo
Jiaming Liang
Sitong Zhuang
Runhao Zeng
Xiping Hu
302
1
0
21 Mar 2025
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Saket Gurukar
Asim Kadav
VLM
357
1
0
17 Mar 2025
STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans
Shashikant Verma
Harish Katti
Soumyaratna Debnath
Yamuna Swamy
Shanmuganathan Raman
943
1
0
17 Mar 2025
Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection
Bibi Erum Ayesha
T. Satyanarayana Murthy
Palamakula Ramesh Babu
Ramu Kuchipudi
248
3
0
17 Mar 2025
Towards General Multimodal Visual Tracking
Andong Lu
Mai Wen
Jinhu Wang
Yuanzhi Guo
Chenglong Li
Jin Tang
Bin Luo
190
1
0
14 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
1.0K
4
0
14 Mar 2025
Large-scale Pre-training for Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
447
3
0
13 Mar 2025
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
Chaocan Xue
Bineng Zhong
Qihua Liang
Yaozong Zheng
Ning Li
Yuanliang Xue
Shuxiang Song
206
27
0
09 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Kexin Shi
Jiashuo Wang
Nhan Duy Truong
Haicheng Qu
Jing Zhang
414
4
0
09 Mar 2025
Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection
Chandan Kumar Sah
Ankit Kumar Shaw
Xiaoli Lian
Arsalan Shahid Baig
Tuopu Wen
Yunlong Wang
Mengmeng Yang
Ke Wang
293
6
0
08 Mar 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jiawen Zhu
Huayi Tang
Xin Chen
Xinying Wang
Dong Wang
Huchuan Lu
240
13
0
01 Mar 2025
Detection of Customer Interested Garments in Surveillance Video using Computer Vision
International Conference on Computing Communication and Networking Technologies (ICCCNT), 2020
Earnest Paul Ijjina
A. Joshi
Goutham Kanahasabai
120
0
0
01 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OOD
FaML
339
0
0
01 Mar 2025
MITracker: Multi-View Integration for Visual Object Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
Mengjie Xu
Yitao Zhu
Haotian Jiang
Jiaming Li
Zhenrong Shen
...
Haolin Huang
Xinyu Wang
Qing Yang
H. Zhang
Qian Wang
268
2
0
27 Feb 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
297
5
0
27 Feb 2025
LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking
Yunfeng Li
Bo Wang
Ye Li
241
1
0
25 Feb 2025
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
Liangtao Shi
Ting Liu
Xiantao Hu
Yue Hu
Quanjun Yin
Richang Hong
ObjD
384
4
0
24 Feb 2025
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Aryan Jadon
Avinash Patil
Shashank Kumar
SyDa
221
6
0
21 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Tianpeng Liu
385
27
0
21 Feb 2025
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2025
Thomas Froech
Olaf Wysocki
Yan Xia
Junyu Xie
Benedikt Schwab
Zorah Lähner
T. H. Kolbe
129
0
0
20 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
406
0
0
11 Feb 2025
Adaptive Perception for Unified Visual Multi-modal Object Tracking
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2025
Xiantao Hu
Bineng Zhong
Qihua Liang
Zhiyi Mo
Liangtao Shi
Ying Tai
Jian Yang
253
8
0
10 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Applied Sciences (AS), 2022
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
390
39
0
05 Feb 2025
YOLOSCM: An improved YOLO algorithm for cars detection
Applied and Computational Engineering (ACE), 2025
Changhui Deng
Lieyang Chen
Shinan Liu
229
5
0
23 Jan 2025
PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues
IEEE transactions on consumer electronics (IEEE TCE), 2025
Yanchao Wang
Dawei Zhang
Run Li
Zhonglong Zheng
Minglu Li
VOT
233
8
0
20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
368
3
0
18 Jan 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Zhiyong Yang
Pingping Zhang
Huchuan Lu
245
17
0
15 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
199
0
0
13 Jan 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
ACM Multimedia (MM), 2024
Xinyao Liao
Xiaoye Qu
Dangyang Chen
Yuanyuan Fu
320
1
0
10 Jan 2025
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
Maxwell Meyer
Jack Spruyt
298
5
0
08 Jan 2025
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
Asma Alkalbani
Muhammad Saqib
Ahmed Salim Alrawahi
A. Anwar
Chandarnath Adak
Saeed Anwar
199
2
0
07 Jan 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
261
8
0
03 Jan 2025
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Brian Hsuan-Cheng Liao
Chih-Hong Cheng
Hasan Esen
Alois Knoll
EgoV
274
1
0
03 Jan 2025
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Jiangming Wang
DiffM
452
1
0
31 Dec 2024
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Lujia Lv
Di Wu
Yangyi Xia
Jia Wu
Xiaojing Liu
Yi He
171
0
0
31 Dec 2024
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
955
31
0
28 Dec 2024
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jiangnan Yang
Shuangli Liu
Jingjun Wu
Xinyu Su
Nan Hai
Xueli Huang
296
105
0
22 Dec 2024
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiantao Hu
Ying Tai
Xu Zhao
Chen Zhao
Ying Tai
Jun Yu Li
Bineng Zhong
Jian Yang
335
38
0
20 Dec 2024
Robust Tracking via Mamba-based Context-aware Token Learning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jinxia Xie
Bineng Zhong
Qihua Liang
Ning Li
Zhiyi Mo
Shuxiang Song
Mamba
241
17
0
18 Dec 2024
What is YOLOv6? A Deep Insight into the Object Detection Model
Athulya Sundaresan Geetha
3DH
VLM
ObjD
317
6
0
17 Dec 2024
Parallel CPU- and GPU-based connected component algorithms for event building for hybrid pixel detectors
Journal of Instrumentation (JINST), 2024
Tomáš Čelko
František Mráz
Benedikt Bergmann
P. Mánek
173
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
342
2
0
16 Dec 2024
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zhuyang Xie
Yan Yang
Yankai Yu
Jie Wang
Yongquan Jiang
Xiao-Jun Wu
378
2
0
16 Dec 2024
Exploring Enhanced Contextual Information for Video-Level Object Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2024
Ben Kang
Xin Chen
Simiao Lai
Yang Liu
Y. Liu
Dong Wang
Mamba
312
26
0
15 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
235
1
0
14 Dec 2024
Point Cloud to Mesh Reconstruction: Methods, Trade-offs, and Implementation Guide
Fatima Zahra Iguenfer
Achraf Hsain
Hiba Amissa
Yousra Chtouki
3DPC
3DV
320
0
0
14 Dec 2024
Temporal Action Localization with Cross Layer Task Decoupling and Refinement
AAAI Conference on Artificial Intelligence (AAAI), 2024
Qiang Li
Di Liu
Jun Kong
Sen Li
Hui Xu
Jianzhong Wang
342
1
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
480
6
0
12 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Longji Xu
Kuan-Chieh Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
Xinze Wang
VGen
714
5
0
12 Dec 2024
Previous
1
2
3
4
5
...
23
24
25
Next