1902.09630
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,203 papers shown
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo
Manxi Lin
Jialun Pei
He Tang
Yueming Jin
Pheng-Ann Heng
222
4
0
14 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLM
ObjD
301
45
0
14 Apr 2024
SFSORT: Scene Features-based Simple Online Real-Time Tracker
M. M. Morsali
Z. Sharifi
F. Fallah
S. Hashembeiki
H. Mohammadzade
S. B. Shouraki
VOT
243
8
0
11 Apr 2024
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
Lifan Jiang
Zhihui Wang
Changmiao Wang
Ming Li
Jiaxu Leng
DiffM
336
0
0
11 Apr 2024
Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
K. Zou
Yang Bai
Zhihao Chen
Yang Zhou
Yidi Chen
...
Xuedong Yuan
Xiaojing Shen
Huazhu Fu
Yih-Chung Tham
MedIm
397
5
0
10 Apr 2024
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
Chenguang Liu
Guangshuai Gao
Ziyue Huang
Zhenghui Hu
Qingjie Liu
Yunhong Wang
ObjD
354
59
0
09 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
265
4
0
06 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
European Conference on Computer Vision (ECCV), 2024
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
332
57
0
04 Apr 2024
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Tiantian Geng
Teng Wang
Jinming Duan
Yanfu Zhang
Weili Guan
Feng Zheng
Ling Shao
291
2
0
04 Apr 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
Computer Vision and Pattern Recognition (CVPR), 2024
Ho-Joong Kim
Jung-Ho Hong
Heejo Kong
Seong-Whan Lee
219
17
0
03 Apr 2024
EGTR: Extracting Graph from Transformer for Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Jinbae Im
Jeongyeon Nam
Nokyung Park
Hyungmin Lee
Seunghyun Park
ViT
595
52
0
02 Apr 2024
Red-Teaming Segment Anything Model
K. Jankowski
Bartlomiej Sobieski
Mateusz Kwiatkowski
J. Szulc
Michael F. Janik
Hubert Baniecki
P. Biecek
VLM
AAML
201
4
0
02 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
239
28
0
02 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Shiyang Feng
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Jiaming Song
VLM
402
84
0
29 Mar 2024
ENet-21: An Optimized light CNN Structure for Lane Detection
Seyed Rasoul Hosseini
Mohammad Teshnehlab
253
6
0
28 Mar 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
284
17
0
28 Mar 2024
Infrared Small Target Detection with Scale and Location Sensitivity
Qiankun Liu
Rui Liu
Bolun Zheng
Hongkui Wang
Ying Fu
252
130
0
28 Mar 2024
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Qingping Sun
Yanjun Wang
Ailing Zeng
Wanqi Yin
Chen Wei
...
Haiyi Mei
Chi Sing Leung
Ziwei Liu
Lei Yang
Zhongang Cai
3DH
262
39
0
26 Mar 2024
Exploring Dynamic Transformer for Efficient Object Tracking
Jiawen Zhu
Xin Chen
Haiwen Diao
Shuai Li
Jun-Yan He
Chenyang Li
Bin Luo
Dong Wang
Huchuan Lu
406
13
0
26 Mar 2024
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
571
53
0
25 Mar 2024
Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations
IEEE Transactions on Biomedical Engineering (IEEE TBME), 2024
Ruige Zong
Tao Wang
Chunwang Li
Xinlin Zhang
Yuanbin Chen
...
Qixuan Li
Qinquan Gao
Dezhi Kang
Fuxin Lin
Tong Tong
269
0
0
23 Mar 2024
An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning
Open Research Europe (ORE), 2024
Víctor Toscano-Durán
Javier Perera-Lago
Eduardo Paluzo-Hidalgo
Rocio Gonzalez-Diaz
Miguel A. Gutiérrez-Naranjo
Matteo Rucco
219
4
0
22 Mar 2024
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
355
28
0
22 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
361
80
0
21 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
252
60
0
21 Mar 2024
Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition
Sumin Lee
Yooseung Wang
Sangmin Woo
Changick Kim
230
2
0
21 Mar 2024
MaskSAM: Towards Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation
Bin Xie
Hao Tang
Bin Duan
Dawen Cai
Yan Yan
Gady Agam
VLM
MedIm
208
1
0
21 Mar 2024
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments
Yang Yang
Wenhai Wang
Zhe Chen
Jifeng Dai
Liang Zheng
180
6
0
20 Mar 2024
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object Tracking
Xiaoyu Li
Dedong Liu
Lijun Zhao
Yitao Wu
Xian Wu
Jinghan Gao
3DPC
253
12
0
20 Mar 2024
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
Yuxuan Luo
Zekun Wu
Zhouhui Lian
251
1
0
20 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
European Conference on Computer Vision (ECCV), 2024
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
319
17
0
18 Mar 2024
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Computer Vision and Pattern Recognition (CVPR), 2024
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
356
10
0
18 Mar 2024
Cannabis Seed Variant Detection using Faster R-CNN
Toqi Tahamid Sarker
Taminul Islam
Khaled R Ahmed
133
4
0
15 Mar 2024
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Jinxia Xie
Bineng Zhong
Zhiyi Mo
Shengping Zhang
Liangtao Shi
Shuxiang Song
Rongrong Ji
319
113
0
15 Mar 2024
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning
Computer Vision and Pattern Recognition (CVPR), 2024
Lingyi Hong
Shilin Yan
Renrui Zhang
Wanyun Li
Xinyu Zhou
...
Kaixun Jiang
Yiting Chen
Jinglun Li
Zhaoyu Chen
Wenqiang Zhang
VLM
226
118
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained Detectors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
174
0
0
14 Mar 2024
MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion
IEEE International Conference on Robotics and Automation (ICRA), 2024
Arul Selvam Periyasamy
Sven Behnke
3DPC
180
1
0
14 Mar 2024
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Conference on Robot Learning (CoRL), 2024
Anushri Dixit
Zhiting Mei
Meghan Booker
Mariko Storey-Matsutani
Allen Z. Ren
Ola Shorinwa
Anirudha Majumdar
640
10
0
13 Mar 2024
Learning Data Association for Multi-Object Tracking using Only Coordinates
Pattern Recognition (Pattern Recogn.), 2024
Mehdi Miah
Guillaume-Alexandre Bilodeau
Nicolas Saunier
VOT
231
10
0
12 Mar 2024
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies
Nieves Crasto
ObjD
189
14
0
11 Mar 2024
Ada-Tracker: Soft Tissue Tracking via Inter-Frame and Adaptive-Template Matching
IEEE International Conference on Robotics and Automation (ICRA), 2024
Jiaxin Guo
Jiangliu Wang
Zhaoshuo Li
Tongyu Jia
Qi Dou
Yao Xiao
MedIm
228
2
0
11 Mar 2024
Long-Term Visual Object Tracking with Event Cameras: An Associative Memory Augmented Tracker and A Benchmark Dataset
Tianlin Li
Ju Huang
Shiao Wang
Bowei Jiang
Bo Jiang
367
11
0
09 Mar 2024
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
Thang M. Pham
Peijie Chen
Tin Nguyen
Seunghyun Yoon
Trung Bui
VLM
362
11
0
08 Mar 2024
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
European Conference on Computer Vision (ECCV), 2024
Liting Lin
Heng Fan
Zhipeng Zhang
Yaowei Wang
Yong-mei Xu
Haibin Ling
340
89
0
08 Mar 2024
Multi-step Temporal Modeling for UAV Tracking
Xiaoying Yuan
Tingfa Xu
Xincong Liu
Ying Wang
Haolin Qin
Yuqiang Fang
Jianan Li
216
22
0
07 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Chak Tou Leong
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
375
12
0
07 Mar 2024
AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
Mingyuan Li
Tong Jia
Hao Wang
Bowen Ma
Shuyang Lin
Da Cai
Dongyue Chen
ViT
272
53
0
07 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
377
166
0
04 Mar 2024
FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs
Xuannan Liu
Peipei Li
Huaibo Huang
Zekun Li
Xing Cui
Jiahao Liang
Lixiong Qin
Weihong Deng
Zhaofeng He
184
3
0
04 Mar 2024
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
Yunzhuo Sun
Yifang Xu
Zien Xie
Yukun Shu
Sidan Du
317
10
0
03 Mar 2024