Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,203 papers shown
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Huy Le
Nhat Chung
Tung Kieu
Jingkang Yang
Ngan Le
VOS, OCL
329
1
0
07 Sep 2025
TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery
Feng Shen
Jiaming Cui
Shuai Zhou
Wenqiang Li
Ruifeng Qin
164
0
0
07 Sep 2025
SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
Hongyi Jing
Jiafu Chen
Chen Rao
Ziqiang Dang
Jiajie Teng
...
Shuo Fang
Huaizhong Lin
Rui Lv
Chenguang Ma
Lei Zhao
60
0
0
05 Sep 2025
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
Xizhe Zhang
Jiayang Zhu
MedIm
65
0
0
03 Sep 2025
EdgeAttNet: Towards Barb-Aware Filament Segmentation
Victor Solomon
P. Martens
Jingyu Liu
Rafal Angryk
68
0
0
03 Sep 2025
HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization
Joohyun Chang
Soyeon Hong
Hyogun Lee
Seong Jong Ha
Dongho Lee
Seong Tae Kim
Jinwoo Choi
EgoV
200
1
0
30 Aug 2025
HCCM: Hierarchical Cross-Granularity Contrastive and Matching Learning for Natural Language-Guided Drones
Hao Ruan
Jinliang Lin
Yingxin Lai
Zhiming Luo
Shaozi Li
VLM
132
1
0
29 Aug 2025
Few-Shot Pattern Detection via Template Matching and Regression
Eunchan Jo
Dahyun Kang
Sanghyun Kim
Yunseon Choi
Minsu Cho
98
0
0
25 Aug 2025
Explain Before You Answer: A Survey on Compositional Visual Reasoning
Fucai Ke
Joy Hsu
Zhixi Cai
Zixian Ma
Xin Zheng
...
P. D. Haghighi
Gholamreza Haffari
Ranjay Krishna
Jiajun Wu
H. Rezatofighi
ReLM, CoGe, LRM
296
6
0
24 Aug 2025
RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis
Anku Rani
Aparna Garimella
Apoorv Saxena
Balaji Vasan Srinivasan
Paul Pu Liang
86
0
0
23 Aug 2025
Aligning Moments in Time using Video Queries
Yogesh Kumar
Uday Agarwal
Manish Gupta
Anand Mishra
235
1
0
21 Aug 2025
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
Han Li
Shaofei Huang
Longfei Xu
Yulu Gao
Beipeng Mu
Si Liu
72
0
0
21 Aug 2025
Inter-Class Relational Loss for Small Object Detection: A Case Study on License Plates
Dian Ning
Dong Seog Han
72
0
0
20 Aug 2025
You Only Pose Once: A Minimalist's Detection Transformer for Monocular RGB Category-level 9D Multi-Object Pose Estimation
Hakjin Lee
Junghoon Seo
Jaehoon Sim
76
1
0
20 Aug 2025
Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model
Ruixin Zhang
Jiaqing Fan
Yifan Liao
Qian Qiao
Fanzhang Li
DiffM, VOS
202
0
0
19 Aug 2025
CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction
Zhiwei Ning
Zhaojiang Liu
Xuanang Gao
Yifan Zuo
Jie Yang
Yuming Fang
Wei Liu
3DPC
76
1
0
18 Aug 2025
Real-Time Beach Litter Detection and Counting: A Comparative Analysis of RT-DETR Model Variants
Miftahul Huda
Arsyiah Azahra
Putri Maulida Chairani
Dimas Rizky Ramadhani
Nabila Azhari
Ade Lailani
114
0
0
18 Aug 2025
LayoutRectifier: An Optimization-based Post-processing for Graphic Design Layout Generation
I-Chao Shen
Ariel Shamir
Takeo Igarashi
144
2
0
15 Aug 2025
Colon Polyps Detection from Colonoscopy Images Using Deep Learning
Md Al Amin
Bikash Kumar Paul
72
0
0
14 Aug 2025
IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
Junxian Li
Beining Xu
Simin Chen
Jiatong Li
Jingdi Lei
Haodong Zhao
Di Zhang
AAML
148
0
0
13 Aug 2025
EventRR: Event Referential Reasoning for Referring Video Object Segmentation
Huihui Xu
Jiashi Lin
Haoyu Chen
Junjun He
Lei Zhu
VOS
271
0
0
10 Aug 2025
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
Noreen Anwar
Guillaume-Alexandre Bilodeau
W. Bouachir
65
0
0
06 Aug 2025
Infrared Object Detection with Ultra Small ConvNets: Is ImageNet Pretraining Still Useful?
Srikanth Muralidharan
H. R. Medeiros
Masih Aminbeidokhti
Eric Granger
M. Pedersoli
101
0
0
04 Aug 2025
DMTrack: Spatio-Temporal Multimodal Tracking via Dual-Adapter
Weihong Li
Shaohua Dong
Haonan Lu
Yanhao Zhang
Heng Fan
L. Zhang
89
0
0
03 Aug 2025
RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving
Jiayuan Wang
Q. M. Jonathan Wu
Katsuya Suto
Ning Zhang
137
2
0
02 Aug 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
198
3
0
31 Jul 2025
Decoupled Spatio-Temporal Consistency Learning for Self-Supervised Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yaozong Zheng
Bineng Zhong
Qihua Liang
Ning Li
Shuxiang Song
200
23
0
29 Jul 2025
Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
Nils Hütten
Florian Hölken
Hasan Tercan
Tobias Meisen
MedIm
132
0
0
29 Jul 2025
Towards Universal Modal Tracking with Online Dense Temporal Token Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Yaozong Zheng
Bineng Zhong
Qihua Liang
Shengping Zhang
Guorong Li
Xianxian Li
Rongrong Ji
133
19
0
27 Jul 2025
JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection
Zhiming Liu
P. Hill
Qirui Yang
202
1
0
26 Jul 2025
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
X. Feng
Shuyan Hu
X. Li
D. Zhang
M. Wu
Jie Zhang
Xiaosha Chen
K. Huang
142
3
0
26 Jul 2025
ABCD: Automatic Blood Cell Detection via Attention-Guided Improved YOLOX
Ahmed Endris Hasen
Yang Shangming
Chiagoziem C. Ukwuoma
Biniyam Gashaw
Abel Zenebe Yutra
113
0
0
25 Jul 2025
Demystify Protein Generation with Hierarchical Conditional Diffusion Models
Zinan Ling
Yi Shi
Da Yan
Yang Zhou
Bo Hui
DiffM
215
0
0
24 Jul 2025
Explicit Context Reasoning with Supervision for Visual Tracking
Fansheng Zeng
Bineng Zhong
Haiying Xia
Yufei Tan
Xiantao Hu
Liangtao Shi
Shuxiang Song
137
2
0
22 Jul 2025
LDRFusion: A LiDAR-Dominant multimodal refinement framework for 3D object detection
Jijun Wang
Yan Wu
Yujian Mo
Siyue Tao
Jun Yan
Yinghao Hu
3DPC
131
1
0
22 Jul 2025
InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization
Haoyuan Liu
Hiroshi Watanabe
126
2
0
16 Jul 2025
Deep Generative Methods and Tire Architecture Design
Fouad Oubari
Raphael Meunier
Rodrigue Décatoire
Mathilde Mougeot
DiffM, AI4CE
162
0
0
15 Jul 2025
ESG-Net: Event-Aware Semantic Guided Network for Dense Audio-Visual Event Localization
Huilai Li
Yonghao Dang
Ying Xing
Yiming Wang
Jianqin Yin
135
0
0
14 Jul 2025
When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish Tracking
Weiran Li
Yeqiang Liu
Qiannan Guo
Yijie Wei
Hwa Liang Leo
Zhenbo Li
122
0
0
08 Jul 2025
Boosting Temporal Sentence Grounding via Causal Inference
Kefan Tang
Lihuo He
Jisheng Dang
Xinbo Gao
OOD, CML
197
0
0
07 Jul 2025
DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection
P. Hill
Zhiming Liu
A. Achim
Dave Bull
Qirui Yang
109
0
0
06 Jul 2025
WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image
Jiwoo Park
Tae Eun Choi
Youngjun Jun
Seong Jae Hwang
DiffM
183
0
0
30 Jun 2025
Grounding DINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models
IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control (IEEE TUFFC), 2025
Hamza Rasaee
Taha Koleilat
H. Rivaz
145
1
0
30 Jun 2025
Learning Frequency and Memory-Aware Prompts for Multi-Modal Object Tracking
Boyue Xu
Ruichao Hou
Tongwei Ren
Dongming Zhou
Gangshan Wu
Jinde Cao
126
0
0
30 Jun 2025
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning
Biao Wang
Wenwen Li
Jiawei Ge
221
4
0
27 Jun 2025
CSDN: A Context-Gated Self-Adaptive Detection Network for Real-Time Object Detection
Haolin Wei
ObjD
229
0
0
21 Jun 2025
PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning
Yizhe Li
Sanping Zhou
Zheng Qin
Le Wang
ViT
160
0
0
19 Jun 2025
BRISC: Annotated Dataset for Brain Tumor Segmentation and Classification
Amirreza Fateh
Yasin Rezvani
Sara Moayedi
Sadjad Rezvani
Fatemeh Fateh
Mansoor Fateh
Vahid Abolghasemi
290
6
0
17 Jun 2025
Text-Aware Image Restoration with Diffusion Models
Jaewon Min
J. Kim
Paul Hyunbin Cho
J. Lee
Jihye Park
Minkyu Park
S. Kim
Hyunhee Park
Seungryong Kim
254
1
0
11 Jun 2025
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective
Robert-Jan Bruintjes
A. Lengyel
O. Kayhan
Davide Zambrano
Nergis Tomen
Hadi Jamali Rad
Jan van Gemert
VLM
161
0
0
10 Jun 2025