ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression
v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXiv (abs)PDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,203 papers shown
Reinforcement Learning for Large Model: A Survey
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
308
2
0
24 Dec 2025
CauSight: Learning to Supersense for Visual Causal Discovery
Yize Zhang
M. Chen
Sirui Chen
Bo Peng
Y. Zhang
Tianyu Li
Chaochao Lu
CMLReLMLRM
141
0
0
01 Dec 2025
InstanceV: Instance-Level Video Generation
InstanceV: Instance-Level Video Generation
Yuheng Chen
Teng Hu
Jiangning Zhang
Zhucun Xue
Ran Yi
Lizhuang Ma
DiffMVGen
120
0
0
28 Nov 2025
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Dheeraj Kulshrestha
R. Ramnath
VGen
130
0
0
27 Nov 2025
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Shubhankar Borse
Phuc Pham
Farzad Farhadzadeh
Seokeon Choi
P. Nguyen
Anh Tran
Sungrack Yun
Munawar Hayat
Fatih Porikli
76
0
0
27 Nov 2025
Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking
Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
ViT
83
0
0
25 Nov 2025
StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections
StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections
Matvei Shelukhan
Timur Mamedov
Karina Kvanchiani
VOT
403
0
0
25 Nov 2025
SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation
SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation
Ruicong Liu
Yifei Huang
Liangyang Ouyang
Caixin Kang
Yoichi Sato
95
1
0
22 Nov 2025
MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use
MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Dheeraj Kulshrestha
R. Ramnath
104
0
0
22 Nov 2025
REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion
REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion
Ryoma Yataka
Pu Perry Wang
P. Boufounos
R. Takahashi
113
0
0
21 Nov 2025
Real-Time 3D Object Detection with Inference-Aligned Learning
Chenyu Zhao
Xianwei Zheng
Zimin Xia
Linwei Yue
Nan Xue
3DPC
232
0
0
20 Nov 2025
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Ankita Raj
Chetan Arora
ObjDAAMLVLM
253
0
0
16 Nov 2025
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Jinfu Li
Yuqi Huang
Hong Song
Ting Wang
Jianghan Xia
Yucong Lin
Jingfan Fan
Jian Yang
ObjD
222
0
0
13 Nov 2025
Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks
Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks
Lingran Song
Yucheng Zhou
Jianbing Shen
107
0
0
10 Nov 2025
SportR: A Benchmark for Multimodal Large Language Model Reasoning in Sports
SportR: A Benchmark for Multimodal Large Language Model Reasoning in Sports
H. Xia
Haonan Ge
Junbo Zou
Hyun Woo Choi
Xuebin Zhang
...
Shichao Chen
Rhys Tracy
Vicente Ordonez
Weining Shen
H. Chen
ReLMLRMVLM
423
1
0
09 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
148
0
0
08 Nov 2025
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
192
0
0
06 Nov 2025
Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Dongkeun Kim
Minsu Cho
Suha Kwak
92
0
0
05 Nov 2025
RIS-Assisted 3D Spherical Splatting for Object Composition Visualization using Detection Transformers
RIS-Assisted 3D Spherical Splatting for Object Composition Visualization using Detection Transformers
Anastasios T. Sotiropoulos
Stavros Tsimpoukis
Dimitrios Tyrovolas
S. Ioannidis
P. Diamantoulakis
G. Karagiannidis
C. Liaskos
86
0
0
04 Nov 2025
Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data
Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data
Jessica Plassmann
Nicolas Schuler
Georg von Freymann
Michael Schuth
92
1
0
04 Nov 2025
3EED: Ground Everything Everywhere in 3D
3EED: Ground Everything Everywhere in 3D
Rong Li
Yuhao Dong
Tianshuai Hu
Ao Liang
Youquan Liu
Dongyue Lu
Liang Pan
Lingdong Kong
Junwei Liang
Ziwei Liu
116
4
0
03 Nov 2025
Gaussian Combined Distance: A Generic Metric for Object Detection
Gaussian Combined Distance: A Generic Metric for Object DetectionIEEE Geoscience and Remote Sensing Letters (GRSL), 2025
Ziqian Guan
Xieyi Fu
Pengjun Huang
Hengyuan Zhang
Hubin Du
Yongtao Liu
Yinglin Wang
Qang Ma
162
1
0
31 Oct 2025
PT-DETR: Small Target Detection Based on Partially-Aware Detail Focus
PT-DETR: Small Target Detection Based on Partially-Aware Detail Focus
Bingcong Huo
Zhiming Wang
88
0
0
30 Oct 2025
MELDAE: A Framework for Micro-Expression Spotting, Detection, and Automatic Evaluation in In-the-Wild Conversational Scenes
MELDAE: A Framework for Micro-Expression Spotting, Detection, and Automatic Evaluation in In-the-Wild Conversational Scenes
Yigui Feng
Qinglin Wang
Yang Liu
Ke Liu
Haotian Mo
Enhao Huang
G. Liu
M. Liu
Jie Liu
72
1
0
26 Oct 2025
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Shraman Pramanick
E. Mavroudi
Yale Song
Rama Chellappa
Lorenzo Torresani
Triantafyllos Afouras
180
0
0
19 Oct 2025
Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
Chien Thai
Mai Xuan Trang
Huong Ninh
Hoang Hiep Ly
Anh Son Le
145
0
0
18 Oct 2025
Beat Tracking as Object Detection
Beat Tracking as Object Detection
Jaehoon Ahn
Moon-Ryul Jung
ObjD
239
0
0
16 Oct 2025
Structured Universal Adversarial Attacks on Object Detection for Video Sequences
Structured Universal Adversarial Attacks on Object Detection for Video Sequences
Sven Jacob
Weijia Shao
Gjergji Kasneci
AAML
100
0
0
16 Oct 2025
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Siyoon Jin
S. Kim
Dahyun Chung
J. Lee
Hyunwook Choi
Jisu Nam
J. Kim
S. Kim
VGen
106
1
0
08 Oct 2025
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
Angen Ye
Zeyu Zhang
Boyuan Wang
Xiaofeng Wang
Dapeng Zhang
Zheng Hua Zhu
LRMVLM
147
7
0
02 Oct 2025
Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests
Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests
Aoduo Li
Peikai Lin
Jiancheng Li
Zhen Zhang
Shiting Wu
Zexiao Liang
Zhifa Jiang
152
0
0
01 Oct 2025
Hierarchy-Aware Neural Subgraph Matching with Enhanced Similarity Measure
Hierarchy-Aware Neural Subgraph Matching with Enhanced Similarity MeasureIEEE Transactions on Knowledge and Data Engineering (TKDE), 2025
Zhouyang Liu
Ning Liu
Yixin Chen
Jiezhong He
Menghan Jia
Dongsheng Li
125
0
0
01 Oct 2025
Contrastive Diffusion Guidance for Spatial Inverse Problems
Contrastive Diffusion Guidance for Spatial Inverse Problems
Sattwik Basu
Chaitanya Amballa
Zhongweiyang Xu
Jorge Vančo Sampedro
Srihari Nelakuditi
Romit Roy Choudhury
88
0
0
30 Sep 2025
Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
Xingtao Ling
Chenlin Fu
Yingying Zhu
96
0
0
30 Sep 2025
Talk in Pieces, See in Whole: Disentangling and Hierarchical Aggregating Representations for Language-based Object Detection
Talk in Pieces, See in Whole: Disentangling and Hierarchical Aggregating Representations for Language-based Object Detection
Sojung An
Kwanyong Park
Yong Jae Lee
Donghyun Kim
156
0
0
29 Sep 2025
Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
Xue-Feng Zhu
Tianyang Xu
Yifan Pan
Jinjie Gu
Xi Li
Jiwen Lu
Xiao-Jun Wu
Josef Kittler
186
0
0
29 Sep 2025
FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation
FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation
Mohammed Alsakabi
Wael Mobeirek
John M. Dolan
Ozan K. Tonguz
140
0
0
27 Sep 2025
Real-Time Object Detection Meets DINOv3
Real-Time Object Detection Meets DINOv3
Shihua Huang
Yongjie Hou
Longfei Liu
Xuanlong Yu
Xi Shen
ObjD3DHPINNVLM
364
5
0
25 Sep 2025
Fine-Tuning LLMs to Analyze Multiple Dimensions of Code Review: A Maximum Entropy Regulated Long Chain-of-Thought Approach
Fine-Tuning LLMs to Analyze Multiple Dimensions of Code Review: A Maximum Entropy Regulated Long Chain-of-Thought Approach
Yongda Yu
Guohao Shi
Xianwei Wu
Haochuan He
XueMing Gu
...
Kui Liu
Qiushi Wang
Zhao Tian
Haifeng Shen
Guoping Rong
LRM
136
0
0
25 Sep 2025
Robust RGB-T Tracking via Learnable Visual Fourier Prompt Fine-tuning and Modality Fusion Prompt Generation
Robust RGB-T Tracking via Learnable Visual Fourier Prompt Fine-tuning and Modality Fusion Prompt Generation
Hongtao Yang
Bineng Zhong
Qihua Liang
Zhiruo Zhu
Yaozong Zheng
Ning Li
157
0
0
24 Sep 2025
LayoutAgent: A Vision-Language Agent Guided Compositional Diffusion for Spatial Layout Planning
LayoutAgent: A Vision-Language Agent Guided Compositional Diffusion for Spatial Layout Planning
Zezhong Fan
Xiaohan Li
Luyi Ma
Kai Zhao
Liang Peng
Topojoy Biswas
Evren Körpeoglu
Kaushiki Nag
Kannan Achan
DiffM
170
0
0
24 Sep 2025
SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines
SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines
Pamela Osuna-Vargas
Altug Kamacioglu
Dominik F. Aschauer
Petros E. Vlachos
Sercan Alipek
Jochen Triesch
Simon Rumpel
Matthias Kaschube
105
0
0
23 Sep 2025
Surgical-MambaLLM: Mamba2-enhanced Multimodal Large Language Model for VQLA in Robotic Surgery
Surgical-MambaLLM: Mamba2-enhanced Multimodal Large Language Model for VQLA in Robotic SurgeryInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Pengfei Hao
Hongqiu Wang
Shuaibo Li
Zhaohu Xing
Guang Yang
Kaishun Wu
Lei Zhu
Mamba
108
1
0
20 Sep 2025
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Wenhuan Lu
Xinyue Song
Wenjun Ke
Zhizhi Yu
Wenhao Yang
Jianguo Wei
ObjD
88
0
0
20 Sep 2025
Robust Object Detection for Autonomous Driving via Curriculum-Guided Group Relative Policy Optimization
Robust Object Detection for Autonomous Driving via Curriculum-Guided Group Relative Policy Optimization
Xu Jia
129
0
0
19 Sep 2025
T-SiamTPN: Temporal Siamese Transformer Pyramid Networks for Robust and Efficient UAV Tracking
T-SiamTPN: Temporal Siamese Transformer Pyramid Networks for Robust and Efficient UAV Tracking
Hojat Ardi
Amir Jahanshahi
Ali Diba
153
0
0
16 Sep 2025
Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations
Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations
Jinjie Shen
Yaxiong Wang
Lechao Cheng
Nan Pu
Zhun Zhong
142
1
0
16 Sep 2025
Towards Understanding Visual Grounding in Visual Language Models
Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos
Eda B. Özyiğit
ObjD
300
3
0
12 Sep 2025
Hyperspectral Mamba for Hyperspectral Object Tracking
Hyperspectral Mamba for Hyperspectral Object Tracking
Long Gao
Yunhe Zhang
Yan Jiang
Weiying Xie
Yunsong Li
Mamba
130
0
0
10 Sep 2025
Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Zhuoxu Huang
Mingqi Gao
Jungong Han
136
1
0
09 Sep 2025
1234...232425
Next