Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
arXiv:1902.09630

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,199 papers shown
Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
ViT
21
0
0
25 Nov 2025
StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections
Matvei Shelukhan
Timur Mamedov
Karina Kvanchiani
VOT
256
0
0
25 Nov 2025
SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation
Ruicong Liu
Yifei Huang
Liangyang Ouyang
Caixin Kang
Yoichi Sato
39
1
0
22 Nov 2025
MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Dheeraj Kulshrestha
R. Ramnath
56
0
0
22 Nov 2025
REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion
Ryoma Yataka
Pu Perry Wang
P. Boufounos
R. Takahashi
57
0
0
21 Nov 2025
Real-Time 3D Object Detection with Inference-Aligned Learning
Chenyu Zhao
Xianwei Zheng
Zimin Xia
Linwei Yue
Nan Xue
3DPC
144
0
0
20 Nov 2025
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Ankita Raj
Chetan Arora
ObjD AAML VLM
153
0
0
16 Nov 2025
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Jinfu Li
Yuqi Huang
Hong Song
Ting Wang
Jianghan Xia
Yucong Lin
Jingfan Fan
Jian Yang
ObjD
138
0
0
13 Nov 2025
Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks
Lingran Song
Yucheng Zhou
Jianbing Shen
52
0
0
10 Nov 2025
SportR: A Benchmark for Multimodal Large Language Model Reasoning in Sports
H. Xia
Haonan Ge
Junbo Zou
Hyun Woo Choi
Xuebin Zhang
...
Shichao Chen
Rhys Tracy
Vicente Ordonez
Weining Shen
H. Chen
ReLM LRM VLM
294
1
0
09 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
108
0
0
08 Nov 2025
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
108
0
0
06 Nov 2025
Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Dongkeun Kim
Minsu Cho
Suha Kwak
52
0
0
05 Nov 2025
RIS-Assisted 3D Spherical Splatting for Object Composition Visualization using Detection Transformers
Anastasios T. Sotiropoulos
Stavros Tsimpoukis
Dimitrios Tyrovolas
S. Ioannidis
P. Diamantoulakis
G. Karagiannidis
C. Liaskos
66
0
0
04 Nov 2025
Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data
Jessica Plassmann
Nicolas Schuler
Georg von Freymann
Michael Schuth
56
1
0
04 Nov 2025
3EED: Ground Everything Everywhere in 3D
Rong Li
Yuhao Dong
Tianshuai Hu
Ao Liang
Youquan Liu
Dongyue Lu
Liang Pan
Lingdong Kong
Junwei Liang
Ziwei Liu
72
3
0
03 Nov 2025
Gaussian Combined Distance: A Generic Metric for Object Detection
IEEE Geoscience and Remote Sensing Letters (GRSL), 2025
Ziqian Guan
Xieyi Fu
Pengjun Huang
Hengyuan Zhang
Hubin Du
Yongtao Liu
Yinglin Wang
Qang Ma
94
0
0
31 Oct 2025
PT-DETR: Small Target Detection Based on Partially-Aware Detail Focus
Bingcong Huo
Zhiming Wang
40
0
0
30 Oct 2025
MELDAE: A Framework for Micro-Expression Spotting, Detection, and Automatic Evaluation in In-the-Wild Conversational Scenes
Yigui Feng
Qinglin Wang
Yang Liu
Ke Liu
Haotian Mo
Enhao Huang
G. Liu
M. Liu
Jie Liu
52
1
0
26 Oct 2025
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Shraman Pramanick
E. Mavroudi
Yale Song
Rama Chellappa
Lorenzo Torresani
Triantafyllos Afouras
124
0
0
19 Oct 2025
Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
Chien Thai
Mai Xuan Trang
Huong Ninh
Hoang Hiep Ly
Anh Son Le
92
0
0
18 Oct 2025
Structured Universal Adversarial Attacks on Object Detection for Video Sequences
Sven Jacob
Weijia Shao
Gjergji Kasneci
AAML
64
0
0
16 Oct 2025
Beat Tracking as Object Detection
Jaehoon Ahn
Moon-Ryul Jung
ObjD
175
0
0
16 Oct 2025
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Siyoon Jin
S. Kim
Dahyun Chung
J. Lee
Hyunwook Choi
Jisu Nam
J. Kim
S. Kim
VGen
74
1
0
08 Oct 2025
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
Angen Ye
Zeyu Zhang
Boyuan Wang
Xiaofeng Wang
Dapeng Zhang
Zheng Hua Zhu
LRM VLM
123
6
0
02 Oct 2025
Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests
Aoduo Li
Peikai Lin
Jiancheng Li
Zhen Zhang
Shiting Wu
Zexiao Liang
Zhifa Jiang
96
0
0
01 Oct 2025
Hierarchy-Aware Neural Subgraph Matching with Enhanced Similarity Measure
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2025
Zhouyang Liu
Ning Liu
Yixin Chen
Jiezhong He
Menghan Jia
Dongsheng Li
95
0
0
01 Oct 2025
Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
Xingtao Ling
Chenlin Fu
Yingying Zhu
72
0
0
30 Sep 2025
Contrastive Diffusion Guidance for Spatial Inverse Problems
Sattwik Basu
Chaitanya Amballa
Zhongweiyang Xu
Jorge Vančo Sampedro
Srihari Nelakuditi
Romit Roy Choudhury
56
0
0
30 Sep 2025
Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
Xue-Feng Zhu
Tianyang Xu
Yifan Pan
Jinjie Gu
Xi Li
Jiwen Lu
Xiao-Jun Wu
Josef Kittler
126
0
0
29 Sep 2025
Talk in Pieces, See in Whole: Disentangling and Hierarchical Aggregating Representations for Language-based Object Detection
Sojung An
Kwanyong Park
Yong Jae Lee
Donghyun Kim
128
0
0
29 Sep 2025
FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation
Mohammed Alsakabi
Wael Mobeirek
John M. Dolan
Ozan K. Tonguz
76
0
0
27 Sep 2025
Real-Time Object Detection Meets DINOv3
Shihua Huang
Yongjie Hou
Longfei Liu
Xuanlong Yu
Xi Shen
ObjD 3DH PINN VLM
282
2
0
25 Sep 2025
Fine-Tuning LLMs to Analyze Multiple Dimensions of Code Review: A Maximum Entropy Regulated Long Chain-of-Thought Approach
Yongda Yu
Guohao Shi
Xianwei Wu
Haochuan He
XueMing Gu
...
Kui Liu
Qiushi Wang
Zhao Tian
Haifeng Shen
Guoping Rong
LRM
96
0
0
25 Sep 2025
Robust RGB-T Tracking via Learnable Visual Fourier Prompt Fine-tuning and Modality Fusion Prompt Generation
Hongtao Yang
Bineng Zhong
Qihua Liang
Zhiruo Zhu
Yaozong Zheng
Ning Li
128
0
0
24 Sep 2025
LayoutAgent: A Vision-Language Agent Guided Compositional Diffusion for Spatial Layout Planning
Zezhong Fan
Xiaohan Li
Luyi Ma
Kai Zhao
Liang Peng
Topojoy Biswas
Evren Körpeoglu
Kaushiki Nag
Kannan Achan
DiffM
138
0
0
24 Sep 2025
SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines
Pamela Osuna-Vargas
Altug Kamacioglu
Dominik F. Aschauer
Petros E. Vlachos
Sercan Alipek
Jochen Triesch
Simon Rumpel
Matthias Kaschube
65
0
0
23 Sep 2025
Surgical-MambaLLM: Mamba2-enhanced Multimodal Large Language Model for VQLA in Robotic Surgery
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Pengfei Hao
Hongqiu Wang
Shuaibo Li
Zhaohu Xing
Guang Yang
Kaishun Wu
Lei Zhu
Mamba
52
1
0
20 Sep 2025
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Wenhuan Lu
Xinyue Song
Wenjun Ke
Zhizhi Yu
Wenhao Yang
Jianguo Wei
ObjD
76
0
0
20 Sep 2025
Robust Object Detection for Autonomous Driving via Curriculum-Guided Group Relative Policy Optimization
Xu Jia
97
0
0
19 Sep 2025
T-SiamTPN: Temporal Siamese Transformer Pyramid Networks for Robust and Efficient UAV Tracking
Hojat Ardi
Amir Jahanshahi
Ali Diba
133
0
0
16 Sep 2025
Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations
Jinjie Shen
Yaxiong Wang
Lechao Cheng
Nan Pu
Zhun Zhong
86
1
0
16 Sep 2025
Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos
Eda B. Özyiğit
ObjD
240
1
0
12 Sep 2025
Hyperspectral Mamba for Hyperspectral Object Tracking
Long Gao
Yunhe Zhang
Yan Jiang
Weiying Xie
Yunsong Li
Mamba
114
0
0
10 Sep 2025
Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Zhuoxu Huang
Mingqi Gao
Jungong Han
80
1
0
09 Sep 2025
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Huy Le
Nhat Chung
Tung Kieu
Jingkang Yang
Ngan Le
VOS OCL
297
1
0
07 Sep 2025
TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery
Feng Shen
Jiaming Cui
Shuai Zhou
Wenqiang Li
Ruifeng Qin
156
0
0
07 Sep 2025
SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
Hongyi Jing
Jiafu Chen
Chen Rao
Ziqiang Dang
Jiajie Teng
...
Shuo Fang
Huaizhong Lin
Rui Lv
Chenguang Ma
Lei Zhao
56
0
0
05 Sep 2025
EdgeAttNet: Towards Barb-Aware Filament Segmentation
Victor Solomon
P. Martens
Jingyu Liu
Rafal Angryk
56
0
0
03 Sep 2025
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
Xizhe Zhang
Jiayang Zhu
MedIm
57
0
0
03 Sep 2025