Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,203 papers shown
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai
A. Chowdhury
Zihe Wang
Sooyoung Jeon
Jingyan Bai
Jiacheng Hou
Jihyung Kil
Wei-Lun Chao
CoGe
227
4
0
10 Jun 2025
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Yizhen Li
Dell Zhang
Xuelong Li
Yiqing Shen
VLM
188
0
0
09 Jun 2025
MiMo-VL Technical Report
Xiaomi LLM-Core Team
Zihao Yue
Zhenru Lin
Yifan Song
Weikun Wang
...
Di Zhang
Chong Ma
Chang Liu
Can Cai
Bingquan Xia
OffRL MoE VLM LRM
255
14
0
04 Jun 2025
HiLO: High-Level Object Fusion for Autonomous Driving using Transformers
Timo Osterburg
Franz Albers
Christopher P. Diehl
Rajesh Pushparaj
Torsten Bertram
238
1
0
03 Jun 2025
Conformal Object Detection by Sequential Risk Control
Léo Andéol
Luca Mossina
Adrien Mazoyer
Sébastien Gerchinovitz
413
0
0
29 May 2025
CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiali Chen
Xusen Hei
HongFei Liu
Yuancheng Wei
Zikun Deng
Jiayuan Xie
Yi Cai
Li Qing
183
0
0
28 May 2025
Can NeRFs See without Cameras?
Chaitanya Amballa
Sattwik Basu
Yu-Lin Wei
Zhijian Yang
Mehmet Ergezer
Romit Roy Choudhury
192
0
0
28 May 2025
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo
Shengfeng Lou
C. L. Philip Chen
Ziyue Wang
Chenliang Li
...
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Wenshu Fan
AI4TS LRM
248
4
0
27 May 2025
Fully Spiking Neural Networks for Unified Frame-Event Object Tracking
Jingjun Yang
Liangwei Fan
Jinpu Zhang
Xiangkai Lian
Hui Shen
D. Hu
179
0
0
27 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
Shihong Deng
ObjD VLM
193
1
0
27 May 2025
MSA at SemEval-2025 Task 3: High Quality Weak Labeling and LLM Ensemble Verification for Multilingual Hallucination Detection
Baraa Hikal
Ahmed Nasreldin
Ali Hamdi
HILM
122
2
0
27 May 2025
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features
X. Feng
D. Zhang
Shuyan Hu
X. Li
M. Wu
Jie Zhang
Xiaosha Chen
Kexin Huang
189
4
0
26 May 2025
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation
Zuyao Chen
Jinlin Wu
Zhen Lei
Chang Wen Chen
182
0
0
26 May 2025
MLLMs are Deeply Affected by Modality Bias
Xu Zheng
Chenfei Liao
Yuqian Fu
Kaiyu Lei
Yuanhuiyi Lyu
...
Yu Jiang
Andrii Zadaianchuk
Dacheng Tao
Luc Van Gool
Xuming Hu
312
11
0
24 May 2025
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts
Yuchen Zhang
Yaxiong Wang
Yujiao Wu
Lianwei Wu
Li Zhu
Zhedong Zheng
AAML
389
0
0
23 May 2025
Efficient Motion Prompt Learning for Robust Visual Tracking
Jie Zhao
Xin Chen
Yongsheng Yuan
Michael Felsberg
Dong Wang
Huchuan Lu
174
1
0
22 May 2025
Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking
Shiyu Xuan
Zechao Li
Jinhui Tang
305
1
0
19 May 2025
LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking
Martha Teiko Teye
Ori Maoz
Matthias Rottmann
VOT
602
1
0
19 May 2025
Content Generation Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Yuan Zhang
Xinfeng Zhang
Xiaoming Qi
Xinyu Wu
Feng Chen
Guanyu Yang
Huazhu Fu
MedIm LM&MA AI4CE
454
1
0
16 May 2025
Using Cross-Domain Detection Loss to Infer Multi-Scale Information for Improved Tiny Head Tracking
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Jisu Kim
Alex Mattingly
Eung-Joo Lee
Benjamin S. Riggan
167
0
0
14 May 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Yu Cheng
A. Goel
Hakan Bilen
LRM
247
2
0
12 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Computer Vision and Pattern Recognition (CVPR), 2025
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
351
1
0
09 May 2025
CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking
IEEE International Conference on Robotics and Automation (ICRA), 2025
Weihong Li
Xiaoqiong Liu
Heng Fan
L. Zhang
215
2
0
09 May 2025
RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet
Pattern Recognition (Pattern Recogn.), 2025
Eliraz Orfaig
Inna Stainvas
Igal Bilik
313
0
0
05 May 2025
Efficient Vision-based Vehicle Speed Estimation
Journal of Real-Time Image Processing (JRIP), 2025
Andrej Macko
Lukás Gajdosech
Viktor Kocur
901
1
0
02 May 2025
Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging
Elena Mulero Ayllón
Massimiliano Mantegna
Linlin Shen
Paolo Soda
V. Guarrasi
M. Tortora
234
3
0
02 May 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
274
0
0
28 Apr 2025
Improving Open-World Object Localization by Discovering Background
Ashish Singh
Michael Jeffrey Jones
Kuan-Chuan Peng
A. Cherian
Moitreya Chatterjee
Erik Learned-Miller
ObjD OCL VLM
297
0
0
24 Apr 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
315
2
0
23 Apr 2025
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang
Hong Wang
Wenlong Zhang
Kunhua Ji
Dingjiang Huang
Yefeng Zheng
ObjD
366
3
0
22 Apr 2025
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity Recognition
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Jian Zhang
Tianqing Zhang
Qi Li
Hongwei Wang
193
0
0
19 Apr 2025
FocusTrack: A Self-Adaptive Local Sampling Algorithm for Efficient Anti-UAV Tracking
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Ying Wang
Tingfa Xu
Jianan Li
301
11
0
18 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Scandinavian Conference on Image Analysis (SCIA), 2025
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
198
1
0
17 Apr 2025
EarthGPT-X: A Spatial MLLM for Multi-level Multi-Source Remote Sensing Imagery Understanding with Visual Prompting
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Wei Zhang
Miaoxin Cai
Yaqian Ning
Tianze Zhang
Yin Zhuang
He Chen
He Chen
Jun Li
Xuerui Mao
395
0
0
17 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
319
5
0
17 Apr 2025
Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
You Wu
Xucheng Wang
Xiangyang Yang
Mengyuan Liu
Dan Zeng
Hengzhou Ye
Shuiwang Li
253
15
0
12 Apr 2025
Light-YOLOv8-Flame: A Lightweight High-Performance Flame Detection Algorithm
Jiawei Lan
Zhibiao Wang
Haoyang Yu
Ye Tao
Wenhua Cui
405
3
0
11 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
254
3
0
10 Apr 2025
End-to-End Facial Expression Detection in Long Videos
Yini Fang
Alec Diallo
Yiqi Shi
F. Jumelle
Bertram Shi
CVBM
140
0
0
10 Apr 2025
Few-Shot Adaptation of Grounding DINO for Agricultural Domain
Rajhans Singh
Rafael Bidese Puhl
Kshitiz Dhakal
Sudhir Sornapudi
302
3
0
09 Apr 2025
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Christian Schroeder de Witt
Matthias Bethge
Seong Joon Oh
OCL
2.1K
3
0
09 Apr 2025
PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario
Sriram Mandalika
Lalitha V
Athira Nambiar
223
4
0
08 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
408
12
0
04 Apr 2025
COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking
Information Fusion (Inf. Fusion), 2025
Chunhui Zhang
Li Liu
Jialin Gao
Xin Sun
Hao Wen
Xi Zhou
Shiming Ge
Yucheng Wang
284
4
0
02 Apr 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Computer Vision and Pattern Recognition (CVPR), 2025
Kuang Wu
Chuan Yang
Zhanbin Li
280
5
0
27 Mar 2025
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Computer Vision and Pattern Recognition (CVPR), 2025
Hang Zhou
Wei Ji
Rui Ma
Li Cheng
ViT
271
0
0
27 Mar 2025
HierRelTriple: Guiding Indoor Layout Generation with Hierarchical Relationship Triplet Losses
Kaifan Sun
Bingchen Yang
Peter Wonka
Jun Xiao
Haiyong Jiang
290
1
0
26 Mar 2025
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation
Chengbo Yuan
Suraj Joshi
Shaoting Zhu
Hang Su
Hang Zhao
Yang Gao
VGen
321
23
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
458
0
0
24 Mar 2025
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
Wenrui Cai
Qingjie Liu
Longji Xu
MoE
370
3
0
24 Mar 2025