ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 302 papers shown
Title
Adaptively Bypassing Vision Transformer Blocks for Efficient Visual
  Tracking
Adaptively Bypassing Vision Transformer Blocks for Efficient Visual Tracking
Xiangyang Yang
Dan Zeng
Xucheng Wang
You Wu
Hengzhou Ye
Qijun Zhao
Shuiwang Li
57
3
0
12 Jun 2024
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
Guan-Bo Wang
Zhiming Li
Qingchao Chen
Yang Liu
30
9
0
27 May 2024
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation
  Learning
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning
Zhenyu Wei
Yujie He
Zhanchuan Cai
MDE
35
0
0
23 May 2024
Context-Enhanced Video Moment Retrieval with Large Language Models
Context-Enhanced Video Moment Retrieval with Large Language Models
Weijia Liu
Bo Miao
Jiuxin Cao
Xueling Zhu
Bo Liu
Mehwish Nasim
Ajmal Saeed Mian
29
2
0
21 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
38
0
0
11 May 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
28
1
0
26 Apr 2024
Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic
  Surgery
Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery
Yuyang Sheng
Sophia Bano
Matthew J. Clarkson
Mobarakol Islam
41
6
0
22 Apr 2024
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint
  Moment Retrieval and Highlight Detection
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
Jin Yang
Ping Wei
Huan Li
Ziyang Ren
40
8
0
14 Apr 2024
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation
  in Operating Rooms
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo
Manxi Lin
Jialun Pei
He Tang
Yueming Jin
Pheng-Ann Heng
24
1
0
14 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
29
15
0
04 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
61
33
0
29 Mar 2024
Exploring Dynamic Transformer for Efficient Object Tracking
Exploring Dynamic Transformer for Efficient Object Tracking
Jiawen Zhu
Xin Chen
Haiwen Diao
Shuai Li
Jun-Yan He
Chenyang Li
Bin Luo
Dong Wang
Huchuan Lu
41
2
0
26 Mar 2024
Multiple Object Tracking as ID Prediction
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
53
12
0
25 Mar 2024
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
40
14
0
22 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video
  Object Segmentation
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
37
7
0
18 Mar 2024
Siamese Learning with Joint Alignment and Regression for
  Weakly-Supervised Video Paragraph Grounding
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
36
5
0
18 Mar 2024
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Anushri Dixit
Zhiting Mei
Meghan Booker
Mariko Storey-Matsutani
Mariko Storey-Matsutani
Allen Z. Ren
Ola Shorinwa
Anirudha Majumdar
29
5
0
13 Mar 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
72
1
0
15 Jan 2024
Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale
Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale
Hao Zhang
Shuaijie Zhang
23
68
0
29 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
26
3
0
18 Dec 2023
Overcome the Fear Of Missing Out: Active Sensing UAV Scanning for
  Precision Agriculture
Overcome the Fear Of Missing Out: Active Sensing UAV Scanning for Precision Agriculture
Marios Krestenitis
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Konstantinos Ioannidis
Elias B. Kosmatopoulos
S. Vrochidis
37
6
0
15 Dec 2023
Class-Wise Buffer Management for Incremental Object Detection: An
  Effective Buffer Training Strategy
Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy
Junsu Kim
Sumin Hong
Chanwoo Kim
Jihyeon Kim
Yihalem Yimolal Tiruneh
Jeongwan On
Jihyun Song
Sunhwa Choi
Seungryul Baek
8
3
0
14 Dec 2023
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Kuan-Chih Huang
Yi-Hsuan Tsai
Ming-Hsuan Yang
3DPC
27
4
0
12 Dec 2023
Edge Wasserstein Distance Loss for Oriented Object Detection
Edge Wasserstein Distance Loss for Oriented Object Detection
Yuke Zhu
Yumeng Ruan
Zihua Xiong
Sheng Guo
22
0
0
12 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage
  Multi-Person Multi-Task Human-Centric Perception
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
29
5
0
09 Dec 2023
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph
  Generation via Visual-Concept Alignment and Retention
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
Zuyao Chen
Jinlin Wu
Zhen Lei
Zhaoxiang Zhang
Changwen Chen
25
11
0
18 Nov 2023
POISE: Pose Guided Human Silhouette Extraction under Occlusions
POISE: Pose Guided Human Silhouette Extraction under Occlusions
Arindam Dutta
Rohit Lal
Dripta S. Raychaudhuri
Calvin-Khang Ta
A. Roy-Chowdhury
3DH
CVBM
31
5
0
09 Nov 2023
Being Aware of Localization Accuracy By Generating Predicted-IoU-Guided
  Quality Scores
Being Aware of Localization Accuracy By Generating Predicted-IoU-Guided Quality Scores
Peng Liu
Weibo Wang
Yuhan Guo
Jiubin Tan
16
1
0
23 Sep 2023
DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
Jiawen Zhu
Huayi Tang
Zhi-Qi Cheng
Ju He
Bin Luo
Shihao Qiu
Shengming Li
Huchuan Lu
24
12
0
19 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
29
6
0
09 Sep 2023
YOLO series target detection algorithms for underwater environments
YOLO series target detection algorithms for underwater environments
Chenjie Zhang
Pengcheng Jiao
13
3
0
07 Sep 2023
Temporal Collection and Distribution for Referring Video Object
  Segmentation
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
28
14
0
07 Sep 2023
A Unified Framework for 3D Point Cloud Visual Grounding
A Unified Framework for 3D Point Cloud Visual Grounding
Haojia Lin
Yongdong Luo
Xiawu Zheng
Lijiang Li
Fei Chao
Taisong Jin
Donghao Luo
Yan Wang
Liujuan Cao
Rongrong Ji
19
2
0
23 Aug 2023
HODN: Disentangling Human-Object Feature for HOI Detection
HODN: Disentangling Human-Object Feature for HOI Detection
Shuman Fang
Zhiwen Lin
Ke Yan
Jie Li
Xianming Lin
Rongrong Ji
44
5
0
20 Aug 2023
Language-Guided Diffusion Model for Visual Grounding
Language-Guided Diffusion Model for Visual Grounding
Sijia Chen
Baochun Li
27
5
0
18 Aug 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
24
118
0
25 Jul 2023
MFMAN-YOLO: A Method for Detecting Pole-like Obstacles in Complex
  Environment
MFMAN-YOLO: A Method for Detecting Pole-like Obstacles in Complex Environment
Lei Cai
H. Wang
C. Zhou
Yongqiang Wang
Bo Liu
30
0
0
24 Jul 2023
Enhancing Your Trained DETRs with Box Refinement
Enhancing Your Trained DETRs with Box Refinement
Yiqun Chen
Qiang Chen
Pei Sun
Shoufa Chen
Jingdong Wang
Jian Cheng
30
2
0
21 Jul 2023
Rethinking Intersection Over Union for Small Object Detection in
  Few-Shot Regime
Rethinking Intersection Over Union for Small Object Detection in Few-Shot Regime
Pierre Le Jeune
Anissa Zergaïnoh-Mokraoui
ObjD
14
6
0
17 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
38
15
0
07 Jul 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Hierarchical Open-vocabulary Universal Image Segmentation
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLM
OCL
32
36
0
03 Jul 2023
Concurrent ischemic lesion age estimation and segmentation of CT brain
  using a Transformer-based network
Concurrent ischemic lesion age estimation and segmentation of CT brain using a Transformer-based network
A. Marcus
P. Bentley
Daniel Rueckert
MedIm
8
9
0
21 Jun 2023
CrossKD: Cross-Head Knowledge Distillation for Object Detection
CrossKD: Cross-Head Knowledge Distillation for Object Detection
Jiabao Wang
Yuming Chen
Zhaohui Zheng
Xiang Li
Ming-Ming Cheng
Qibin Hou
38
32
0
20 Jun 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
16
30
0
25 May 2023
Two-Stream Regression Network for Dental Implant Position Prediction
Two-Stream Regression Network for Dental Implant Position Prediction
Xinquan Yang
Xuguang Li
Xuechen Li
Wenting Chen
Linlin Shen
X. Li
Yongqiang Deng
18
6
0
17 May 2023
Understanding 3D Object Interaction from a Single Image
Understanding 3D Object Interaction from a Single Image
Shengyi Qian
David Fouhey
26
15
0
16 May 2023
CLRerNet: Improving Confidence of Lane Detection with LaneIoU
CLRerNet: Improving Confidence of Lane Detection with LaneIoU
Hiroto Honda
Yusuke Uchida
15
27
0
15 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
32
13
0
24 Apr 2023
Real-time Trajectory-based Social Group Detection
Real-time Trajectory-based Social Group Detection
Simindokht Jahangard
Munawar Hayat
Hamid Rezatofighi
11
2
0
12 Apr 2023
Overload: Latency Attacks on Object Detection for Edge Devices
Overload: Latency Attacks on Object Detection for Edge Devices
Erh-Chung Chen
Pin-Yu Chen
I-Hsin Chung
Che-Rung Lee
AAML
36
12
0
11 Apr 2023
Previous
1234567
Next