ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression
v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXiv (abs)PDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,204 papers shown
Single-Stage Visual Query Localization in Egocentric Videos
Single-Stage Visual Query Localization in Egocentric VideosNeural Information Processing Systems (NeurIPS), 2023
Hanwen Jiang
Santhosh Kumar Ramakrishnan
Kristen Grauman
269
21
0
15 Jun 2023
Relation-Aware Diffusion Model for Controllable Poster Layout Generation
Relation-Aware Diffusion Model for Controllable Poster Layout GenerationInternational Conference on Information and Knowledge Management (CIKM), 2023
Fengheng Li
An Liu
Wei Feng
Honghe Zhu
Yaoyu Li
...
Jingjing Lv
Xin Zhu
Jun-Jun Shen
Zhangang Lin
Jingping Shao
187
41
0
15 Jun 2023
World-to-Words: Grounded Open Vocabulary Acquisition through Fast
  Mapping in Vision-Language Models
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Ziqiao Ma
Jiayi Pan
J. Chai
ObjDVLM
218
12
0
14 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Johannes Czech
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
504
25
0
14 Jun 2023
Single-Stage Visual Relationship Learning using Conditional Queries
Single-Stage Visual Relationship Learning using Conditional QueriesNeural Information Processing Systems (NeurIPS), 2023
Alakh Desai
Tz-Ying Wu
Subarna Tripathi
Nuno Vasconcelos
224
9
0
09 Jun 2023
SparseTrack: Multi-Object Tracking by Performing Scene Decomposition
  based on Pseudo-Depth
SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth
Zelin Liu
Xinggang Wang
Cheng Wang
Wenyu Liu
X. Bai
VOSVOT
378
97
0
08 Jun 2023
Object Detection with Transformers: A Review
Object Detection with Transformers: A ReviewItalian National Conference on Sensors (INS), 2023
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViTMU
423
58
0
07 Jun 2023
Language Adaptive Weight Generation for Multi-task Visual Grounding
Language Adaptive Weight Generation for Multi-task Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2023
Wei Su
Peihan Miao
Huanzhang Dou
Gaoang Wang
Liang Qiao
Zheyang Li
Xi Li
ObjD
297
50
0
06 Jun 2023
Student Classroom Behavior Detection based on Improved YOLOv7
Student Classroom Behavior Detection based on Improved YOLOv7
Fan Yang
149
12
0
06 Jun 2023
Cross-Domain Car Detection Model with Integrated Convolutional Block
  Attention Mechanism
Cross-Domain Car Detection Model with Integrated Convolutional Block Attention MechanismImage and Vision Computing (IVC), 2023
Haoxuan Xu
Songning Lai
Xianyang Li
Y. Yang
ViT
263
17
0
31 May 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning
  Challenges
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
203
9
0
31 May 2023
Table Detection for Visually Rich Document Images
Table Detection for Visually Rich Document ImagesKnowledge-Based Systems (KBS), 2023
Bin Xiao
Murat Simsek
B. Kantarci
Ala Abu Alkheir
197
11
0
30 May 2023
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training
  for Document Understanding
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yi Tu
Ya Guo
Huan Chen
Jinyang Tang
209
22
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language ModelsInternational Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjDVLMMLLM
328
142
0
29 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Guoying Gu
Yu Qiao
Hao Dong
Zhongjiang He
Shiyang Feng
VOS
337
58
0
25 May 2023
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
Semi-Supervised and Long-Tailed Object Detection with CascadeMatchInternational Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Kaiyang Zhou
Chen Huang
Chen Change Loy
238
14
0
24 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of
  Diffusion Models
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
330
92
0
23 May 2023
A comprehensive theoretical framework for the optimization of neural
  networks classification performance with respect to weighted metrics
A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metricsOptimization Letters (Optim. Lett.), 2023
Francesco Marchetti
Sabrina Guastavino
C. Campi
F. Benvenuto
Michele Piana
127
2
0
22 May 2023
nnDetection for Intracranial Aneurysms Detection and Localization
nnDetection for Intracranial Aneurysms Detection and Localization
Zehan Zhang
Negar Firoozeh
Shaojun Xia
Mahmud Mossa-Basha
Chengcheng Zhu
73
3
0
22 May 2023
Boosting Long-tailed Object Detection via Step-wise Learning on
  Smooth-tail Data
Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail DataIEEE International Conference on Computer Vision (ICCV), 2023
Na Dong
Yongqiang Zhang
Mingli Ding
G. Lee
203
6
0
22 May 2023
UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model
UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything ModelPattern Recognition (Pattern Recogn.), 2023
Zhenghao Zhang
Shengfan Zhang
Zhichao Wei
Zuozhuo Dai
Siyu Zhu
VOSVLM
304
21
0
22 May 2023
Surgical-VQLA: Transformer with Gated Vision-Language Embedding for
  Visual Question Localized-Answering in Robotic Surgery
Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic SurgeryIEEE International Conference on Robotics and Automation (ICRA), 2023
Long Bai
Mobarakol Islam
Lalithkumar Seenivasan
Hongliang Ren
185
52
0
19 May 2023
Selecting Learnable Training Samples is All DETRs Need in Crowded
  Pedestrian Detection
Selecting Learnable Training Samples is All DETRs Need in Crowded Pedestrian DetectionACM Multimedia (ACM MM), 2023
Feng Gao
Jiaxu Leng
Ji Gan
Xinbo Gao
ViT
221
7
0
18 May 2023
Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Rethinking Boundary Discontinuity Problem for Oriented Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Hang Xu
Xinyuan Liu
Haonan Xu
Yike Ma
Zunjie Zhu
C. Yan
Feng Dai
175
25
0
17 May 2023
Two-Stream Regression Network for Dental Implant Position Prediction
Two-Stream Regression Network for Dental Implant Position PredictionExpert systems with applications (ESWA), 2023
Xinquan Yang
Xuguang Li
Xuechen Li
Wenting Chen
Linlin Shen
Xuzhao Li
Yongqiang Deng
260
15
0
17 May 2023
Understanding 3D Object Interaction from a Single Image
Understanding 3D Object Interaction from a Single ImageIEEE International Conference on Computer Vision (ICCV), 2023
Shengyi Qian
David Fouhey
364
32
0
16 May 2023
CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual GroundingIEEE transactions on multimedia (IEEE TMM), 2023
Linhui Xiao
Xiaoshan Yang
Fang Peng
Ming Yan
Yaowei Wang
Changsheng Xu
ObjDVLM
472
62
0
15 May 2023
CLRerNet: Improving Confidence of Lane Detection with LaneIoU
CLRerNet: Improving Confidence of Lane Detection with LaneIoUIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Hiroto Honda
Yusuke Uchida
286
46
0
15 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated
  Detection Transformer
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection TransformerIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
560
15
0
12 May 2023
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for
  Monocular 3D Object Detection
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object DetectionIEEE Transactions on Intelligent Vehicles (TIV), 2023
Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
Ming Wang
Jin Yuan
Zhiyong Li
ViT
342
22
0
12 May 2023
Undercover Deepfakes: Detecting Fake Segments in Videos
Undercover Deepfakes: Detecting Fake Segments in Videos
Sanjay Saha
Rashindrie Perera
Sachith Seneviratne
T. Malepathirana
Sanka Rasnayaka
Deshani Geethika
Terence Sim
Saman K. Halgamuge
DiffM
415
15
0
11 May 2023
Real-time instance segmentation with polygons using an
  Intersection-over-Union loss
Real-time instance segmentation with polygons using an Intersection-over-Union lossCanadian Conference on Computer and Robot Vision (CRV), 2023
Katia Jodogne-Del Litto
Guillaume-Alexandre Bilodeau
179
4
0
09 May 2023
A Cross-direction Task Decoupling Network for Small Logo Detection
A Cross-direction Task Decoupling Network for Small Logo DetectionIEEE International Conference on Multimedia and Expo (ICME), 2023
Hou
Sujuan Hou
Li
Xingzhuo Li
Min
...
Jing
Zheng
Yuanjie Zheng
Jiang
Shuqiang Jiang
149
5
0
04 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal
  Transformer
MH-DETR: Video Moment and Highlight Detection with Cross-modal TransformerIEEE International Joint Conference on Neural Network (IJCNN), 2023
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
271
48
0
29 Apr 2023
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual
  Object Tracking
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
Xin Chen
Houwen Peng
Jiawen Zhu
Dong Wang
Han Hu
Huchuan Lu
376
27
0
27 Apr 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
End-to-End Spatio-Temporal Action Localisation with Video TransformersComputer Vision and Pattern Recognition (CVPR), 2023
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
258
22
0
24 Apr 2023
Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future
  Directions
Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future DirectionsIEEE Reviews in Biomedical Engineering (RBME), 2023
Luyang Luo
Xi Wang
Yi Lin
Xiaoqi Ma
Andong Tan
R. Chan
V. Vardhanabhuti
W. C. Chu
Kwang-Ting Cheng
Hao Chen
438
95
0
13 Apr 2023
Real-time Trajectory-based Social Group Detection
Real-time Trajectory-based Social Group DetectionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Simindokht Jahangard
Munawar Hayat
Hamid Rezatofighi
149
7
0
12 Apr 2023
Overload: Latency Attacks on Object Detection for Edge Devices
Overload: Latency Attacks on Object Detection for Edge DevicesComputer Vision and Pattern Recognition (CVPR), 2023
Erh-Chung Chen
Pin-Yu Chen
I-Hsin Chung
Che-Rung Lee
AAML
341
19
0
11 Apr 2023
Relational Context Learning for Human-Object Interaction Detection
Relational Context Learning for Human-Object Interaction DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Sanghyun Kim
Deunsol Jung
Minsu Cho
278
66
0
11 Apr 2023
StageInteractor: Query-based Object Detector with Cross-stage
  Interaction
StageInteractor: Query-based Object Detector with Cross-stage InteractionIEEE International Conference on Computer Vision (ICCV), 2023
Yao Teng
Haisong Liu
Sheng Guo
Limin Wang
ObjD
312
12
0
11 Apr 2023
PlantDet: A benchmark for Plant Detection in the Three-Rivers-Source
  Region
PlantDet: A benchmark for Plant Detection in the Three-Rivers-Source RegionInternational Conference on Artificial Neural Networks (ICANN), 2023
Huanhuan Li
Xuechao Zou
Yu-an Zhang
Jiangcai Zhaba
Guomei Li
Lamao Yongga
250
0
0
11 Apr 2023
Detection Transformer with Stable Matching
Detection Transformer with Stable MatchingIEEE International Conference on Computer Vision (ICCV), 2023
Siyi Liu
Tianhe Ren
Jia-Yu Chen
Zhaoyang Zeng
Hao Zhang
...
Hongyang Li
Jun Huang
Hang Su
Jun Zhu
Lei Zhang
233
56
0
10 Apr 2023
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via
  Word-Region Alignment
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region AlignmentComputer Vision and Pattern Recognition (CVPR), 2023
Lewei Yao
Jianhua Han
Xiaodan Liang
Danqian Xu
Wei Zhang
Zhenguo Li
Hang Xu
VLMObjDCLIP
312
104
0
10 Apr 2023
Sky-GVINS: a Sky-segmentation Aided GNSS-Visual-Inertial System for
  Robust Navigation in Urban Canyons
Sky-GVINS: a Sky-segmentation Aided GNSS-Visual-Inertial System for Robust Navigation in Urban CanyonsGeo-Spatial Information Science (GSIS), 2023
Jie Yin
Tao Li
Hao Yin
Wenxian Yu
Danping Zou
172
14
0
08 Apr 2023
DATE: Domain Adaptive Product Seeker for E-commerce
DATE: Domain Adaptive Product Seeker for E-commerceComputer Vision and Pattern Recognition (CVPR), 2023
Haoyuan Li
Haojie Jiang
Tao Jin
Meng-Juan Li
Yan Chen
Zhijie Lin
Yang Zhao
Zhou Zhao
316
6
0
07 Apr 2023
Boundary-Denoising for Video Activity Localization
Boundary-Denoising for Video Activity LocalizationInternational Conference on Learning Representations (ICLR), 2023
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Guohao Li
191
15
0
06 Apr 2023
Detecting and Grounding Multi-Modal Media Manipulation
Detecting and Grounding Multi-Modal Media ManipulationComputer Vision and Pattern Recognition (CVPR), 2023
Rui Shao
Tianxing Wu
Ziwei Liu
272
111
0
05 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for
  Scene-Text VQA
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQAAAAI Conference on Artificial Intelligence (AAAI), 2023
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
172
9
0
04 Apr 2023
A Comprehensive Review of YOLO Architectures in Computer Vision: From
  YOLOv1 to YOLOv8 and YOLO-NAS
A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NASMachine Learning and Knowledge Extraction (MLKE), 2023
Juan R. Terven
Diana-Margarita Córdova-Esparza
722
2,109
0
02 Apr 2023
Previous
123...121314...232425
Next
Page 13 of 25
Pageof 25