v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019

Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,204 papers shown

Single-Stage Visual Query Localization in Egocentric VideosNeural Information Processing Systems (NeurIPS), 2023

Hanwen Jiang

Santhosh Kumar Ramakrishnan

Kristen Grauman

269

15 Jun 2023

Relation-Aware Diffusion Model for Controllable Poster Layout GenerationInternational Conference on Information and Knowledge Management (CIKM), 2023

...

187

15 Jun 2023

World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

218

14 Jun 2023

OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

Quentin Delfosse

Johannes Czech

Bjarne Gregori

Sebastian Sztwiertnia

Kristian Kersting

504

14 Jun 2023

Single-Stage Visual Relationship Learning using Conditional QueriesNeural Information Processing Systems (NeurIPS), 2023

224

09 Jun 2023

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Cheng Wang

378

08 Jun 2023

Object Detection with Transformers: A ReviewItalian National Conference on Sensors (INS), 2023

Tahira Shehzadi

K. Hashmi

D. Stricker

Muhammad Zeshan Afzal

ViT MU

423

07 Jun 2023

Language Adaptive Weight Generation for Multi-task Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2023

Xi Li

297

06 Jun 2023

Student Classroom Behavior Detection based on Improved YOLOv7

Fan Yang

149

06 Jun 2023

Cross-Domain Car Detection Model with Integrated Convolutional Block Attention MechanismImage and Vision Computing (IVC), 2023

263

31 May 2023

VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges

203

31 May 2023

Table Detection for Visually Rich Document ImagesKnowledge-Based Systems (KBS), 2023

197

30 May 2023

LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

209

30 May 2023

Contextual Object Detection with Multimodal Large Language ModelsInternational Journal of Computer Vision (IJCV), 2023

328

142

29 May 2023

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023

Ziyu Guo

Wei Zhang

Yu Qiao

Zhongjiang He

337

25 May 2023

Semi-Supervised and Long-Tailed Object Detection with CascadeMatchInternational Journal of Computer Vision (IJCV), 2023

238

24 May 2023

Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023

330

23 May 2023

A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metricsOptimization Letters (Optim. Lett.), 2023

127

22 May 2023

nnDetection for Intracranial Aneurysms Detection and Localization

22 May 2023

Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail DataIEEE International Conference on Computer Vision (ICCV), 2023

Na Dong

Yongqiang Zhang

Mingli Ding

G. Lee

203

22 May 2023

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything ModelPattern Recognition (Pattern Recogn.), 2023

304

22 May 2023

Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic SurgeryIEEE International Conference on Robotics and Automation (ICRA), 2023

Long Bai

Mobarakol Islam

Lalithkumar Seenivasan

Hongliang Ren

185

19 May 2023

Selecting Learnable Training Samples is All DETRs Need in Crowded Pedestrian DetectionACM Multimedia (ACM MM), 2023

Xinbo Gao

221

18 May 2023

Rethinking Boundary Discontinuity Problem for Oriented Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023

Hang Xu

175

17 May 2023

Two-Stream Regression Network for Dental Implant Position PredictionExpert systems with applications (ESWA), 2023

260

17 May 2023

Understanding 3D Object Interaction from a Single ImageIEEE International Conference on Computer Vision (ICCV), 2023

Shengyi Qian

David Fouhey

364

16 May 2023

CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual GroundingIEEE transactions on multimedia (IEEE TMM), 2023

Linhui Xiao

Xiaoshan Yang

Fang Peng

Ming Yan

Yaowei Wang

Changsheng Xu

ObjD VLM

472

15 May 2023

CLRerNet: Improving Confidence of Lane Detection with LaneIoUIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Hiroto Honda

Yusuke Uchida

286

15 May 2023

Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection TransformerIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Hakjin Lee

Minki Song

Jamyoung Koo

Junghoon Seo

560

12 May 2023

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object DetectionIEEE Transactions on Intelligent Vehicles (TIV), 2023

Xuan He

Fan Yang

Kailun Yang

342

12 May 2023

Undercover Deepfakes: Detecting Fake Segments in Videos

415

11 May 2023

Real-time instance segmentation with polygons using an Intersection-over-Union lossCanadian Conference on Computer and Robot Vision (CRV), 2023

Katia Jodogne-Del Litto

Guillaume-Alexandre Bilodeau

179

09 May 2023

A Cross-direction Task Decoupling Network for Small Logo DetectionIEEE International Conference on Multimedia and Expo (ICME), 2023

...

149

04 May 2023

MH-DETR: Video Moment and Highlight Detection with Cross-modal TransformerIEEE International Joint Conference on Neural Network (IJCNN), 2023

Yang Li

271

29 Apr 2023

Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking

Huchuan Lu

376

27 Apr 2023

End-to-End Spatio-Temporal Action Localisation with Video TransformersComputer Vision and Pattern Recognition (CVPR), 2023

258

24 Apr 2023

Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future DirectionsIEEE Reviews in Biomedical Engineering (RBME), 2023

Luyang Luo

438

13 Apr 2023

Real-time Trajectory-based Social Group DetectionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

Simindokht Jahangard

Munawar Hayat

Hamid Rezatofighi

149

12 Apr 2023

Overload: Latency Attacks on Object Detection for Edge DevicesComputer Vision and Pattern Recognition (CVPR), 2023

341

11 Apr 2023

Relational Context Learning for Human-Object Interaction DetectionComputer Vision and Pattern Recognition (CVPR), 2023

Sanghyun Kim

Deunsol Jung

Minsu Cho

278

11 Apr 2023

StageInteractor: Query-based Object Detector with Cross-stage InteractionIEEE International Conference on Computer Vision (ICCV), 2023

312

11 Apr 2023

PlantDet: A benchmark for Plant Detection in the Three-Rivers-Source RegionInternational Conference on Artificial Neural Networks (ICANN), 2023

250

11 Apr 2023

Detection Transformer with Stable MatchingIEEE International Conference on Computer Vision (ICCV), 2023

...

Hang Su

Jun Zhu

Lei Zhang

233

10 Apr 2023

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region AlignmentComputer Vision and Pattern Recognition (CVPR), 2023

Lewei Yao

Jianhua Han

Xiaodan Liang

Danqian Xu

Wei Zhang

Zhenguo Li

Hang Xu

VLM ObjD CLIP

312

104

10 Apr 2023

Sky-GVINS: a Sky-segmentation Aided GNSS-Visual-Inertial System for Robust Navigation in Urban CanyonsGeo-Spatial Information Science (GSIS), 2023

172

08 Apr 2023

DATE: Domain Adaptive Product Seeker for E-commerceComputer Vision and Pattern Recognition (CVPR), 2023

Zhou Zhao

316

07 Apr 2023

Boundary-Denoising for Video Activity LocalizationInternational Conference on Learning Representations (ICLR), 2023

Juan-Manuel Perez-Rua

Guohao Li

191

06 Apr 2023

Detecting and Grounding Multi-Modal Media ManipulationComputer Vision and Pattern Recognition (CVPR), 2023

Rui Shao

Tianxing Wu

Ziwei Liu

272

111

05 Apr 2023

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQAAAAI Conference on Artificial Intelligence (AAAI), 2023

Xin Li

172

04 Apr 2023

A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NASMachine Learning and Knowledge Extraction (MLKE), 2023

Juan R. Terven

Diana-Margarita Córdova-Esparza

722

2,109

02 Apr 2023