Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

IEEE Transactions on Multimedia (TMM), 2025

142

14 Jan 2025

BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

Farnoosh Koleini

Muhammad Usama Saleem

324

14 Jan 2025

SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing

Varun Biyyala

Bharat Chanderprakash Kathuria

Jialu Li

Youshan Zhang

327

13 Jan 2025

TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

307

13 Jan 2025

Toward Realistic Camouflaged Object Detection: Benchmarks and Method

202

13 Jan 2025

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

AAAI Conference on Artificial Intelligence (AAAI), 2025

152

12 Jan 2025

MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis

Henrik I. Christensen

Liu Ren

3DGS ViT

227

11 Jan 2025

YO-CSA-T: A Real-time Badminton Tracking System Utilizing YOLO Based on Contextual and Spatial Attention

Yuan Lai

Zhiwei Shi

Chengxi Zhu

11 Jan 2025

Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance

AAAI Conference on Artificial Intelligence (AAAI), 2024

573

10 Jan 2025

UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping

295

10 Jan 2025

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

Computer Vision and Pattern Recognition (CVPR), 2025

260

08 Jan 2025

AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features

Applied Sciences (AS), 2025

206

08 Jan 2025

Siamese-DETR for Generic Multi-Object Tracking

IEEE Transactions on Image Processing (IEEE TIP), 2023

310

08 Jan 2025

Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves

Madeleine Darbyshire

Elizabeth I. Sklar

Simon Parsons

287

03 Jan 2025

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Neural Information Processing Systems (NeurIPS), 2024

...

868

121

03 Jan 2025

Open-Set Object Detection By Aligning Known Class Representations

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Vineeth N. Balasubramanian

ObjD

208

31 Dec 2024

Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model

IEEE International Conference on Image Processing (ICIP), 2024

233

25 Dec 2024

Evaluating the Adversarial Robustness of Detection Transformers

297

25 Dec 2024

Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper

Helia Mohamadi

Mohammad Ali Keyvanrad

Mohammad Reza Mohammadi

339

23 Dec 2024

Towards Unsupervised Model Selection for Domain Adaptive Object Detection

Neural Information Processing Systems (NeurIPS), 2024

275

23 Dec 2024

NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors

AAAI Conference on Artificial Intelligence (AAAI), 2024

369

22 Dec 2024

ImagineMap: Enhanced HD Map Construction with SD Maps

Yishen Ji

Zhiqi Li

Tong Lu

322

22 Dec 2024

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

401

22 Dec 2024

Object Detection Approaches to Identifying Hand Images with High Forensic Values

IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024

291

21 Dec 2024

LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer

...

366

18 Dec 2024

Differential Alignment for Domain Adaptive Object Detection

Xinyu He

Xinhui Li

Xiaojie Guo

348

17 Dec 2024

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

Computer Vision and Pattern Recognition (CVPR), 2024

503

17 Dec 2024

SAMIC: Segment Anything with In-Context Spatial Prompt Engineering

331

16 Dec 2024

CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

269

16 Dec 2024

Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning

343

16 Dec 2024

V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

303

16 Dec 2024

Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos

IEEE Transactions on Multimedia (IEEE TMM), 2024

341

14 Dec 2024

Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm

AAAI Conference on Artificial Intelligence (AAAI), 2024

246

14 Dec 2024

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Computer Vision and Pattern Recognition (CVPR), 2024

723

14 Dec 2024

PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation

Lojze Žust

Matej Kristan

ViT

316

13 Dec 2024

UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework

374

12 Dec 2024

TimeRefine: Temporal Grounding with Time Refining Video LLM

492

12 Dec 2024

Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset

Computer Vision and Pattern Recognition (CVPR), 2024

277

09 Dec 2024

GAQAT: gradient-adaptive quantization-aware training for domain generalization

294

07 Dec 2024

Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

K. Hashmi

Talha Uddin Sheikh

Didier Stricker

Muhammad Zeshan Afzal

289

06 Dec 2024

Cubify Anything: Scaling Indoor 3D Object Detection

Computer Vision and Pattern Recognition (CVPR), 2024

254

05 Dec 2024

Towards Real-Time Open-Vocabulary Video Instance Segmentation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

294

05 Dec 2024

Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

303

05 Dec 2024

Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable

491

03 Dec 2024

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation

421

02 Dec 2024

HandOS: 3D Hand Reconstruction in One Stage

Computer Vision and Pattern Recognition (CVPR), 2024

504

02 Dec 2024

SyncVIS: Synchronized Video Instance Segmentation

Neural Information Processing Systems (NeurIPS), 2024

317

01 Dec 2024

Explaining Object Detectors via Collective Contribution of Pixels

575

01 Dec 2024

LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

385

30 Nov 2024

Does Self-Attention Need Separate Weights in Transformers?

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Md. Kowsher

Nusrat Jahan Prottasha

Chun-Nam Yu

O. Garibay

Niloofar Yousefi

1.1K

30 Nov 2024