v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

A lightweight model FDM-YOLO for small target improvement based on YOLOv8

Xuerui Zhang

ObjD

258

06 Mar 2025

Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism

Ziyue Zhao

Qining Qi

Jianfa Ma

217

06 Mar 2025

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance

368

05 Mar 2025

Boltzmann Attention Sampling for Image Analysis with Small ObjectsComputer Vision and Pattern Recognition (CVPR), 2025

447

04 Mar 2025

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

455

03 Mar 2025

MI-DETR: An Object Detection Model with Multi-time Inquiries MechanismComputer Vision and Pattern Recognition (CVPR), 2025

361

03 Mar 2025

Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized DetectionComputer Vision and Pattern Recognition (CVPR), 2025

541

03 Mar 2025

Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR

Muhammad Musab Ansari

258

03 Mar 2025

Composed Multi-modal Retrieval: A Survey of Approaches and Applications

...

427

03 Mar 2025

RTGen: Real-Time Generative Detection Transformer

420

28 Feb 2025

New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM CollaborationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

492

27 Feb 2025

BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth GuidanceComputer Vision and Pattern Recognition (CVPR), 2025

438

27 Feb 2025

WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation

Yibo Wang

...

559

27 Feb 2025

SAC-ViT: Semantic-Aware Clustering Vision Transformer with Early Exit

323

27 Feb 2025

QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating ObjectsAAAI Conference on Artificial Intelligence (AAAI), 2025

Elkhan Ismayilzada

MD Khalequzzaman Chowdhury Sayem

Yihalem Yimolal Tiruneh

342

27 Feb 2025

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2025

388

26 Feb 2025

CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object QueryIEEE International Conference on Robotics and Automation (ICRA), 2025

387

26 Feb 2025

Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads

Istiaq Ahmed Fahad

Abdullah Ibne Hanif Arean

Nazmus Sakib Ahmed

Mahmudul Hasan

ViT

159

25 Feb 2025

Improving Transformer Based Line Segment Detection with Matched Predicting and Re-rankingAAAI Conference on Artificial Intelligence (AAAI), 2025

250

25 Feb 2025

Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks

254

24 Feb 2025

DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

483

24 Feb 2025

MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition

Paul Koch

Marian Schluter

Jörg Krüger

306

24 Feb 2025

A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition

197

24 Feb 2025

Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment

336

23 Feb 2025

OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models

344

22 Feb 2025

Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines

...

405

21 Feb 2025

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

470

145

21 Feb 2025

MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding

406

18 Feb 2025

GraphMorph: Tubular Structure Extraction by Morphing Predicted GraphsNeural Information Processing Systems (NeurIPS), 2025

290

17 Feb 2025

RT-DEMT: A hybrid real-time acupoint detection model combining mamba and transformer

513

16 Feb 2025

CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs

Qizhen Lan

Qing Tian

280

15 Feb 2025

Improving action segmentation via explicit similarity measurement

268

15 Feb 2025

Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis

Amir Hosein Fadaei

M. Dehaqani

343

11 Feb 2025

Dense Object Detection Based on De-homogenized Queries

419

11 Feb 2025

Cell Nuclei Detection and Classification in Whole Slide Images with Transformers

Oscar Pina

Eduard Dorca

Verónica Vilaplana

158

10 Feb 2025

Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object DetectionNeural Information Processing Systems (NeurIPS), 2025

491

10 Feb 2025

SMART: Advancing Scalable Map Priors for Driving Topology ReasoningIEEE International Conference on Robotics and Automation (ICRA), 2025

Henrik I. Christensen

Yue Wang

Liu Ren

LRM

363

06 Feb 2025

Foundation Model-Based Apple Ripeness and Size Estimation for Selective HarvestingComputers and Electronics in Agriculture (CEA), 2025

Siddhartha Bhattacharya

R. Lu

Zhaojian Li

355

03 Feb 2025

IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data DomainIEEE International Conference on Robotics and Automation (ICRA), 2025

183

30 Jan 2025

Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identificationIEEE Transactions on Image Processing (IEEE TIP), 2025

423

28 Jan 2025

B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing

186

28 Jan 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice RepresentationIEEE International Conference on Robotics and Automation (ICRA), 2025

389

28 Jan 2025

V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection

357

28 Jan 2025

CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors

310

28 Jan 2025

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

Heqian Qiu

Hongliang Li

ObjD VLM

1.0K

28 Jan 2025

Object Detection for Medical Image Analysis: Insights from the RT-DETR Model

229

27 Jan 2025

MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation

616

23 Jan 2025

A generalizable 3D framework and model for self-supervised learning in medical imaging

343

20 Jan 2025

LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection

378

18 Jan 2025

Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution

457

17 Jan 2025