v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

Background Fades, Foreground Leads: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception

105

22 Oct 2025

A Unified Detection Pipeline for Robust Object Detection in Fisheye-Based Traffic Surveillance

Neema Jakisa Owor

Joshua Kofi Asamoah

Tanner Muturi

Anneliese Jakisa Owor

149

22 Oct 2025

Comparative Analysis of Object Detection Algorithms for Surface Defect Detection

Arpan Maity

Tamal Ghosh

ObjD

128

21 Oct 2025

SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference

138

20 Oct 2025

ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification

149

19 Oct 2025

Proto-Former: Unified Facial Landmark Detection by Prototype Transformer

158

17 Oct 2025

ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection

168

17 Oct 2025

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching

220

16 Oct 2025

MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning

Mattia Segu

Marta Tintore Gazulla

Yongqin Xian

Luc Van Gool

Federico Tombari

16 Oct 2025

Complementary Information Guided Occupancy Prediction via Multi-Level Representation FusionIEEE International Conference on Robotics and Automation (ICRA), 2025

178

15 Oct 2025

UniVector: Unified Vector Extraction via Instance-Geometry Interaction

113

15 Oct 2025

What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging

15 Oct 2025

CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection

...

216

14 Oct 2025

MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking

119

14 Oct 2025

Detect Anything via Next Point Prediction

211

14 Oct 2025

Source-Free Object Detection with Detection TransformerIEEE Transactions on Image Processing (IEEE TIP), 2025

13 Oct 2025

Unified Open-World Segmentation with Multi-Modal Prompts

106

12 Oct 2025

Complementary and Contrastive Learning for Audio-Visual SegmentationIEEE transactions on multimedia (TMM), 2025

240

11 Oct 2025

KORMo: Korean Open Reasoning Model for Everyone

...

152

10 Oct 2025

Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding

226

10 Oct 2025

Utilizing dynamic sparsity on pretrained DETR

148

10 Oct 2025

Learning Global Representation from Queries for Vectorized HD Map Construction

111

08 Oct 2025

Deforming Videos to Masks: Flow Matching for Referring Video Segmentation

225

07 Oct 2025

Shaken or Stirred? An Analysis of MetaFormer's Token Mixing for Medical Imaging

185

07 Oct 2025

Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition

120

05 Oct 2025

Referring Expression Comprehension for Small Objects

146

04 Oct 2025

Cross-View Open-Vocabulary Object Detection in Aerial Imagery

197

04 Oct 2025

Align Your Query: Representation Alignment for Multimodality Medical Object Detection

136

03 Oct 2025

PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning

Raahul Krishna Durairaju

K. Saruladha

165

02 Oct 2025

Holistic Order Prediction in Natural Scenes

259

02 Oct 2025

A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety

Muhammad Monjurul Karim

158

30 Sep 2025

Looking Beyond the Known: Towards a Data Discovery Guided Open-World Object Detection

Anay Majee

Amitesh Gangrade

Rishabh K. Iyer

117

30 Sep 2025

A Multi-Camera Vision-Based Approach for Fine-Grained Assembly Quality Control

118

28 Sep 2025

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

301

28 Sep 2025

INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception

28 Sep 2025

C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection

...

257

27 Sep 2025

UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation

112

27 Sep 2025

FMC-DETR: Frequency-Decoupled Multi-Domain Coordination for Aerial-View Object Detection

170

27 Sep 2025

Motion-Aware Transformer for Multi-Object Tracking

Xu Yang

Gady Agam

VOT

391

26 Sep 2025

FSMODNet: A Closer Look at Few-Shot Detection in Multispectral Data

147

25 Sep 2025

Real-Time Object Detection Meets DINOv3

375

25 Sep 2025

Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models

Juana Valeria Hurtado

Rohit Mohan

Abhinav Valada

177

24 Sep 2025

Knowledge Transfer from Interaction Learning

125

23 Sep 2025

Frequency-Domain Decomposition and Recomposition for Robust Audio-Visual Segmentation

209

23 Sep 2025

SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines

110

23 Sep 2025

MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving

121

23 Sep 2025

Track-On2: Enhancing Online Point Tracking with Memory

238

23 Sep 2025

NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment

Ajay Narayanan Sridhar

Fuli Qiao

Nelson Daniel Troncoso Aldas

105

23 Sep 2025

Visual Instruction Pretraining for Domain-Specific Foundation Models

289

22 Sep 2025

DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking

264

22 Sep 2025