Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2021

8 October 2020

Weijie Su

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking

265

22 Sep 2025

Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity

193

20 Sep 2025

Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects

Proceedings of the IEEE (Proc. IEEE), 2025

287

19 Sep 2025

The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

144

19 Sep 2025

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

108

19 Sep 2025

[Re] Improving Interpretation Faithfulness for Vision Transformers

128

18 Sep 2025

Region-Aware Deformable Convolutions

Abolfazl Saheban Maleki

Maryam Imani

152

18 Sep 2025

RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving

106

18 Sep 2025

CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

193

18 Sep 2025

FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras

134

17 Sep 2025

CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling

176

17 Sep 2025

EZREAL: Enhancing Zero-Shot Outdoor Robot Navigation toward Distant Targets under Varying Visibility

113

17 Sep 2025

Improving Generalized Visual Grounding with Instance-aware Joint Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

256

17 Sep 2025

VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement

144

17 Sep 2025

TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images

IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025

Rohan Kumar

Jyothi Swaroopa Jinka

Ravi Kiran Sarvadevabhatla

118

16 Sep 2025

Road Obstacle Video Segmentation

Shyam Nandan Rai

Shyamgopal Karthik

Mariana-Iuliana Georgescu

222

16 Sep 2025

Image Tokenizer Needs Post-Training

197

15 Sep 2025

CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion

Santiago Montiel-Marín

Ángel Llamazares

Miguel Antunes-García

Fabio Sánchez-García

L. Bergasa

147

12 Sep 2025

BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird's-Eye View with Deformable Attention and Sparse Goal Proposals

161

12 Sep 2025

Multimodal SAM-adapter for Semantic Segmentation

IEEE Access (IEEE Access), 2025

Iacopo Curti

Pierluigi Zama Ramirez

Alioscia Petrelli

Luigi Di Stefano

137

12 Sep 2025

Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation

135

12 Sep 2025

Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection

133

11 Sep 2025

FPI-Det: a face-phone Interaction Dataset for phone-use detection and understanding

141

11 Sep 2025

WAVE-DETR Multi-Modal Visible and Acoustic Real-Life Drone Detector

223

11 Sep 2025

InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection

182

10 Sep 2025

Dual-Thresholding Heatmaps to Cluster Proposals for Weakly Supervised Object Detection

179

10 Sep 2025

CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes

Marius Dähling

Sebastian Krebs

J. Marius Zöllner

132

10 Sep 2025

DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion

IEEE International Conference on Robotics and Automation (ICRA), 2025

149

07 Sep 2025

CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation

IEEE International Conference on Robotics and Automation (ICRA), 2025

136

06 Sep 2025

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

229

05 Sep 2025

Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions

Xizhe Zhang

Jiayang Zhu

MedIm

03 Sep 2025

Enabling Federated Object Detection for Connected Autonomous Vehicles: A Deployment-Oriented Evaluation

Komala Subramanyam Cherukuri

Kewei Sha

Zhenhua Huang

150

02 Sep 2025

FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation

144

01 Sep 2025

SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search

Xinyi Yu

Zhiwei Lin

Yongtao Wang

100

01 Sep 2025

An End-to-End Framework for Video Multi-Person Pose Estimation

Zhihong Wei

116

01 Sep 2025

MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection

28 Aug 2025

To New Beginnings: A Survey of Unified Perception in Autonomous Vehicle Software

185

28 Aug 2025

HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection

205

28 Aug 2025

FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection

Yuhang Zhao

Zixing Wang

27 Aug 2025

Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models

Xiaoqi Wang

Yun Zhang

Weisi Lin

148

27 Aug 2025

WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution

27 Aug 2025

DQEN: Dual Query Enhancement Network for DETR-based HOI Detection

112

26 Aug 2025

Neural Proteomics Fields for Super-resolved Spatial Proteomics Prediction

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

24 Aug 2025

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

European Conference on Computer Vision (ECCV), 2025

192

22 Aug 2025

Representation Learning with Adaptive Superpixel Coding

128

21 Aug 2025

RATopo: Improving Lane Topology Reasoning via Redundancy Assignment

21 Aug 2025

Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting

125

20 Aug 2025

Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels

Fabian Holst

Emre Gülsoylu

Simone Frintrop

137

20 Aug 2025

Self-Supervised Sparse Sensor Fusion for Long Range Perception

145

19 Aug 2025

SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception

169

18 Aug 2025