ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04159
  4. Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
Buyin Deng
Lingxin Huang
Kai Luo
Fei Teng
Kailun Yang
VOT
264
1
0
22 Sep 2025
Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity
Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity
Guangze Zheng
Shijie Lin
Haobo Zuo
Si Si
Ming-Shan Wang
Changhong Fu
Jia Pan
193
0
0
20 Sep 2025
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future ProspectsProceedings of the IEEE (Proc. IEEE), 2025
Le Zhang
Ao Li
Qibin Hou
Ce Zhu
Yonina C. Eldar
SupR
286
1
0
19 Sep 2025
The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection
The Missing Piece: A Case for Pre-Training in 3D Medical Object DetectionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Katharina Eckstein
Constantin Ulrich
Michael Baumgartner
Jessica Kächele
Dimitrios Bounias
Tassilo Wald
R. Floca
Klaus H. Maier-Hein
ViTMedIm
144
1
0
19 Sep 2025
Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Jihua Peng
Qianxiong Xu
Yichen Liu
Chenxi Liu
Cheng Long
Rui Zhao
Ziyue Li
LRM
105
0
0
19 Sep 2025
[Re] Improving Interpretation Faithfulness for Vision Transformers
[Re] Improving Interpretation Faithfulness for Vision Transformers
Izabela Kurek
Wojciech Trejter
Stipe Frkovic
Andro Erdelez
128
0
0
18 Sep 2025
Region-Aware Deformable Convolutions
Region-Aware Deformable Convolutions
Abolfazl Saheban Maleki
Maryam Imani
152
0
0
18 Sep 2025
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
Shuocheng Yang
Zikun Xu
Jiahao Wang
Shahid Nawaz
Jianqiang Wang
Shaobing Xu
103
0
0
18 Sep 2025
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction
Yiyi Liu
Chunyang Liu
Bohan Wang
Weiqin Jiao
Bojian Wu
Lubin Fan
Yuwei Chen
Fashuai Li
Biao Xiong
3DV
187
0
0
18 Sep 2025
FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras
FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras
Hang Li
Dianmo Sheng
Qiankun Dong
Z. Wang
Zhiwei Xu
Tao Li
131
0
0
17 Sep 2025
CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling
CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling
Hanfang Liang
Bing Wang
Shizhen Zhang
Wen Jiang
Yizhuo Yang
Weixiang Guo
Shenghai Yuan
Mamba
175
0
0
17 Sep 2025
EZREAL: Enhancing Zero-Shot Outdoor Robot Navigation toward Distant Targets under Varying Visibility
EZREAL: Enhancing Zero-Shot Outdoor Robot Navigation toward Distant Targets under Varying Visibility
Tianle Zeng
Jianwei Peng
Hanjing Ye
Guangcheng Chen
Senzi Luo
H. Zhang
113
0
0
17 Sep 2025
Improving Generalized Visual Grounding with Instance-aware Joint Learning
Improving Generalized Visual Grounding with Instance-aware Joint LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Lingfeng Yang
Zhenhua Feng
Wankou Yang
Jingdong Wang
ObjDISeg
256
4
0
17 Sep 2025
VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement
VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement
Jun Du
Weiwei Xing
Ming Li
Fei Richard Yu
DiffM
143
0
0
17 Sep 2025
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document ImagesIEEE International Conference on Document Analysis and Recognition (ICDAR), 2025
Rohan Kumar
Jyothi Swaroopa Jinka
Ravi Kiran Sarvadevabhatla
118
0
0
16 Sep 2025
Road Obstacle Video Segmentation
Road Obstacle Video Segmentation
Shyam Nandan Rai
Shyamgopal Karthik
Mariana-Iuliana Georgescu
Barbara Caputo
Carlo Masone
Zeynep Akata
VOS
218
0
0
16 Sep 2025
Image Tokenizer Needs Post-Training
Image Tokenizer Needs Post-Training
Kai Qiu
Xiang Li
Hao Chen
Jason Kuen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe Lin
Marios Savvides
VLM
197
4
0
15 Sep 2025
CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion
CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion
Santiago Montiel-Marín
Ángel Llamazares
Miguel Antunes-García
Fabio Sánchez-García
L. Bergasa
147
0
0
12 Sep 2025
BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird's-Eye View with Deformable Attention and Sparse Goal Proposals
BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird's-Eye View with Deformable Attention and Sparse Goal Proposals
Minsang Kong
Myeongjun Kim
Sang Gu Kang
Sang Hun Lee
161
0
0
12 Sep 2025
Multimodal SAM-adapter for Semantic Segmentation
Multimodal SAM-adapter for Semantic SegmentationIEEE Access (IEEE Access), 2025
Iacopo Curti
Pierluigi Zama Ramirez
Alioscia Petrelli
Luigi Di Stefano
137
1
0
12 Sep 2025
Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation
Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation
Vu-Minh Le
Thao-Anh Tran
Duc Huy Do
Xuan Canh Do
Huong Ninh
Hai Yen Tran
135
2
0
12 Sep 2025
Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Jiasheng Guo
Xin Gao
Yuxiang Yan
Guanghao Li
Jian Pu
131
2
0
11 Sep 2025
FPI-Det: a face--phone Interaction Dataset for phone-use detection and understanding
FPI-Det: a face--phone Interaction Dataset for phone-use detection and understanding
J. Gao
Tianqi Wang
Yu Zhang
Y. Zhang
Chenyuan Wang
Allan Dong
Zihao Wang
CVBM
141
0
0
11 Sep 2025
WAVE-DETR Multi-Modal Visible and Acoustic Real-Life Drone Detector
WAVE-DETR Multi-Modal Visible and Acoustic Real-Life Drone Detector
Razvan Stefanescu
Ethan Oh
Ruben Vazquez
Chris Mesterharm
Constantin Serban
R. Chadha
223
0
0
11 Sep 2025
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Zhongyu Xia
Hansong Yang
Yongtao Wang
3DPC
181
0
0
10 Sep 2025
Dual-Thresholding Heatmaps to Cluster Proposals for Weakly Supervised Object Detection
Dual-Thresholding Heatmaps to Cluster Proposals for Weakly Supervised Object Detection
Yuelin Guo
Haoyu He
Zhiyuan Chen
Zitong Huang
Renhao Lu
Lu Shi
Zejun Wang
Weizhe Zhang
179
0
0
10 Sep 2025
CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes
CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes
Marius Dähling
Sebastian Krebs
J. Marius Zöllner
132
1
0
10 Sep 2025
DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion
DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal FusionIEEE International Conference on Robotics and Automation (ICRA), 2025
Mengmeng Liu
M. Yang
Jiuming Liu
Yunpeng Zhang
Jiangtao Li
Sander Oude Elberink
G. Vosselman
Hao Cheng
146
1
0
07 Sep 2025
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View TransformationIEEE International Conference on Robotics and Automation (ICRA), 2025
In-Jae Lee
Sihwan Hwang
Youngseok Kim
Wonjune Kim
Sanmin Kim
Dongsuk Kum
136
1
0
06 Sep 2025
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
Ming Dai
Wenxuan Cheng
Jiedong Zhuang
Jiang-Jiang Liu
Hongshen Zhao
Zhenhua Feng
Wankou Yang
ObjD
229
3
0
05 Sep 2025
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
Xizhe Zhang
Jiayang Zhu
MedIm
86
0
0
03 Sep 2025
Enabling Federated Object Detection for Connected Autonomous Vehicles: A Deployment-Oriented Evaluation
Enabling Federated Object Detection for Connected Autonomous Vehicles: A Deployment-Oriented Evaluation
Komala Subramanyam Cherukuri
Kewei Sha
Zhenhua Huang
147
0
0
02 Sep 2025
FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation
FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation
Wenzhuang Wang
Yifan Zhao
Mingcan Ma
Ming-Yuan Liu
Zhonglin Jiang
Yong Chen
Jia Li
DiffM
142
1
0
01 Sep 2025
SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search
SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search
Xinyi Yu
Zhiwei Lin
Yongtao Wang
97
0
0
01 Sep 2025
An End-to-End Framework for Video Multi-Person Pose Estimation
An End-to-End Framework for Video Multi-Person Pose Estimation
Zhihong Wei
116
0
0
01 Sep 2025
MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection
MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection
Taiga Yamane
Satoshi Suzuki
Ryo Masumura
Shota Orihashi
Tomohiro Tanaka
Mana Ihori
Naoki Makishima
Naotaka Kawata
85
0
0
28 Aug 2025
To New Beginnings: A Survey of Unified Perception in Autonomous Vehicle Software
To New Beginnings: A Survey of Unified Perception in Autonomous Vehicle Software
Loic Stratil
F. Fent
Esteban Rivera
Markus Lienkamp
185
1
0
28 Aug 2025
HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection
HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection
Harris Song
Tuan-Anh Vu
Sanjith Menon
Sriram Narasimhan
M. Khalid Jawed
201
0
0
28 Aug 2025
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
Yuhang Zhao
Zixing Wang
93
0
0
27 Aug 2025
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Xiaoqi Wang
Yun Zhang
Weisi Lin
148
0
0
27 Aug 2025
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
Fayaz Ali
Muhammad Zawish
Steven Davy
Radu Timofte
99
0
0
27 Aug 2025
DQEN: Dual Query Enhancement Network for DETR-based HOI Detection
DQEN: Dual Query Enhancement Network for DETR-based HOI Detection
Zhehao Li
Chong Wang
Yi Chen
Yinghao Lu
Jiangbo Qian
Jiong Wang
Jiafei Wu
112
0
0
26 Aug 2025
Neural Proteomics Fields for Super-resolved Spatial Proteomics Prediction
Neural Proteomics Fields for Super-resolved Spatial Proteomics PredictionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Bokai Zhao
Weiyang Shi
Hanqing Chao
Zijiang Yang
Yiyang Zhang
Ming Song
Tianzi Jiang
88
0
0
24 Aug 2025
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse WeatherEuropean Conference on Computer Vision (ECCV), 2025
Edoardo Palladin
Roland Dietze
Praveen Narayanan
Mario Bijelic
Felix Heide
192
11
0
22 Aug 2025
Representation Learning with Adaptive Superpixel Coding
Representation Learning with Adaptive Superpixel Coding
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
128
0
0
21 Aug 2025
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
Han Li
Shaofei Huang
Longfei Xu
Yulu Gao
Beipeng Mu
Si Liu
88
0
0
21 Aug 2025
Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting
Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting
Gyusam Chang
Tuan-Anh Vu
Vivek Alumootil
Harris Song
Deanna Pham
Sangpil Kim
M. Khalid Jawed
3DGS
125
1
0
20 Aug 2025
Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels
Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels
Fabian Holst
Emre Gülsoylu
Simone Frintrop
137
1
0
20 Aug 2025
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin
Samuel Brucker
Filippo Ghilotti
Praveen Narayanan
Mario Bijelic
Felix Heide
SSL
145
1
0
19 Aug 2025
SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception
SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception
Melih Yazgan
Qiyuan Wu
Iramm Hamdard
Shiqi Li
J. Zoellner
162
2
0
18 Aug 2025
Previous
123456...545556
Next