Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
Buyin Deng
Lingxin Huang
Kai Luo
Fei Teng
Kailun Yang
VOT
264
1
0
22 Sep 2025
Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity
Guangze Zheng
Shijie Lin
Haobo Zuo
Si Si
Ming-Shan Wang
Changhong Fu
Jia Pan
193
0
0
20 Sep 2025
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects
Proceedings of the IEEE (Proc. IEEE), 2025
Le Zhang
Ao Li
Qibin Hou
Ce Zhu
Yonina C. Eldar
SupR
286
1
0
19 Sep 2025
The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Katharina Eckstein
Constantin Ulrich
Michael Baumgartner
Jessica Kächele
Dimitrios Bounias
Tassilo Wald
R. Floca
Klaus H. Maier-Hein
ViT
MedIm
144
1
0
19 Sep 2025
Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Jihua Peng
Qianxiong Xu
Yichen Liu
Chenxi Liu
Cheng Long
Rui Zhao
Ziyue Li
LRM
105
0
0
19 Sep 2025
[Re] Improving Interpretation Faithfulness for Vision Transformers
Izabela Kurek
Wojciech Trejter
Stipe Frkovic
Andro Erdelez
128
0
0
18 Sep 2025
Region-Aware Deformable Convolutions
Abolfazl Saheban Maleki
Maryam Imani
152
0
0
18 Sep 2025
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
Shuocheng Yang
Zikun Xu
Jiahao Wang
Shahid Nawaz
Jianqiang Wang
Shaobing Xu
103
0
0
18 Sep 2025
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction
Yiyi Liu
Chunyang Liu
Bohan Wang
Weiqin Jiao
Bojian Wu
Lubin Fan
Yuwei Chen
Fashuai Li
Biao Xiong
3DV
187
0
0
18 Sep 2025
FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras
Hang Li
Dianmo Sheng
Qiankun Dong
Z. Wang
Zhiwei Xu
Tao Li
131
0
0
17 Sep 2025
CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling
Hanfang Liang
Bing Wang
Shizhen Zhang
Wen Jiang
Yizhuo Yang
Weixiang Guo
Shenghai Yuan
Mamba
175
0
0
17 Sep 2025
EZREAL: Enhancing Zero-Shot Outdoor Robot Navigation toward Distant Targets under Varying Visibility
Tianle Zeng
Jianwei Peng
Hanjing Ye
Guangcheng Chen
Senzi Luo
H. Zhang
113
0
0
17 Sep 2025
Improving Generalized Visual Grounding with Instance-aware Joint Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Lingfeng Yang
Zhenhua Feng
Wankou Yang
Jingdong Wang
ObjD
ISeg
256
4
0
17 Sep 2025
VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement
Jun Du
Weiwei Xing
Ming Li
Fei Richard Yu
DiffM
143
0
0
17 Sep 2025
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025
Rohan Kumar
Jyothi Swaroopa Jinka
Ravi Kiran Sarvadevabhatla
118
0
0
16 Sep 2025
Road Obstacle Video Segmentation
Shyam Nandan Rai
Shyamgopal Karthik
Mariana-Iuliana Georgescu
Barbara Caputo
Carlo Masone
Zeynep Akata
VOS
218
0
0
16 Sep 2025
Image Tokenizer Needs Post-Training
Kai Qiu
Xiang Li
Hao Chen
Jason Kuen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe Lin
Marios Savvides
VLM
197
4
0
15 Sep 2025
CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion
Santiago Montiel-Marín
Ángel Llamazares
Miguel Antunes-García
Fabio Sánchez-García
L. Bergasa
147
0
0
12 Sep 2025
BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird's-Eye View with Deformable Attention and Sparse Goal Proposals
Minsang Kong
Myeongjun Kim
Sang Gu Kang
Sang Hun Lee
161
0
0
12 Sep 2025
Multimodal SAM-adapter for Semantic Segmentation
IEEE Access (IEEE Access), 2025
Iacopo Curti
Pierluigi Zama Ramirez
Alioscia Petrelli
Luigi Di Stefano
137
1
0
12 Sep 2025
Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation
Vu-Minh Le
Thao-Anh Tran
Duc Huy Do
Xuan Canh Do
Huong Ninh
Hai Yen Tran
135
2
0
12 Sep 2025
Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Jiasheng Guo
Xin Gao
Yuxiang Yan
Guanghao Li
Jian Pu
131
2
0
11 Sep 2025
FPI-Det: a face--phone Interaction Dataset for phone-use detection and understanding
J. Gao
Tianqi Wang
Yu Zhang
Y. Zhang
Chenyuan Wang
Allan Dong
Zihao Wang
CVBM
141
0
0
11 Sep 2025
WAVE-DETR Multi-Modal Visible and Acoustic Real-Life Drone Detector
Razvan Stefanescu
Ethan Oh
Ruben Vazquez
Chris Mesterharm
Constantin Serban
R. Chadha
223
0
0
11 Sep 2025
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Zhongyu Xia
Hansong Yang
Yongtao Wang
3DPC
181
0
0
10 Sep 2025
Dual-Thresholding Heatmaps to Cluster Proposals for Weakly Supervised Object Detection
Yuelin Guo
Haoyu He
Zhiyuan Chen
Zitong Huang
Renhao Lu
Lu Shi
Zejun Wang
Weizhe Zhang
179
0
0
10 Sep 2025
CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes
Marius Dähling
Sebastian Krebs
J. Marius Zöllner
132
1
0
10 Sep 2025
DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion
IEEE International Conference on Robotics and Automation (ICRA), 2025
Mengmeng Liu
M. Yang
Jiuming Liu
Yunpeng Zhang
Jiangtao Li
Sander Oude Elberink
G. Vosselman
Hao Cheng
146
1
0
07 Sep 2025
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
IEEE International Conference on Robotics and Automation (ICRA), 2025
In-Jae Lee
Sihwan Hwang
Youngseok Kim
Wonjune Kim
Sanmin Kim
Dongsuk Kum
136
1
0
06 Sep 2025
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
Ming Dai
Wenxuan Cheng
Jiedong Zhuang
Jiang-Jiang Liu
Hongshen Zhao
Zhenhua Feng
Wankou Yang
ObjD
229
3
0
05 Sep 2025
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
Xizhe Zhang
Jiayang Zhu
MedIm
86
0
0
03 Sep 2025
Enabling Federated Object Detection for Connected Autonomous Vehicles: A Deployment-Oriented Evaluation
Komala Subramanyam Cherukuri
Kewei Sha
Zhenhua Huang
147
0
0
02 Sep 2025
FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation
Wenzhuang Wang
Yifan Zhao
Mingcan Ma
Ming-Yuan Liu
Zhonglin Jiang
Yong Chen
Jia Li
DiffM
142
1
0
01 Sep 2025
SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search
Xinyi Yu
Zhiwei Lin
Yongtao Wang
97
0
0
01 Sep 2025
An End-to-End Framework for Video Multi-Person Pose Estimation
Zhihong Wei
116
0
0
01 Sep 2025
MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection
Taiga Yamane
Satoshi Suzuki
Ryo Masumura
Shota Orihashi
Tomohiro Tanaka
Mana Ihori
Naoki Makishima
Naotaka Kawata
85
0
0
28 Aug 2025
To New Beginnings: A Survey of Unified Perception in Autonomous Vehicle Software
Loic Stratil
F. Fent
Esteban Rivera
Markus Lienkamp
185
1
0
28 Aug 2025
HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection
Harris Song
Tuan-Anh Vu
Sanjith Menon
Sriram Narasimhan
M. Khalid Jawed
201
0
0
28 Aug 2025
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
Yuhang Zhao
Zixing Wang
93
0
0
27 Aug 2025
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Xiaoqi Wang
Yun Zhang
Weisi Lin
148
0
0
27 Aug 2025
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
Fayaz Ali
Muhammad Zawish
Steven Davy
Radu Timofte
99
0
0
27 Aug 2025
DQEN: Dual Query Enhancement Network for DETR-based HOI Detection
Zhehao Li
Chong Wang
Yi Chen
Yinghao Lu
Jiangbo Qian
Jiong Wang
Jiafei Wu
112
0
0
26 Aug 2025
Neural Proteomics Fields for Super-resolved Spatial Proteomics Prediction
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Bokai Zhao
Weiyang Shi
Hanqing Chao
Zijiang Yang
Yiyang Zhang
Ming Song
Tianzi Jiang
88
0
0
24 Aug 2025
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
European Conference on Computer Vision (ECCV), 2025
Edoardo Palladin
Roland Dietze
Praveen Narayanan
Mario Bijelic
Felix Heide
192
11
0
22 Aug 2025
Representation Learning with Adaptive Superpixel Coding
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
128
0
0
21 Aug 2025
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
Han Li
Shaofei Huang
Longfei Xu
Yulu Gao
Beipeng Mu
Si Liu
88
0
0
21 Aug 2025
Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting
Gyusam Chang
Tuan-Anh Vu
Vivek Alumootil
Harris Song
Deanna Pham
Sangpil Kim
M. Khalid Jawed
3DGS
125
1
0
20 Aug 2025
Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels
Fabian Holst
Emre Gülsoylu
Simone Frintrop
137
1
0
20 Aug 2025
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin
Samuel Brucker
Filippo Ghilotti
Praveen Narayanan
Mario Bijelic
Felix Heide
SSL
145
1
0
19 Aug 2025
SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception
Melih Yazgan
Qiyuan Wu
Iramm Hamdard
Shiqi Li
J. Zoellner
162
2
0
18 Aug 2025
Previous
1
2
3
4
5
6
...
54
55
56
Next