Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
YOLOA: Real-Time Affordance Detection via LLM Adapter
Yuqi Ji
Junjie Ke
Lihuo He
J. Liu
Kaifan Zhang
Yu-Kun Lai
Guiguang Ding
Xinbo Gao
136
0
0
03 Dec 2025
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
T. Monninger
Zihan Zhang
Steffen Staab
Sihao Ding
136
1
0
03 Dec 2025
From Detection to Association: Learning Discriminative Object Embeddings for Multi-Object Tracking
Yuqing Shao
Yuchen Yang
Rui Yu
Weilong Li
Xu Guo
HuaiCheng Yan
Wei Wang
Xiao Sun
VOT
314
0
0
02 Dec 2025
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Yifan Zhou
Takehiko Ohkawa
Guwenxiao Zhou
Kanoko Goto
Takumi Hirose
Yusuke Sekikawa
Nakamasa Inoue
3DH
Mamba
428
0
0
02 Dec 2025
ViT
3
^3
3
: Unlocking Test-Time Training in Vision
Dongchen Han
Y. Li
Tianyu Li
Z. Cao
Ziming Wang
Jun Song
Yu Cheng
Bo Zheng
Gao Huang
ViT
72
0
0
01 Dec 2025
Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery
Zhicheng Zhao
Y. Huang
Lingma Sun
Chenglong Li
Jin Tang
61
0
0
01 Dec 2025
ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark
Joanne Lin
Ruirui Lin
Yini Li
David Bull
Nantheera Anantrasirichai
161
0
0
01 Dec 2025
ReactionMamba: Generating Short &Long Human Reaction Sequences
Hajra Anwar Beg
Baptiste Chopin
Hao Tang
Mohamed Daoudi
Mamba
183
0
0
28 Nov 2025
Visual-Geometry Diffusion Policy: Robust Generalization via Complementarity-Aware Multimodal Fusion
Yikai Tang
Haoran Geng
Sheng Zang
Pieter Abbeel
Jitendra Malik
57
0
0
27 Nov 2025
UMind-VL: A Generalist Ultrasound Vision-Language Model for Unified Grounded Perception and Comprehensive Interpretation
Dengbo Chen
Ziwei Zhao
Kexin Zhang
Shishuang Zhao
J. Hou
...
AnLan Sun
Fei Gao
Jia Ding
Y. Liu
Dong Wang
VLM
125
0
0
27 Nov 2025
Exploring State-of-the-art models for Early Detection of Forest Fires
Sharjeel Ahmed
Daim Armaghan
Fatima Naweed
Umair Yousaf
Ahmad Zubair
Murtaza Taj
80
0
0
25 Nov 2025
SelfMOTR: Revisiting MOTR with Self-Generating Detection Priors
Fabian Gülhan
Emil Mededovic
Yuli Wu
Johannes Stegmaier
VOT
ViT
362
0
0
25 Nov 2025
HybriDLA: Hybrid Generation for Document Layout Analysis
Yufan Chen
Omar Moured
R. Liu
Junwei Zheng
Kunyu Peng
Jiaming Zhang
Rainer Stiefelhagen
82
0
0
25 Nov 2025
From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving
Yongqi Zhu
Morui Zhu
Qi Chen
Deyuan Qu
Song Fu
Q. Yang
Qing Yang
240
0
0
24 Nov 2025
QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Adam Lilja
Ji Lan
Junsheng Fu
Lars Hammarstrand
3DPC
213
1
0
21 Nov 2025
Physically Realistic Sequence-Level Adversarial Clothing for Robust Human-Detection Evasion
Dingkun Zhou
Patrick P. K. Chan
Hengxu Wu
Shikang Zheng
Ruiqi Huang
Yuanjie Zhao
AAML
172
0
0
20 Nov 2025
Graph Query Networks for Object Detection with Automotive Radar
Loveneet Saini
Hasan Tercan
Tobias Meisen
GNN
240
0
0
19 Nov 2025
MaskMed: Decoupled Mask and Class Prediction for Medical Image Segmentation
Bin Xie
Gady Agam
MedIm
302
0
0
19 Nov 2025
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers
Gihwan Kim
Jemin Lee
Hyungshin Kim
MQ
132
0
0
19 Nov 2025
Enhancing End-to-End Autonomous Driving with Risk Semantic Distillaion from VLM
Jack Qin
Zhitao Wang
Yinan Zheng
Keyu Chen
Yang Zhou
Yuanxin Zhong
Siyuan Cheng
136
0
0
18 Nov 2025
QUILL: An Algorithm-Architecture Co-Design for Cache-Local Deformable Attention
Hyunwoo Oh
Hanning Chen
Sanggeon Yun
Yang Ni
Wenjun Huang
Tamoghno Das
Suyeon Jang
Mohsen Imani
VLM
162
0
0
17 Nov 2025
Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving
Jiacheng Tang
Mingyue Feng
Jiachao Liu
Yaonong Wang
Jian Pu
195
0
0
17 Nov 2025
Monocular 3D Lane Detection via Structure Uncertainty-Aware Network with Curve-Point Queries
Ruixin Liu
Zejian Yuan
UQCV
271
0
0
17 Nov 2025
Fine-Grained Representation for Lane Topology Reasoning
Guoqing Xu
Y. Li
Yang Yang
155
0
0
16 Nov 2025
RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving
Ruiqi Cheng
Huijun Di
Jian Li
Feng Liu
Wei Liang
155
0
0
15 Nov 2025
SOTFormer: A Minimal Transformer for Unified Object Tracking and Trajectory Prediction
Zhongping Dong
Pengyang Yu
Shuangjian Li
L. Chen
Mohand Tahar Kechadi
76
0
0
14 Nov 2025
High-Quality Proposal Encoding and Cascade Denoising for Imaginary Supervised Object Detection
Zhiyuan Chen
Yuelin Guo
Zitong Huang
Haoyu He
Renhao Lu
Weizhe Zhang
88
0
0
11 Nov 2025
RAPTR: Radar-based 3D Pose Estimation using Transformer
Sorachi Kato
Ryoma Yataka
Pu Perry Wang
Pedro Miraldo
T. Fujihashi
P. Boufounos
80
0
0
11 Nov 2025
Automatic Music Mixing using a Generative Model of Effect Embeddings
Eloi Moliner
Marco A. Martínez-Ramírez
Junghyun Koo
Wei-Hsiang Liao
K. Cheuk
Joan Serrà
Vesa Valimaki
Yuki Mitsufuji
DiffM
166
0
0
11 Nov 2025
Twist and Compute: The Cost of Pose in 3D Generative Diffusion
Kyle Fogarty
Jack Foster
Boqiao Zhang
Jing Yang
Cengiz Öztireli
DiffM
144
0
0
11 Nov 2025
Beyond Boundaries: Leveraging Vision Foundation Models for Source-Free Object Detection
Huizai Yao
Sicheng Zhao
Pengteng Li
Yi Cui
Shuo Lu
Weiyu Guo
Yunfan Lu
Ziyang Chen
Hui Xiong
VLM
110
0
0
10 Nov 2025
On Modality Incomplete Infrared-Visible Object Detection: An Architecture Compatibility Perspective
Shuo Yang
Yinghui Xing
Shizhou Zhang
Zhilong Niu
91
0
0
09 Nov 2025
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Zijie Wang
Weiming Zhang
Wei Zhang
Xiao Tan
Hongxing Liu
Y. X. R. Wang
G. Li
DiffM
181
0
0
09 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
155
0
0
08 Nov 2025
MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation
Ziyi Wang
Yuanmei Zhang
Dorna Esrafilzadeh
Ali R. Jalili
Suncheng Xiang
155
0
0
03 Nov 2025
Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion
Jaehyun Park
Konyul Park
Daehun Kim
Junseo Park
Jun-Won Choi
108
0
0
02 Nov 2025
OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback
Kai Luo
Hao Shi
Kunyu Peng
Fei Teng
Sheng Wu
Kaiwei Wang
Kailun Yang
122
0
0
01 Nov 2025
MLPerf Automotive
Radoyeh Shojaei
Predrag Djurdjevic
Mostafa El-Khamy
James Goel
Kasper Mecklenburg
John Owens
Pınar Muyan-Özçelik
T. S. John
Jinho Suh
Arjun Suresh
VLM
132
0
0
31 Oct 2025
Parameterized Prompt for Incremental Object Detection
Zijia An
Boyu Diao
R. Liu
Libo Huang
Chuanguang Yang
Fei Wang
Zhulin An
Yongjun Xu
CLL
VLM
206
0
0
31 Oct 2025
Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges
Kemal Oksuz
Alexandru Buburuzan
Anthony Knittel
Yuhan Yao
P. Dokania
92
0
0
31 Oct 2025
SA
2
^{2}
2
Net: Scale-Adaptive Structure-Affinity Transformation for Spine Segmentation from Ultrasound Volume Projection Imaging
Hao Xie
Zixun Huang
Yushen Zuo
Yakun Ju
F. Leung
N. F. Law
Kin-Man Lam
Y. Zheng
Sai Ho Ling
98
0
0
30 Oct 2025
GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction
Yang Jin
Guangyu Guo
Binglu Wang
103
0
0
29 Oct 2025
FruitProm: Probabilistic Maturity Estimation and Detection of Fruits and Vegetables
Sidharth Rai
Rahul Harsha Cheppally
Benjamin Vail
Keziban Yalçın Dokumacı
Ajay Sharda
134
0
0
28 Oct 2025
SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration
Jongsuk Kim
Jaeyoung Lee
Gyojin Han
Dongjae Lee
Minki Jeong
Junmo Kim
110
0
0
28 Oct 2025
MIC-BEV: Multi-Infrastructure Camera Bird's-Eye-View Transformer with Relation-Aware Fusion for 3D Object Detection
Yun Zhang
Zhaoliang Zheng
Johnson Liu
Zhiyu Huang
Zewei Zhou
Zonglin Meng
Tianhui Cai
Jiaqi Ma
130
0
0
28 Oct 2025
DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios
Ziyu Wang
Wenhao Li
Ji Wu
111
0
0
27 Oct 2025
DAMap: Distance-aware MapNet for High Quality HD Map Construction
Jinpeng Dong
Chen Li
Yutong Lin
Jingwen Fu
Sanping Zhou
N. Zheng
130
1
0
26 Oct 2025
Dynamic Semantic-Aware Correlation Modeling for UAV Tracking
Xinyu Zhou
Tongxin Pan
Lingyi Hong
Pinxue Guo
Haijing Guo
Zhaoyu Chen
Kaixun Jiang
Wenqiang Zhang
80
0
0
24 Oct 2025
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu
Yang Xiao
Wenjing Jia
Guangwei Gao
Guo-Jun Qi
Chia-Wen Lin
Mamba
216
0
0
24 Oct 2025
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts
Yanguang Sun
Jiawei Lian
Jian Yang
Lei Luo
123
1
0
24 Oct 2025
1
2
3
4
...
54
55
56
Next