Deformable DETR: Deformable Transformers for End-to-End Object Detection
Versions: v1, v2, v3, v4 (latest)


International Conference on Learning Representations (ICLR), 2021
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs) | PDF | HTML | HuggingFace (1 upvote) | GitHub (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
Background Fades, Foreground Leads: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception
Yuheng Wu
Xiangbo Gao
Quang Tau
Zhengzhong Tu
Dongman Lee
105
0
0
22 Oct 2025
A Unified Detection Pipeline for Robust Object Detection in Fisheye-Based Traffic Surveillance
Neema Jakisa Owor
Joshua Kofi Asamoah
Tanner Muturi
Anneliese Jakisa Owor
Blessing Agyei Kyem
Andrews Danyo
Y. Adu-Gyamfi
Armstrong Aboah
149
3
0
22 Oct 2025
Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
Arpan Maity
Tamal Ghosh
ObjD
128
0
0
21 Oct 2025
SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
Wenxun Wang
Shuchang Zhou
Wenyu Sun
Peiqin Sun
Y. Liu
138
38
0
20 Oct 2025
ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification
Akhila Kambhatla
Taminul Islam
Khaled R Ahmed
ViT
149
0
0
19 Oct 2025
Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
Shengkai Hu
Haozhe Qi
Jun Wan
Jiaxing Huang
Lefei Zhang
Hang Sun
Dacheng Tao
ViT
158
3
0
17 Oct 2025
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection
Haowei Zhu
Tianxiang Pan
Rui Qin
Jun-Hai Yong
Bin Wang
DiffM
168
1
0
17 Oct 2025
MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
Tingman Yan
Tao Liu
Xilian Yang
Qunfei Zhao
Zeyang Xia
3DV
220
0
0
16 Oct 2025
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning
Mattia Segu
Marta Tintore Gazulla
Yongqin Xian
Luc Van Gool
Federico Tombari
86
0
0
16 Oct 2025
Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion
IEEE International Conference on Robotics and Automation (ICRA), 2025
Rongtao Xu
Jinzhou Lin
Jialei Zhou
Jiahua Dong
Changwei Wang
Ruisheng Wang
Li Guo
Shibiao Xu
Xiaodan Liang
3DPC
178
0
0
15 Oct 2025
UniVector: Unified Vector Extraction via Instance-Geometry Interaction
Yinglong Yan
Jun Yue
Shaobo Xia
Hanmeng Sun
Tianxu Ying
Chengcheng Wu
Sifan Lan
Min He
Pedram Ghamisi
Leyuan Fang
113
0
0
15 Oct 2025
What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging
Inha Kang
Youngsun Lim
S. Lee
Jiho Choi
Junsuk Choe
Hyunjung Shim
99
0
0
15 Oct 2025
CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection
Huiming Yang
Wenzhuo Liu
Yicheng Qiao
Lei Yang
Xianzhu Zeng
...
Zhiwei Li
Zijian Zeng
Zhiying Jiang
Huaping Liu
Kunfeng Wang
216
0
0
14 Oct 2025
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking
Tianhao Li
Tingfa Xu
Ying Wang
Haolin Qin
Xu Lin
Jianan Li
119
0
0
14 Oct 2025
Detect Anything via Next Point Prediction
Qing Jiang
Junan Huo
Xingyu Chen
Yuda Xiong
Zhaoyang Zeng
Yihao Chen
Tianhe Ren
Junzhi Yu
Lei Zhang
ObjD
211
11
0
14 Oct 2025
Source-Free Object Detection with Detection Transformer
IEEE Transactions on Image Processing (IEEE TIP), 2025
Huizai Yao
Sicheng Zhao
Shuo Lu
Hui Chen
Yangyang Li
Guoping Liu
Tengfei Xing
C. Yan
Jianhua Tao
Guiguang Ding
ViT
90
3
0
13 Oct 2025
Unified Open-World Segmentation with Multi-Modal Prompts
Yang Liu
Yufei Yin
Chenchen Jing
M. Zhu
Hao Chen
Yuling Xi
Bo Feng
Hao Wang
Shiyu Li
Chunhua Shen
VLM
106
0
0
12 Oct 2025
Complementary and Contrastive Learning for Audio-Visual Segmentation
IEEE Transactions on Multimedia (TMM), 2025
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Pingping Zhang
Huchuan Lu
VOS
240
3
0
11 Oct 2025
KORMo: Korean Open Reasoning Model for Everyone
Minjun Kim
HyeonSeok Lim
Hangyeol Yoo
Inho Won
Seungwoo Song
...
Dongjae Shin
Huige Lee
Hoyun Song
Alice Oh
Kyungtae Lim
ALM, LRM
152
0
0
10 Oct 2025
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Weikai Huang
Jieyu Zhang
Taoyang Jia
Chenhao Zheng
Ziqi Gao
J. S. Park
Winson Han
Ranjay Krishna
226
0
0
10 Oct 2025
Utilizing dynamic sparsity on pretrained DETR
Reza Sedghi
Anand Subramoney
David Kappel
MoE
148
0
0
10 Oct 2025
Learning Global Representation from Queries for Vectorized HD Map Construction
Shoumeng Qiu
Xinrun Li
Yang Long
Xiangyang Xue
Varun Ojha
Jian Pu
111
1
0
08 Oct 2025
Deforming Videos to Masks: Flow Matching for Referring Video Segmentation
Zanyi Wang
Dengyang Jiang
Liuzhuozheng Li
Sizhe Dang
Chengzu Li
H. Yang
Guang Dai
Mengmeng Wang
Jingdong Wang
VOS, VGen
225
0
0
07 Oct 2025
Shaken or Stirred? An Analysis of MetaFormer's Token Mixing for Medical Imaging
Ron Keuth
Paul Kaftan
Mattias P. Heinrich
MedIm
185
0
0
07 Oct 2025
Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition
Yu Kiu
Chao-Yeh Chen
Ge Jin
Chen Feng
ViT
120
0
0
05 Oct 2025
Referring Expression Comprehension for Small Objects
Kanoko Goto
Takumi Hirose
Mahiro Ukai
Shuhei Kurita
Nakamasa Inoue
ObjD
146
1
0
04 Oct 2025
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
Jyoti Kini
Rohit Gupta
Mubarak Shah
ObjD, VLM
197
0
0
04 Oct 2025
Align Your Query: Representation Alignment for Multimodality Medical Object Detection
Ara Seo
Bryan S Kim
Hyungjin Chung
Jong Chul Ye
136
0
0
03 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
165
0
0
02 Oct 2025
Holistic Order Prediction in Natural Scenes
Pierre Musacchio
Hyunmin Lee
Jaesik Park
3DV
259
0
0
02 Oct 2025
A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety
Shucheng Zhang
Yan Shi
Bingzhang Wang
Yuang Zhang
Muhammad Monjurul Karim
Kehua Chen
Chenxi Liu
Mehrdad Nasri
Yinhai Wang
158
0
0
30 Sep 2025
Looking Beyond the Known: Towards a Data Discovery Guided Open-World Object Detection
Anay Majee
Amitesh Gangrade
Rishabh K. Iyer
117
0
0
30 Sep 2025
A Multi-Camera Vision-Based Approach for Fine-Grained Assembly Quality Control
Ali Nazeri
Shashank Mishra
A. Wagner
Martin Ruskowski
Didier Stricker
J. Rambach
118
0
0
28 Sep 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
Jiajin Tang
Zhengxuan Wei
Yuchen Zhu
Cheng Shi
Guanbin Li
Sibei Yang
PINN
301
1
0
28 Sep 2025
INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception
Y. Xu
Lingzhi Li
Jin Wang
Yupeng Ouyang
Benyuan Yang
82
0
0
28 Sep 2025
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
Siheng Wang
Zhengdao Li
Yanshu Li
Canran Xiao
Haibo Zhan
...
Zhikang Dong
Jifeng Shen
Junhao Dong
Qiang Sun
Piotr Koniusz
ObjD, VLM
257
6
0
27 Sep 2025
UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation
Jinghong Zheng
Changlong Jiang
Jiaqi Li
Haohong Kuang
Hang Xu
Tingbing Yan
3DH
112
0
0
27 Sep 2025
FMC-DETR: Frequency-Decoupled Multi-Domain Coordination for Aerial-View Object Detection
Ben Liang
Yuan Liu
Bingwen Qiu
Yihong Wang
Xiubao Sui
Qian Chen
170
0
0
27 Sep 2025
Motion-Aware Transformer for Multi-Object Tracking
Xu Yang
Gady Agam
VOT
391
0
0
26 Sep 2025
FSMODNet: A Closer Look at Few-Shot Detection in Multispectral Data
Manuel Nkegoum
M. Pham
Elisa Fromont
Bruno Avignon
Sébastien Lefèvre
147
1
0
25 Sep 2025
Real-Time Object Detection Meets DINOv3
Shihua Huang
Yongjie Hou
Longfei Liu
Xuanlong Yu
Xi Shen
ObjD, 3DH, PINN, VLM
375
6
0
25 Sep 2025
Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models
Juana Valeria Hurtado
Rohit Mohan
Abhinav Valada
177
1
0
24 Sep 2025
Knowledge Transfer from Interaction Learning
Yilin Gao
Kangyi Chen
Zhongxing Peng
Hengjie Lu
Shugong Xu
VLM
125
0
0
23 Sep 2025
Frequency-Domain Decomposition and Recomposition for Robust Audio-Visual Segmentation
Yunzhe Shen
Kai Peng
Leiye Liu
Wei Ji
Jingjing Li
Miao Zhang
Yongri Piao
Huchuan Lu
VOS
209
0
0
23 Sep 2025
SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines
Pamela Osuna-Vargas
Altug Kamacioglu
Dominik F. Aschauer
Petros E. Vlachos
Sercan Alipek
Jochen Triesch
Simon Rumpel
Matthias Kaschube
110
0
0
23 Sep 2025
MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving
Yuzhi Wu
Li Xiao
Jun Liu
Guangfeng Jiang
X. Xia
121
0
0
23 Sep 2025
Track-On2: Enhancing Online Point Tracking with Memory
Görkay Aydemir
Weidi Xie
Fatma Guney
VOT, 3DV
238
0
0
23 Sep 2025
NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment
Ajay Narayanan Sridhar
Fuli Qiao
Nelson Daniel Troncoso Aldas
Yanpei Shi
M. Mahdavi
Laurent Itti
V. Narayanan
105
1
0
23 Sep 2025
Visual Instruction Pretraining for Domain-Specific Foundation Models
Yuxuan Li
Y. Zhang
Wenhao Tang
Yimian Dai
Ming-Ming Cheng
Xiang Li
Jian Yang
LRM
289
3
0
22 Sep 2025
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
Buyin Deng
Lingxin Huang
Kai Luo
Fei Teng
Kailun Yang
VOT
264
1
0
22 Sep 2025