Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs) | PDF | HTML | HuggingFace (1 upvote) | GitHub (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
CAT: Cross-Attention Transformer for One-Shot Object Detection
Weidong Lin
Yuyang Deng
Yang Gao
Ning Wang
Jinghao Zhou
Lingqiao Liu
Lei Zhang
Peng Wang
ViT
235
11
0
30 Apr 2021
Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2021
Shreyas Hampali
S. Sarkar
Mahdi Rad
Vincent Lepetit
3DH
429
161
0
29 Apr 2021
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
Neural Information Processing Systems (NeurIPS), 2021
Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin K. Wei
Huaxia Xia
Chunhua Shen
ViT
651
1,226
0
28 Apr 2021
Segmentation-Based Bounding Box Generation for Omnidirectional Pedestrian Detection
The Visual Computer (TVC), 2021
Masato Tamura
Tomoaki Yoshinaga
296
2
0
28 Apr 2021
ConTNet: Why not use convolution and transformer at the same time?
Haotian Yan
Zhe Li
Weijian Li
Changhu Wang
Ming Wu
Chuang Zhang
ViT
274
92
0
27 Apr 2021
Vision Transformers with Patch Diversification
Chengyue Gong
Dilin Wang
Meng Li
Vikas Chandra
Qiang Liu
ViT
257
68
0
26 Apr 2021
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
ACM Multimedia (ACM MM), 2021
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jianxiong Pan
Kaiwen Cui
Shijian Lu
Feiying Ma
Xuansong Xie
Chunyan Miao
ViT
499
168
0
26 Apr 2021
Visual Saliency Transformer
IEEE International Conference on Computer Vision (ICCV), 2021
Nian Liu
Ni Zhang
Kaiyuan Wan
Ling Shao
Junwei Han
ViT
563
446
0
25 Apr 2021
M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Tianrui Guan
Jun Wang
Shiyi Lan
Rohan Chandra
Zuxuan Wu
Larry S. Davis
Tianyi Zhou
ViT
3DPC
221
157
0
24 Apr 2021
Region-Adaptive Deformable Network for Image Quality Assessment
Shu Shi
Qingyan Bai
Ming Cao
Weihao Xia
Jiahao Wang
Yifan Chen
Yujiu Yang
141
25
0
23 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Neural Information Processing Systems (NeurIPS), 2021
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
396
237
0
22 Apr 2021
Generative Transformer for Accurate and Reliable Salient Object Detection
Yuxin Mao
Jing Zhang
Zhexiong Wan
Yuchao Dai
Aixuan Li
Yun-Qiu Lv
Xinyu Tian
Deng-Ping Fan
Nick Barnes
ViT
461
49
0
20 Apr 2021
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
International Conference on Multimedia Retrieval (ICMR), 2021
Junke Wang
Zuxuan Wu
Wenhao Ouyang
Xintong Han
Yue Yu
Ser-Nam Lim
Yu-Gang Jiang
ViT
309
349
0
20 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
621
442
0
17 Apr 2021
Vision Transformer Pruning
Mingjian Zhu
Yehui Tang
Kai Han
ViT
478
112
0
17 Apr 2021
Vision Transformer using Low-level Chest X-ray Feature Corpus for COVID-19 Diagnosis and Severity Quantification
Medical Image Analysis (MedIA), 2021
Sangjoon Park
Gwanghyun Kim
Y. Oh
J. Seo
Sang Min Lee
Jin Hwan Kim
Sungjun Moon
Jae-Kwang Lim
Jong Chul Ye
ViT
MedIm
236
111
0
15 Apr 2021
Co-Scale Conv-Attentional Image Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Weijian Xu
Yifan Xu
Tyler A. Chang
Zhuowen Tu
ViT
271
436
0
13 Apr 2021
HIH: Towards More Accurate Face Alignment via Heatmap in Heatmap
Xing Lan
Qinghao Hu
Qiang Chen
Jian Xue
Jian Cheng
CVBM
226
27
0
07 Apr 2021
Efficient DETR: Improving End-to-End Object Detector with Dense Prior
Z. Yao
Jiangbo Ai
Boxun Li
Fangqiu Yi
ViT
283
304
0
03 Apr 2021
TubeR: Tubelet Transformer for Video Action Detection
Computer Vision and Pattern Recognition (CVPR), 2021
Jiaojiao Zhao
Yanyi Zhang
Xinyu Li
Hao Chen
Shuai Bing
...
Yuanjun Xiong
Davide Modolo
I. Marsic
Cees G. M. Snoek
Joseph Tighe
ViT
348
93
0
02 Apr 2021
Bridging Global Context Interactions for High-Fidelity Image Completion
Computer Vision and Pattern Recognition (CVPR), 2021
Chuanxia Zheng
Tat-Jen Cham
Jianfei Cai
Dinh Q. Phung
ViT
169
101
0
02 Apr 2021
Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning
Fusion (Fusion), 2021
Juliano Pinto
Georg Hess
William Ljungbergh
Yuxuan Xia
Lennart Svensson
H. Wymeersch
299
24
0
01 Apr 2021
DA-DETR: Domain Adaptive Detection Transformer with Information Fusion
Computer Vision and Pattern Recognition (CVPR), 2021
Jingyi Zhang
Jiaxing Huang
Zhipeng Luo
Gongjie Zhang
Xiaoqin Zhang
Shijian Lu
ViT
233
58
0
31 Mar 2021
Rethinking Spatial Dimensions of Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Byeongho Heo
Sangdoo Yun
Dongyoon Han
Sanghyuk Chun
Junsuk Choe
Seong Joon Oh
ViT
1.5K
693
0
30 Mar 2021
CvT: Introducing Convolutions to Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
525
2,280
0
29 Mar 2021
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Hila Chefer
Shir Gur
Lior Wolf
ViT
358
412
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
439
176
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
IEEE International Conference on Computer Vision (ICCV), 2021
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
306
373
0
29 Mar 2021
TFPose: Direct Human Pose Estimation with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
ViT
218
103
0
29 Mar 2021
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Yihong Xu
Yutong Ban
Guillaume Delorme
Chuang Gan
Daniela Rus
Xavier Alameda-Pineda
VOT
349
146
0
28 Mar 2021
Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Tianyu Zhu
Markus Hiller
Mahsa Ehsanpour
Rongkai Ma
Tom Drummond
Ian Reid
Hamid Rezatofighi
VOT
360
56
0
27 Mar 2021
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation
IEEE Transactions on Multimedia (IEEE Trans. Multimedia), 2021
Shuheng Zhou
Hong Liu
Runwei Ding
Mengyuan Liu
Pichao Wang
Wenming Yang
ViT
583
254
0
26 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
3.1K
28,501
0
25 Mar 2021
USB: Universal-Scale Object Detection Benchmark
British Machine Vision Conference (BMVC), 2021
Yosuke Shinya
ObjD
356
17
0
25 Mar 2021
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
IEEE International Conference on Computer Vision (ICCV), 2021
Changlin Li
Tao Tang
Guangrun Wang
Jiefeng Peng
Bing Wang
Xiaodan Liang
Xiaojun Chang
ViT
407
120
0
23 Mar 2021
Conditional Training with Bounding Map for Universal Lesion Detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2021
Han Li
Long Chen
Hu Han
S. Kevin Zhou
MedIm
AI4CE
231
10
0
23 Mar 2021
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas Stoffl
Maxime Vidal
Alexander Mathis
ViT
195
55
0
22 Mar 2021
Incorporating Convolution Designs into Visual Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Kun Yuan
Shaopeng Guo
Ziwei Liu
Aojun Zhou
F. Yu
Wei Wu
ViT
300
566
0
22 Mar 2021
Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation
Gongjie Zhang
Zhipeng Luo
Kaiwen Cui
Shijian Lu
ViT
299
84
0
22 Mar 2021
Scalable Vision Transformers with Hierarchical Pooling
IEEE International Conference on Computer Vision (ICCV), 2021
Zizheng Pan
Bohan Zhuang
Jing Liu
Haoyu He
Jianfei Cai
ViT
230
146
0
19 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViT
MedIm
562
2,235
0
18 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
Chong Chen
Zhengming Ding
ViT
428
588
0
18 Mar 2021
Consistency-based Active Learning for Object Detection
Weiping Yu
Sijie Zhu
Taojiannan Yang
Chong Chen
ObjD
287
62
0
18 Mar 2021
Suppress-and-Refine Framework for End-to-End 3D Object Detection
Social Science Research Network (SSRN), 2021
Zili Liu
Guodong Xu
Honghui Yang
Minghao Chen
Kuoliang Wu
Zheng Yang
Haifeng Liu
Deng Cai
3DPC
159
4
0
18 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2021
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
472
476
0
14 Mar 2021
Probabilistic two-stage detection
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
ObjD
234
257
0
12 Mar 2021
Spatially Consistent Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2021
Byungseok Roh
Wuhyun Shin
Ildoo Kim
Sungwoong Kim
SSL
172
97
0
10 Mar 2021
TransMed: Transformers Advance Multi-modal Medical Image Classification
Yin Dai
Yifan Gao
ViT
MedIm
225
381
0
10 Mar 2021
SSTN: Self-Supervised Domain Adaptation Thermal Object Detection for Autonomous Driving
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Farzeen Munir
Shoaib Azam
M. Jeon
ViT
176
44
0
04 Mar 2021
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2021
Yutong Xie
Jianpeng Zhang
Chunhua Shen
Yong-quan Xia
ViT
MedIm
239
628
0
04 Mar 2021