Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.04159
Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 533 papers shown
Title
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
15
1,236
0
31 Mar 2022
TR-MOT: Multi-Object Tracking by Reference
Mingfei Chen
Yue Liao
Si Liu
Fei-Yue Wang
Jenq-Neng Hwang
VOT
28
9
0
30 Mar 2022
AdaMixer: A Fast-Converging Query-Based Object Detector
Ziteng Gao
Limin Wang
Bing Han
Sheng Guo
ObjD
22
104
0
30 Mar 2022
Omni-DETR: Omni-Supervised Object Detection with Transformers
Pei Wang
Zhaowei Cai
Hao Yang
Gurumurthy Swaminathan
Nuno Vasconcelos
Bernt Schiele
Stefano Soatto
13
40
0
30 Mar 2022
SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Hanjun Li
Xingjia Pan
Ke Yan
Fan Tang
Weihao Zheng
25
17
0
29 Mar 2022
ObjectFormer for Image Manipulation Detection and Localization
Junke Wang
Zuxuan Wu
Jingjing Chen
Xintong Han
Abhinav Shrivastava
Ser-Nam Lim
Yu-Gang Jiang
26
105
0
28 Mar 2022
Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
Giulio Lovisotto
Nicole Finnie
Mauricio Muñoz
Chaithanya Kumar Mummadi
J. H. Metzen
AAML
ViT
17
32
0
25 Mar 2022
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
19
16
0
25 Mar 2022
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Likun Cai
Zhi-Li Zhang
Yi Zhu
Li Zhang
Mu Li
Xiangyang Xue
VLM
ObjD
24
40
0
24 Mar 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
Hongsheng Li
ViT
MDE
37
82
0
24 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
22
261
0
22 Mar 2022
Open-Vocabulary DETR with Conditional Matching
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjD
VLM
4
196
0
22 Mar 2022
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
Xuyang Bai
Zeyu Hu
Xinge Zhu
Qingqiu Huang
Yilun Chen
Hongbo Fu
Chiew-Lan Tai
ViT
3DPC
11
576
0
22 Mar 2022
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows
Danyang Tu
Xiongkuo Min
Huiyu Duan
G. Guo
Guangtao Zhai
Wei Shen
ViT
22
24
0
20 Mar 2022
MatchFormer: Interleaving Attention in Transformers for Feature Matching
Qing Wang
Jiaming Zhang
Kailun Yang
Kunyu Peng
Rainer Stiefelhagen
ViT
31
141
0
17 Mar 2022
Towards Data-Efficient Detection Transformers
Wen Wang
Jing Zhang
Yang Cao
Yongliang Shen
Dacheng Tao
ViT
16
56
0
17 Mar 2022
Style Transformer for Image Inversion and Editing
Xueqi Hu
Qiusheng Huang
Zhengyi Shi
Siyuan Li
Changxin Gao
Li Sun
Qingli Li
22
55
0
15 Mar 2022
Progressive End-to-End Object Detection in Crowded Scenes
Anlin Zheng
Yuang Zhang
X. Zhang
Xiao Qi
Jian-jun Sun
ObjD
16
60
0
15 Mar 2022
Accelerating DETR Convergence via Semantic-Aligned Matching
Gongjie Zhang
Zhipeng Luo
Yingchen Yu
Kaiwen Cui
Shijian Lu
ViT
43
100
0
14 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu (Allen) Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
20
37
0
12 Mar 2022
Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Sudhir Yarram
Jialian Wu
Pan Ji
Yi Tian Xu
Junsong Yuan
ViT
11
2
0
12 Mar 2022
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
Yingfei Liu
Tiancai Wang
X. Zhang
Jian-jun Sun
3DPC
15
523
0
10 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
20
182
0
10 Mar 2022
CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction
Zhuoran Song
Yihong Xu
Zhezhi He
Li Jiang
Naifeng Jing
Xiaoyao Liang
ViT
18
39
0
09 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
17
131
0
08 Mar 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
8
1,358
0
07 Mar 2022
Exploring Dual-task Correlation for Pose Guided Person Image Generation
Peng Zhang
Lingxiao Yang
Jianhuang Lai
Xiaohua Xie
ViT
21
79
0
06 Mar 2022
Boosting Crowd Counting via Multifaceted Attention
Hui Lin
Zhiheng Ma
Rongrong Ji
Yaowei Wang
Xiaopeng Hong
23
145
0
05 Mar 2022
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
15
3
0
04 Mar 2022
Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Chaoxiang Ma
Simon Reiß
Kunyu Peng
Rainer Stiefelhagen
ViT
16
72
0
02 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
19
636
0
02 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
39
14
0
01 Mar 2022
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Zhuoning Yuan
Yuexin Wu
Zi-qi Qiu
Xianzhi Du
Lijun Zhang
Denny Zhou
Tianbao Yang
22
26
0
24 Feb 2022
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho
Sunghwan Hong
Seung Wook Kim
ViT
19
34
0
14 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
8
88
0
31 Jan 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
138
703
0
28 Jan 2022
RelTR: Relation Transformer for Scene Graph Generation
Yuren Cong
M. Yang
Bodo Rosenhahn
ViT
87
130
0
27 Jan 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
142
360
0
24 Jan 2022
Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUs
Tianxiao Zhang
Bo Luo
A. Sharda
Guanghui Wang
30
17
0
23 Jan 2022
Poseur: Direct Human Pose Regression with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
A. Hengel
ViT
25
80
0
19 Jan 2022
RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Runjian Chen
Wenping Wang
Ping Luo
CVBM
11
108
0
17 Jan 2022
Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation
Shuangjie Xu
Rui Wan
Maosheng Ye
Xiaoyi Zou
Tongyi Cao
3DPC
11
32
0
16 Jan 2022
RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark
Zhuo Deng
Yuanhao Cai
Lu Chen
Zheng Gong
Qiqi Bao
Xue Yao
D. Fang
Shaochong Zhang
Lan Ma
ViT
MedIm
18
52
0
03 Jan 2022
Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped Attention
Sitong Wu
Tianyi Wu
Hao Hao Tan
G. Guo
ViT
23
70
0
28 Dec 2021
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
23
14
0
26 Dec 2021
Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression
Zongyu Guo
Runsen Feng
Zhizheng Zhang
Xin Jin
Zhibo Chen
19
14
0
26 Dec 2021
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
VOS
22
25
0
16 Dec 2021
Short and Long Range Relation Based Spatio-Temporal Transformer for Micro-Expression Recognition
Liangfei Zhang
Xiaopeng Hong
Ognjen Arandjelovic
Guoying Zhao
ViT
20
45
0
10 Dec 2021
Fast Point Transformer
Chunghyun Park
Yoonwoo Jeong
Minsu Cho
Jaesik Park
3DPC
ViT
12
166
0
09 Dec 2021
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Yuxuan Liang
Pan Zhou
Roger Zimmermann
Shuicheng Yan
ViT
18
21
0
09 Dec 2021
Previous
1
2
3
...
10
11
8
9
Next