ResearchTrend.AI
Deformable DETR: Deformable Transformers for End-to-End Object Detection

8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 533 papers shown
Recurrent Glimpse-based Decoder for Detection with Transformer
Zhe Chen
Jing Zhang
Dacheng Tao
ViT
11
27
0
09 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
17
2,245
0
02 Dec 2021
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors
Yichun Shen
Wanli Jiang
Zhen Xu
Rundong Li
Junghyun Kwon
Siyi Li
ObjD
14
9
0
01 Dec 2021
CT-block: a novel local and global features extractor for point cloud
Shangwei Guo
Jun Li
Zhengchao Lai
Xiantong Meng
Shaokun Han
ViT
3DPC
11
1
0
30 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
20
644
0
29 Nov 2021
On the Integration of Self-Attention and Convolution
Xuran Pan
Chunjiang Ge
Rui Lu
S. Song
Guanfu Chen
Zeyi Huang
Gao Huang
SSL
12
281
0
29 Nov 2021
Building extraction with vision transformer
Libo Wang
Shenghui Fang
Rui Li
Xiaoliang Meng
ViT
12
156
0
29 Nov 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
12
6
0
26 Nov 2021
BoxeR: Box-Attention for 2D and 3D Transformers
Duy-Kien Nguyen
Jihong Ju
Olaf Booij
Martin R. Oswald
Cees G. M. Snoek
ViT
23
36
0
25 Nov 2021
PU-Transformer: Point Cloud Upsampling Transformer
Shi Qiu
Saeed Anwar
Nick Barnes
3DPC
ViT
19
50
0
24 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
66
325
0
11 Nov 2021
Blending Anti-Aliasing into Vision Transformer
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
13
20
0
28 Oct 2021
TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Haoyu Ma
Liangjian Chen
Deying Kong
Zhe Wang
Xingwei Liu
Hao Tang
Xiangyi Yan
Yusheng Xie
Shi-yao Lin
Xiaohui Xie
ViT
19
61
0
18 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
69
168
0
16 Oct 2021
Transformer for Polyp Detection
Shijie Liu
Hongyu Zhou
Xiaozhou Shi
Junwen Pan
ViT
MedIm
10
4
0
14 Oct 2021
Object DGCNN: 3D Object Detection using Dynamic Graphs
Yue Wang
Justin Solomon
3DPC
143
103
0
13 Oct 2021
ViDT: An Efficient and Effective Fully Transformer-based Object Detector
Hwanjun Song
Deqing Sun
Sanghyuk Chun
Varun Jampani
Dongyoon Han
Byeongho Heo
Wonjae Kim
Ming-Hsuan Yang
78
75
0
08 Oct 2021
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV
ViT
27
147
0
07 Oct 2021
Sound Event Detection Transformer: An Event-based End-to-End Model for Sound Event Detection
Zhi-qin Ye
Xiangdong Wang
Hong Liu
Yueliang Qian
Ruijie Tao
Long Yan
Kazushige Ouchi
ViT
19
15
0
05 Oct 2021
Bringing Generalization to Deep Multi-View Pedestrian Detection
Jeet K. Vora
Swetanjal Dutta
Kanishk Jain
Shyamgopal Karthik
Vineet Gandhi
8
5
0
24 Sep 2021
End-to-End Dense Video Grounding via Parallel Regression
Fengyuan Shi
Weilin Huang
Limin Wang
30
10
0
23 Sep 2021
PnP-DETR: Towards Efficient Visual Analysis with Transformers
Tao Wang
Li Yuan
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
ViT
11
81
0
15 Sep 2021
Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation
Ziniu Wan
Zhengjia Li
Maoqing Tian
Jianbo Liu
Shuai Yi
Hongsheng Li
3DH
22
80
0
06 Sep 2021
TE-YOLOF: Tiny and efficient YOLOF for blood cell detection
Fanxin Xu
Xiangkui Li
Hang Yang
Yali Wang
Wei Xiang
19
27
0
27 Aug 2021
A Comparison of Deep Saliency Map Generators on Multispectral Data in Object Detection
Jens Bayer
David Munch
Michael Arens
3DPC
17
3
0
26 Aug 2021
Deep neural networks approach to microbial colony detection -- a comparative analysis
Sylwia Majchrowska
J. Pawlowski
Natalia Czerep
Aleksander Górecki
Jakub Kuciński
Tomasz Golan
8
5
0
23 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
24
76
0
20 Aug 2021
End-to-End Dense Video Captioning with Parallel Decoding
Teng Wang
Ruimao Zhang
Zhichao Lu
Feng Zheng
Ran Cheng
Ping Luo
3DV
25
179
0
17 Aug 2021
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
27
16
0
08 Aug 2021
Automatic Rail Component Detection Based on AttnConv-Net
Tian-hu Wang
Zijun Zhang
Fangfang Yang
K. Tsui
13
11
0
05 Aug 2021
Armour: Generalizable Compact Self-Attention for Vision Transformers
Lingchuan Meng
ViT
11
3
0
03 Aug 2021
AGAR a microbial colony dataset for deep learning detection
Sylwia Majchrowska
J. Pawlowski
Grzegorz Gula
T. Bonus
Agata Hanas
Adam Loch
A. Pawlak
J. Roszkowiak
Tomasz Golan
Z. Drulis-Kawa
14
24
0
03 Aug 2021
HiFT: Hierarchical Feature Transformer for Aerial Tracking
Ziang Cao
Changhong Fu
Junjie Ye
Bowen Li
Yiming Li
13
194
0
31 Jul 2021
DPT: Deformable Patch-based Transformer for Visual Recognition
Zhiyang Chen
Yousong Zhu
Chaoyang Zhao
Guosheng Hu
Wei Zeng
Jinqiao Wang
Ming Tang
ViT
12
98
0
30 Jul 2021
A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu
Yi Zhu
Li Zhang
Chongruo Wu
Yanwei Fu
Mu Li
ViT
11
29
0
29 Jul 2021
PPT Fusion: Pyramid Patch Transformer for a Case Study in Image Fusion
Yu Fu
Tianyang Xu
Xiaojun Wu
J. Kittler
ViT
17
37
0
29 Jul 2021
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
11
115
0
19 Jul 2021
Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark
Haopeng Li
Lingbo Liu
Kunlin Yang
Shinan Liu
Junyuan Gao
Bin Zhao
Rui Zhang
Jun Hou
37
14
0
19 Jul 2021
RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Ha Hong
Jingfeng Zhang
Yuan He
Hui Xue
ViT
11
97
0
17 Jul 2021
Semi-Supervised Object Detection with Adaptive Class-Rebalancing Self-Training
Fangyuan Zhang
Tianxiang Pan
Bin Wang
26
54
0
11 Jul 2021
GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen
Peixia Li
Chuming Li
Baopu Li
Lei Bai
Chen Lin
Ming-hui Sun
Junjie Yan
Wanli Ouyang
ViT
17
85
0
07 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
17
422
0
01 Jul 2021
CBNet: A Composite Backbone Network Architecture for Object Detection
Tingting Liang
Xiao Chu
Yudong Liu
Yongtao Wang
Zhi Tang
Wei Chu
Jingdong Chen
Haibin Ling
ObjD
13
161
0
01 Jul 2021
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
11
356
0
28 Jun 2021
Post-Training Quantization for Vision Transformer
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
11
319
0
27 Jun 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
22
88
0
25 Jun 2021
IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers
Bowen Pan
Rameswar Panda
Yifan Jiang
Zhangyang Wang
Rogerio Feris
A. Oliva
VLM
ViT
12
153
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
16
13
0
23 Jun 2021
P2T: Pyramid Pooling Transformer for Scene Understanding
Yu-Huan Wu
Yun-Hai Liu
Xin Zhan
Ming-Ming Cheng
ViT
13
218
0
22 Jun 2021
Towards Biologically Plausible Convolutional Networks
Roman Pogodin
Yash Mehta
Timothy Lillicrap
P. Latham
11
22
0
22 Jun 2021