ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 1,501 papers shown
Title
On the Adversarial Robustness of Vision Transformers
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
15
137
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for
  High-Resolution Image Encoding
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
21
328
0
29 Mar 2021
TransCenter: Transformers with Dense Representations for Multiple-Object
  Tracking
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking
Yihong Xu
Yutong Ban
Guillaume Delorme
Chuang Gan
Daniela Rus
Xavier Alameda-Pineda
VOT
25
91
0
28 Mar 2021
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose
  Estimation
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation
Wenhao Li
Hong Liu
Runwei Ding
Mengyuan Liu
Pichao Wang
Wenming Yang
ViT
10
188
0
26 Mar 2021
COTR: Correspondence Transformer for Matching Across Images
COTR: Correspondence Transformer for Matching Across Images
Wei Jiang
Eduard Trulls
J. Hosang
Andrea Tagliasacchi
K. M. Yi
ViT
42
258
0
25 Mar 2021
High-Fidelity Pluralistic Image Completion with Transformers
High-Fidelity Pluralistic Image Completion with Transformers
Ziyu Wan
Jingbo Zhang
Dongdong Chen
Jing Liao
ViT
11
231
0
25 Mar 2021
Instance-level Image Retrieval using Reranking Transformers
Instance-level Image Retrieval using Reranking Transformers
Fuwen Tan
Jiangbo Yuan
Vicente Ordonez
ViT
8
89
0
22 Mar 2021
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas Stoffl
Maxime Vidal
Alexander Mathis
ViT
18
49
0
22 Mar 2021
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual
  Tracking
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
Ning Wang
Wen-gang Zhou
Jie Wang
Houqiang Li
ViT
11
516
0
22 Mar 2021
ConViT: Improving Vision Transformers with Soft Convolutional Inductive
  Biases
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Stéphane dÁscoli
Hugo Touvron
Matthew L. Leavitt
Ari S. Morcos
Giulio Biroli
Levent Sagun
ViT
18
800
0
19 Mar 2021
Scalable Vision Transformers with Hierarchical Pooling
Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan
Bohan Zhuang
Jing Liu
Haoyu He
Jianfei Cai
ViT
20
126
0
19 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
C. L. P. Chen
Zhengming Ding
ViT
13
436
0
18 Mar 2021
Hierarchical Attention-based Age Estimation and Bias Estimation
Hierarchical Attention-based Age Estimation and Bias Estimation
Shakediel Hiba
Y. Keller
CVBM
18
10
0
17 Mar 2021
You Only Look One-level Feature
You Only Look One-level Feature
Qiang Chen
Yingming Wang
Tong Yang
X. Zhang
Jian Cheng
Jian-jun Sun
ObjD
24
506
0
17 Mar 2021
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small
  Object Detection
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection
Chenhongyi Yang
Zehao Huang
Naiyan Wang
ObjD
16
224
0
16 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
22
373
0
14 Mar 2021
Probabilistic two-stage detection
Probabilistic two-stage detection
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
ObjD
25
223
0
12 Mar 2021
Unknown Object Segmentation from Stereo Images
Unknown Object Segmentation from Stereo Images
M. Durner
W. Boerdijk
M. Sundermeyer
W. Friedl
Zoltán-Csaba Márton
Rudolph Triebel
17
34
0
11 Mar 2021
Involution: Inverting the Inherence of Convolution for Visual
  Recognition
Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li
Jie Hu
Changhu Wang
Xiangtai Li
Qi She
Lei Zhu
Tong Zhang
Qifeng Chen
BDL
11
303
0
10 Mar 2021
U-Net Transformer: Self and Cross Attention for Medical Image
  Segmentation
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
Olivier Petit
Nicolas Thome
Clément Rambour
L. Soler
ViT
MedIm
12
236
0
10 Mar 2021
QPIC: Query-Based Pairwise Human-Object Interaction Detection with
  Image-Wide Contextual Information
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
Masato Tamura
Hiroki Ohashi
Tomoaki Yoshinaga
28
206
0
09 Mar 2021
Causal Attention for Vision-Language Tasks
Causal Attention for Vision-Language Tasks
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
23
147
0
05 Mar 2021
Perceiver: General Perception with Iterative Attention
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
48
972
0
04 Mar 2021
Domain Generalization: A Survey
Domain Generalization: A Survey
Kaiyang Zhou
Ziwei Liu
Yu Qiao
Tao Xiang
Chen Change Loy
OOD
AI4CE
14
979
0
03 Mar 2021
Panoramic Panoptic Segmentation: Towards Complete Surrounding
  Understanding via Unsupervised Contrastive Learning
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
32
36
0
01 Mar 2021
Transformer in Transformer
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
282
1,518
0
27 Feb 2021
Localization Distillation for Dense Object Detection
Localization Distillation for Dense Object Detection
Zhaohui Zheng
Rongguang Ye
Ping Wang
Dongwei Ren
W. Zuo
Qibin Hou
Ming-Ming Cheng
ObjD
96
114
0
24 Feb 2021
Predicting times of waiting on red signals using BERT
Predicting times of waiting on red signals using BERT
Witold Szejgis
Anna Warno
P. Góra
11
1
0
20 Feb 2021
RMS-Net: Regression and Masking for Soccer Event Spotting
RMS-Net: Regression and Masking for Soccer Event Spotting
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
27
28
0
15 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,978
0
09 Feb 2021
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector
  Representation for 3D Object Detection
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
3DPC
135
402
0
31 Jan 2021
Augmenting Proposals by the Detector Itself
Augmenting Proposals by the Detector Itself
Xiaopei Wan
Zhenhua Guo
Chao He
Yujiu Yang
Fangbo Tao
ObjD
26
2
0
28 Jan 2021
Channel Boosting Feature Ensemble for Radar-based Object Detection
Channel Boosting Feature Ensemble for Radar-based Object Detection
Shoaib Azam
Farzeen Munir
M. Jeon
24
7
0
10 Jan 2021
MSED: a multi-modal sleep event detection model for clinical sleep
  analysis
MSED: a multi-modal sleep event detection model for clinical sleep analysis
Alexander Neergaard Zahid
P. Jennum
Emmanuel Mignot
H. Sørensen
25
10
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
564
0
31 Dec 2020
SceneFormer: Indoor Scene Generation with Transformers
SceneFormer: Indoor Scene Generation with Transformers
Xinpeng Wang
Chandan Yeshwanth
Matthias Nießner
ViT
3DPC
16
145
0
17 Dec 2020
DETR for Crowd Pedestrian Detection
DETR for Crowd Pedestrian Detection
Matthieu Lin
Chuming Li
Xingyuan Bu
Ming-hui Sun
Chen Lin
Junjie Yan
Wanli Ouyang
Zhidong Deng
ViT
10
11
0
12 Dec 2020
MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from
  Monocular RGB Videos
MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos
Kejie Li
Hamid Rezatofighi
Ian Reid
53
25
0
09 Dec 2020
Detecting 32 Pedestrian Attributes for Autonomous Vehicles
Detecting 32 Pedestrian Attributes for Autonomous Vehicles
Taylor Mordan
Matthieu Cord
P. Pérez
Alexandre Alahi
22
22
0
04 Dec 2020
Single-shot Path Integrated Panoptic Segmentation
Single-shot Path Integrated Panoptic Segmentation
Sukjun Hwang
Seoung Wug Oh
Seon Joo Kim
27
9
0
03 Dec 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for
  Evaluating Visual Detection Tasks
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks
Kemal Oksuz
Baris Can Cam
Sinan Kalkan
Emre Akbas
19
32
0
21 Nov 2020
Dense Label Encoding for Boundary Discontinuity Free Rotation Detection
Dense Label Encoding for Boundary Discontinuity Free Rotation Detection
Xue Yang
Liping Hou
Yue Zhou
Wentao Wang
Junchi Yan
27
239
0
19 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain
  Human-like Biases
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases
Ryan Steed
Aylin Caliskan
SSL
17
156
0
28 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
8
38,811
0
22 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
6
54
0
15 Oct 2020
Efficient Transformers: A Survey
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
63
1,097
0
14 Sep 2020
Visual Transformers: Token-based Image Representation and Processing for
  Computer Vision
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
Bichen Wu
Chenfeng Xu
Xiaoliang Dai
Alvin Wan
Peizhao Zhang
Zhicheng Yan
M. Tomizuka
Joseph E. Gonzalez
Kurt Keutzer
Peter Vajda
ViT
13
543
0
05 Jun 2020
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
Bin Ren
Mengyuan Liu
Runwei Ding
Hong Liu
19
120
0
14 Feb 2020
Learn to Predict Sets Using Feed-Forward Neural Networks
Learn to Predict Sets Using Feed-Forward Neural Networks
H. Rezatofighi
Tianyu Zhu
Roman Kaskman
F. Motlagh
Javen Qinfeng Shi
Anton Milan
Daniel Cremers
Laura Leal-Taixé
Ian Reid
SSL
46
15
0
30 Jan 2020
An Exploration of Embodied Visual Exploration
An Exploration of Embodied Visual Exploration
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
25
98
0
07 Jan 2020
Previous
123...293031
Next