Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05751
Cited By
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Image Transformer"
50 / 277 papers shown
Title
Moving Towards Centers: Re-ranking with Attention and Memory for Re-identification
Yunhao Zhou
Yi Wang
Lap-Pui Chau
49
10
0
04 May 2021
AGMB-Transformer: Anatomy-Guided Multi-Branch Transformer Network for Automated Evaluation of Root Canal Therapy
Yunxiang Li
G. Zeng
Yifan Zhang
Jun Wang
Qianni Zhang
...
Neng Xia
Ruizi Peng
Kai Tang
Yaqi Wang
Shuai Wang
MedIm
AI4CE
92
28
0
02 May 2021
Inpainting Transformer for Anomaly Detection
Jonathan Pirnay
K. Chai
ViT
104
165
0
28 Apr 2021
ConTNet: Why not use convolution and transformer at the same time?
Haotian Yan
Zhe Li
Weijian Li
Changhu Wang
Ming Wu
Chuang Zhang
ViT
12
76
0
27 Apr 2021
Dual Transformer for Point Cloud Analysis
Xian-Feng Han
Yi-Fei Jin
Hui Cheng
Guoqiang Xiao
ViT
34
73
0
27 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
12
203
0
22 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
21
329
0
17 Apr 2021
Learning Position and Target Consistency for Memory-based Video Object Segmentation
Liucheng Hu
Peng Zhang
Bang Zhang
Pan Pan
Yinghui Xu
R. L. Jin
VOS
24
111
0
09 Apr 2021
Multiple Object Tracking with Correlation Learning
Qiang Wang
Yun Zheng
Pan Pan
Yinghui Xu
VOT
26
147
0
08 Apr 2021
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
VLM
ViT
34
271
0
07 Apr 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
23
986
0
31 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,086
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
23
328
0
29 Mar 2021
High-Fidelity Pluralistic Image Completion with Transformers
Ziyu Wan
Jingbo Zhang
Dongdong Chen
Jing Liao
ViT
23
231
0
25 Mar 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
36
1,659
0
24 Mar 2021
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Ashish Vaswani
Prajit Ramachandran
A. Srinivas
Niki Parmar
Blake A. Hechtman
Jonathon Shlens
16
395
0
23 Mar 2021
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
Changlin Li
Tao Tang
Guangrun Wang
Jiefeng Peng
Bing Wang
Xiaodan Liang
Xiaojun Chang
ViT
37
105
0
23 Mar 2021
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
Ning Wang
Wen-gang Zhou
Jie Wang
Houqiang Li
ViT
20
518
0
22 Mar 2021
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Mandela Patrick
Yuki M. Asano
Bernie Huang
Ishan Misra
Florian Metze
Joao Henriques
Andrea Vedaldi
AI4TS
16
33
0
18 Mar 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li
Jie Hu
Changhu Wang
Xiangtai Li
Qi She
Lei Zhu
Tong Zhang
Qifeng Chen
BDL
15
304
0
10 Mar 2021
Reformulating HOI Detection as Adaptive Set Prediction
Mingfei Chen
Yue Liao
Si Liu
Zhiyuan Chen
Fei-Yue Wang
Chao Qian
27
142
0
10 Mar 2021
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
Masato Tamura
Hiroki Ohashi
Tomoaki Yoshinaga
28
206
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
36
478
0
08 Mar 2021
SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation
Boxiang Yun
Yan Wang
Jieneng Chen
Huiyu Wang
Wei Shen
Qingli Li
ViT
MedIm
10
49
0
05 Mar 2021
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
48
973
0
04 Mar 2021
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
23
179
0
01 Mar 2021
Predicting times of waiting on red signals using BERT
Witold Szejgis
Anna Warno
P. Góra
21
1
0
20 Feb 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
60
3,511
0
18 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
267
179
0
17 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
Colorization Transformer
Manoj Kumar
Dirk Weissenborn
Nal Kalchbrenner
ViT
224
156
0
08 Feb 2021
Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond
Risheng Liu
Jiaxin Gao
Jin Zhang
Deyu Meng
Zhouchen Lin
AI4CE
43
221
0
27 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
14
176
0
25 Jan 2021
Maximum Likelihood Training of Score-Based Diffusion Models
Yang Song
Conor Durkan
Iain Murray
Stefano Ermon
DiffM
62
621
0
22 Jan 2021
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers
Siyi Hu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
OffRL
23
71
0
20 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
F. Khan
M. Shah
ViT
227
2,428
0
04 Jan 2021
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
34
613
0
17 Dec 2020
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
8
834
0
28 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDL
VLM
31
336
0
20 Nov 2020
ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis
Zhouyong Liu
S. Luo
Wubin Li
Jingben Lu
Yufan Wu
Shilei Sun
Chunguo Li
Luxi Yang
ViT
17
79
0
20 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
41
39,217
0
22 Oct 2020
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
107
59
0
20 Oct 2020
Rethinking Attention with Performers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Afroz Mohiuddin
Lukasz Kaiser
David Belanger
Lucy J. Colwell
Adrian Weller
8
1,517
0
30 Sep 2020
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
19
39
0
18 Sep 2020
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
74
1,101
0
14 Sep 2020
Autoregressive Unsupervised Image Segmentation
Yassine Ouali
C´eline Hudelot
Myriam Tami
SSL
24
86
0
16 Jul 2020
Can neural networks acquire a structural bias from raw linguistic data?
Alex Warstadt
Samuel R. Bowman
AI4CE
20
53
0
14 Jul 2020
Direct Feedback Alignment Scales to Modern Deep Learning Tasks and Architectures
Julien Launay
Iacopo Poli
Franccois Boniface
Florent Krzakala
25
62
0
23 Jun 2020
Locally Masked Convolution for Autoregressive Models
Ajay Jain
Pieter Abbeel
Deepak Pathak
DiffM
OffRL
30
31
0
22 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
25
266
0
09 Jun 2020
Previous
1
2
3
4
5
6
Next