2108.08810
Do Vision Transformers See Like Convolutional Neural Networks?
19 August 2021
M. Raghu
Thomas Unterthiner
Simon Kornblith
Chiyuan Zhang
Alexey Dosovitskiy
ViT
Papers citing
"Do Vision Transformers See Like Convolutional Neural Networks?"
50 / 440 papers shown
UGformer for Robust Left Atrium and Scar Segmentation Across Scanners
Tianyi Liu
Size Hou
Jiayu Zhu
Zilong Zhao
Haochuan Jiang
MedIm
14
2
0
11 Oct 2022
Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap
Mathias Lechner
Ramin Hasani
Alexander Amini
Tsun-Hsuan Wang
T. Henzinger
Daniela Rus
CML
OOD
21
7
0
09 Oct 2022
Strong Gravitational Lensing Parameter Estimation with Vision Transformer
Kuan-Wei Huang
G. Chen
Po-Wen Chang
Sheng-Chieh Lin
C. Hsu
Vishal Thengane
J. Lin
24
7
0
09 Oct 2022
The Lie Derivative for Measuring Learned Equivariance
Nate Gruver
Marc Finzi
Micah Goldblum
A. Wilson
16
34
0
06 Oct 2022
Towards Flexible Inductive Bias via Progressive Reparameterization Scheduling
Yunsung Lee
Gyuseong Lee
Kwang-seok Ryoo
Hyojun Go
Jihye Park
Seung Wook Kim
24
5
0
04 Oct 2022
Dual-former: Hybrid Self-attention Transformer for Efficient Image Restoration
Sixiang Chen
Tian-Chun Ye
Yun-Peng Liu
Erkang Chen
ViT
26
15
0
03 Oct 2022
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models
Songsong Xiong
Georgios Tziafas
H. Kasaei
ViT
23
3
0
03 Oct 2022
A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition
Kyuhong Shim
Wonyong Sung
25
2
0
01 Oct 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
26
313
0
25 Sep 2022
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang
Yangxin Wu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
22
31
0
19 Sep 2022
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography
Rand Muhtaseb
Mohammad Yaqub
ViT
19
24
0
09 Sep 2022
Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students
Xueye Zheng
Yuan Luo
Hao Wang
Chong Fu
Lin Wang
ViT
36
17
0
06 Sep 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
23
9
0
31 Aug 2022
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment
Mustafa Shukor
Guillaume Couairon
Matthieu Cord
VLM
CLIP
19
27
0
29 Aug 2022
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
Mengqi Xue
Qihan Huang
Haofei Zhang
Lechao Cheng
Jie Song
Ming-hui Wu
Mingli Song
ViT
25
52
0
22 Aug 2022
Prompt Vision Transformer for Domain Generalization
Zangwei Zheng
Xiangyu Yue
Kai Wang
Yang You
VLM
VPVLM
MDE
30
51
0
18 Aug 2022
Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets
Hao Chen
R. Tao
Han Zhang
Yidong Wang
Xiang Li
Weirong Ye
Jindong Wang
Guosheng Hu
Marios Savvides
VPVLM
16
52
0
15 Aug 2022
Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning
Dong Huk Park
Jongmin Lee
ObjD
12
11
0
15 Aug 2022
The Weighting Game: Evaluating Quality of Explainability Methods
Lassi Raatikainen
Esa Rahtu
FAtt
XAI
21
4
0
12 Aug 2022
Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization
Z. Li
Mengshu Sun
Alec Lu
Haoyu Ma
Geng Yuan
...
Yanyu Li
M. Leeser
Zhangyang Wang
Xue Lin
Zhenman Fang
ViT
MQ
14
49
0
10 Aug 2022
Attention Hijacking in Trojan Transformers
Weimin Lyu
Songzhu Zheng
Teng Ma
Haibin Ling
Chao Chen
27
6
0
09 Aug 2022
Global Hierarchical Attention for 3D Point Cloud Analysis
Dan Jia
Alexander Hermans
Bastian Leibe
3DPC
21
0
0
07 Aug 2022
Making the Best of Both Worlds: A Domain-Oriented Transformer for Unsupervised Domain Adaptation
Wen-hui Ma
Jinming Zhang
Shuang Li
Chi Harold Liu
Yulin Wang
Wei Li
13
14
0
02 Aug 2022
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Tilman Raukur
A. Ho
Stephen Casper
Dylan Hadfield-Menell
AAML
AI4CE
18
124
0
27 Jul 2022
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation
Sunghwan Hong
Seokju Cho
Jisu Nam
Stephen Lin
Seung Wook Kim
ViT
19
122
0
22 Jul 2022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng
Zizheng Pan
Mingfei Han
Xiaojun Chang
Bohan Zhuang
ViT
19
25
0
21 Jul 2022
Locality Guidance for Improving Vision Transformers on Tiny Datasets
Kehan Li
Runyi Yu
Zhennan Wang
Li-ming Yuan
Guoli Song
Jie Chen
ViT
24
43
0
20 Jul 2022
On the Versatile Uses of Partial Distance Correlation in Deep Learning
Xingjian Zhen
Zihang Meng
Rudrasis Chakraborty
Vikas Singh
OODD
32
27
0
20 Jul 2022
Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for COVID-19 Screening With Chest Radiography
Kai Ma
Pengcheng Xi
K. Habashy
Ashkan Ebadi
Stéphane Tremblay
Alexander Wong
ViT
MedIm
11
1
0
19 Jul 2022
Defect Transformer: An Efficient Hybrid Transformer Architecture for Surface Defect Detection
Junpu Wang
Guili Xu
Fuju Yan
Jinjin Wang
Zhengsheng Wang
ViT
MedIm
21
65
0
17 Jul 2022
2D Self-Organized ONN Model For Handwritten Text Recognition
Hanadi Hassen Mohammed
Junaid Malik
Somaya Al-Madeed
S. Kiranyaz
14
5
0
17 Jul 2022
ESFPNet: efficient deep learning architecture for real-time lesion segmentation in autofluorescence bronchoscopic video
Qi Chang
Danish Ahmad
J.W. Toth
R. Bascom
W. Higgins
MedIm
19
49
0
15 Jul 2022
eX-ViT: A Novel eXplainable Vision Transformer for Weakly Supervised Semantic Segmentation
Lu Yu
Wei Xiang
Juan Fang
Yi-Ping Phoebe Chen
Lianhua Chi
ViT
24
24
0
12 Jul 2022
How many perturbations break this model? Evaluating robustness beyond adversarial accuracy
R. Olivier
Bhiksha Raj
AAML
29
5
0
08 Jul 2022
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Gasser Elbanna
Neil Scheidwasser
M. Kegler
P. Beckmann
Karl El Hajal
Milos Cernak
SSL
29
21
0
24 Jun 2022
Measuring Representational Robustness of Neural Networks Through Shared Invariances
Vedant Nanda
Till Speicher
Camila Kolling
John P. Dickerson
Krishna P. Gummadi
Adrian Weller
9
12
0
23 Jun 2022
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Yukang Chen
Jianhui Liu
X. Zhang
Xiaojuan Qi
Jiaya Jia
46
85
0
21 Jun 2022
Global Context Vision Transformers
Ali Hatamizadeh
Hongxu Yin
Greg Heinrich
Jan Kautz
Pavlo Molchanov
ViT
17
120
0
20 Jun 2022
Understanding Robust Learning through the Lens of Representation Similarities
Christian Cianfarani
A. Bhagoji
Vikash Sehwag
Ben Y. Zhao
Prateek Mittal
Haitao Zheng
OOD
19
16
0
20 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Yibo Yang
Yong Liu
Dacheng Tao
ViT
30
32
0
19 Jun 2022
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
Xiao Xu
Chenfei Wu
Shachar Rosenman
Vasudev Lal
Wanxiang Che
Nan Duan
43
64
0
17 Jun 2022
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViT
MDE
24
30
0
14 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
41
525
0
13 Jun 2022
SERE: Exploring Feature Self-relation for Self-supervised Transformer
Zhong-Yu Li
Shanghua Gao
Ming-Ming Cheng
ViT
MDE
26
14
0
10 Jun 2022
Spatial Entropy as an Inductive Bias for Vision Transformers
E. Peruzzo
E. Sangineto
Yahui Liu
Marco De Nadai
Wei Bi
Bruno Lepri
N. Sebe
ViT
MDE
28
1
0
09 Jun 2022
DORA: Exploring Outlier Representations in Deep Neural Networks
Kirill Bykov
Mayukh Deb
Dennis Grinwald
Klaus-Robert Muller
Marina M.-C. Höhne
19
12
0
09 Jun 2022
CASS: Cross Architectural Self-Supervision for Medical Image Analysis
Pranav Singh
E. Sizikova
Jacopo Cirrone
OOD
49
8
0
08 Jun 2022
Semi-Supervised Segmentation of Mitochondria from Electron Microscopy Images Using Spatial Continuity
Yunpeng Xiao
Youpeng Zhao
Ge Yang
17
3
0
06 Jun 2022
Entangled Residual Mappings
Mathias Lechner
Ramin Hasani
Z. Babaiee
Radu Grosu
Daniela Rus
T. Henzinger
Sepp Hochreiter
6
4
0
02 Jun 2022
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViT
OOD
MedIm
21
20
0
02 Jun 2022