How Do Vision Transformers Work?

International Conference on Learning Representations (ICLR), 2022

14 February 2022

Namuk Park

Songkuk Kim

ViT

Papers citing "How Do Vision Transformers Work?"

50 / 258 papers shown

WaveBound: Dynamic Error Bounds for Stable Time Series Forecasting

Neural Information Processing Systems (NeurIPS), 2022

185

25 Oct 2022

Clinically-Inspired Multi-Agent Transformers for Disease Trajectory Forecasting from Multimodal Data

IEEE Transactions on Medical Imaging (IEEE TMI), 2022

Matthew B. Blaschko

177

25 Oct 2022

G2NetPL: Generic Game-Theoretic Network for Partial-Label Image Classification

188

20 Oct 2022

Similarity of Neural Architectures using Adversarial Attack Transferability

European Conference on Computer Vision (ECCV), 2022

538

20 Oct 2022

Scratching Visual Transformer's Back with Uniform Attention

IEEE International Conference on Computer Vision (ICCV), 2022

1.1K

16 Oct 2022

Vision Transformer Visualization: What Neurons Tell and How Neurons Behave?

Van-Anh Nguyen

Trung Le

14 Oct 2022

How to Train Vision Transformer on Small-scale Datasets?

British Machine Vision Conference (BMVC), 2022

201

13 Oct 2022

Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets

Neural Information Processing Systems (NeurIPS), 2022

261

12 Oct 2022

Curved Representation Space of Vision Transformers

AAAI Conference on Artificial Intelligence (AAAI), 2022

282

11 Oct 2022

Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks

Neural Information Processing Systems (NeurIPS), 2022

Lianli Gao

Jingkuan Song

239

05 Oct 2022

Towards Flexible Inductive Bias via Progressive Reparameterization Scheduling

136

04 Oct 2022

A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition

Kyuhong Shim

Wonyong Sung

168

01 Oct 2022

On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition

258

15 Sep 2022

On the interplay of adversarial robustness and architecture components: patches, convolution and attention

Francesco Croce

Matthias Hein

215

14 Sep 2022

Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students

IEEE International Conference on Robotics and Automation (ICRA), 2022

Lin Wang

259

06 Sep 2022

Transformers in Remote Sensing: A Survey

Remote Sensing (RS), 2022

Abdulaziz Amer Aleissaee

Amandeep Kumar

Rao Muhammad Anwer

Salman Khan

Hisham Cholakkal

Guisong Xia

Fahad Shahbaz Khan

ViT

224

283

02 Sep 2022

Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

144

20 Aug 2022

The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

International Conference on 3D Vision (3DV), 2022

239

18 Aug 2022

Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning

Dong Huk Park

Jongmin Lee

ObjD

434

15 Aug 2022

Attention Hijacking in Trojan Transformers

193

09 Aug 2022

End-to-end View Synthesis via NeRF Attention

Zelin Zhao

Jiaya Jia

295

29 Jul 2022

Magic ELF: Image Deraining Meets Association Learning and Transformer

ACM Multimedia (ACM MM), 2022

Zheng Wang

172

21 Jul 2022

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection

European Conference on Computer Vision (ECCV), 2022

Yuetian Weng

Zizheng Pan

Mingfei Han

Xiaojun Chang

Bohan Zhuang

ViT

175

21 Jul 2022

SplitMixer: Fat Trimmed From MLP-like Models

Ali Borji

Sikun Lin

188

21 Jul 2022

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

Rui Wang

Min Zheng

Xin Pan

ViT

229

199

12 Jul 2022

Attention mechanisms for physiological signal deep learning: which attention should we take?

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022

127

04 Jul 2022

Automatic Sleep Scoring from Large-scale Multi-channel Pediatric EEG

Harlin Lee

Aaqib Saeed

149

30 Jun 2022

Continual Learning with Transformers for Image Classification

185

28 Jun 2022

Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images

International Conference on Learning Representations (ICLR), 2022

234

17 Jun 2022

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Bennett A. Landman

433

144

02 Jun 2022

3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction

Asian Conference on Computer Vision (ACCV), 2022

Leslie Ching Ow Tiong

Dick Sigmund

Andrew Beng Jin Teoh

3DV ViT

150

29 May 2022

A Closer Look at Self-Supervised Lightweight Vision Transformers

International Conference on Machine Learning (ICML), 2022

275

28 May 2022

Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

International Conference on Machine Learning (ICML), 2022

Siyuan Li

226

27 May 2022

Fast Vision Transformers with HiLo Attention

Neural Information Processing Systems (NeurIPS), 2022

Zizheng Pan

Jianfei Cai

Bohan Zhuang

444

244

26 May 2022

Inception Transformer

Neural Information Processing Systems (NeurIPS), 2022

Weihao Yu

338

256

25 May 2022

Towards Unified Keyframe Propagation Models

124

19 May 2022

Vision Transformer Adapter for Dense Predictions

International Conference on Learning Representations (ICLR), 2022

Yu Qiao

894

755

17 May 2022

Continual Hippocampus Segmentation with Transformers

144

17 Apr 2022

ResT V2: Simpler, Faster and Stronger

Neural Information Processing Systems (NeurIPS), 2022

Qing-Long Zhang

Yubin Yang

ViT

246

15 Apr 2022

Machine Learning State-of-the-Art with Uncertainties

11 Apr 2022

Improving Vision Transformers by Revisiting High-frequency Components

European Conference on Computer Vision (ECCV), 2022

319

118

03 Apr 2022

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow

Computer Vision and Pattern Recognition (CVPR), 2022

Yong Liu

240

124

31 Mar 2022

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization

Yang Cao

182

24 Mar 2022

PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers

Computer Vision and Pattern Recognition (CVPR), 2022

161

22 Mar 2022

Are Vision Transformers Robust to Spurious Correlations?

International Journal of Computer Vision (IJCV), 2022

236

17 Mar 2022

LDP: Learnable Dynamic Precision for Efficient Deep Neural Network Training and Inference

172

15 Mar 2022

Deep Transformers Thirst for Comprehensive-Frequency Data

277

14 Mar 2022

When Do Flat Minima Optimizers Work?

Neural Information Processing Systems (NeurIPS), 2022

526

01 Feb 2022

How Expressive are Transformers in Spectral Domain for Graphs?

245

23 Jan 2022

Swin Transformer coupling CNNs Makes Strong Contextual Encoders for VHR Image Road Extraction

Tao Chen

10 Jan 2022