Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2202.06709
Cited By
v1
v2
v3
v4 (latest)
How Do Vision Transformers Work?
International Conference on Learning Representations (ICLR), 2022
14 February 2022
Namuk Park
Songkuk Kim
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (815★)
Papers citing
"How Do Vision Transformers Work?"
50 / 258 papers shown
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
403
12
0
09 Jan 2024
A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODE
Ikumi Okubo
Keisuke Sugiura
Hiroki Matsutani
240
2
0
05 Jan 2024
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations
SeokHyun Seo
Jinwoo Hong
Jungwoo Chae
Kyungyul Kim
Sangheum Hwang
181
0
0
05 Jan 2024
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie Yang
Yun Gu
1.1K
5
0
13 Dec 2023
AdaptIR: Parameter Efficient Multi-task Adaptation for Pre-trained Image Restoration Models
Neural Information Processing Systems (NeurIPS), 2023
Hang Guo
Tao Dai
Yuanchao Bai
Bin Chen
Shu-Tao Xia
Zexuan Zhu
181
1
0
12 Dec 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
ACM Multimedia (ACM MM), 2023
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
252
11
0
29 Nov 2023
Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Sunandini Sanyal
Ashish Ramayee Asokan
Suvaansh Bhambri
YM Pradyumna
Akshay Ravindra Kulkarni
Jogendra Nath Kundu
R. V. Babu
CML
213
7
0
27 Nov 2023
Dynamic Association Learning of Self-Attention and Convolution in Image Restoration
Kui Jiang
Xuemei Jia
Wenxin Huang
Wenbin Wang
Zheng Wang
Junjun Jiang
189
1
0
09 Nov 2023
SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification
Junyan Lin
Feng Gao
Xiaochen Shi
Junyu Dong
Q. Du
176
79
0
08 Nov 2023
On the Convergence of Encoder-only Shallow Transformers
Neural Information Processing Systems (NeurIPS), 2023
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
219
13
0
02 Nov 2023
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
307
24
0
31 Oct 2023
Analyzing Vision Transformers for Image Classification in Class Embedding Space
Neural Information Processing Systems (NeurIPS), 2023
Martina G. Vilas
Timothy Schaumlöffel
Gemma Roig
ViT
214
34
0
29 Oct 2023
Circuit as Set of Points
Neural Information Processing Systems (NeurIPS), 2023
Jialv Zou
Xinggang Wang
Jiahao Guo
Wenyu Liu
Qian Zhang
Chang Huang
GNN
3DV
3DPC
165
5
0
26 Oct 2023
Frequency-Aware Transformer for Learned Image Compression
International Conference on Learning Representations (ICLR), 2023
Han Li
Shaohui Li
Wenrui Dai
Chenglin Li
Junni Zou
H. Xiong
ViT
389
63
0
25 Oct 2023
Domain Generalization Using Large Pretrained Models with Mixture-of-Adapters
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Gyuseong Lee
Wooseok Jang
Jin Hyeon Kim
Jaewoo Jung
Seungryong Kim
MoE
OOD
223
9
0
17 Oct 2023
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers
Matteo Bastico
David Ryckelynck
Laurent Corté
Yannick Tillier
Etienne Decencière
MedIm
ViT
195
4
0
09 Oct 2023
AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention
Xianming Gu
Lihui Wang
Zeyu Deng
Ying Cao
Xingyu Huang
Y. Zhu
MedIm
323
5
0
09 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
International Conference on Machine Learning (ICML), 2023
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
256
7
0
06 Oct 2023
R-divergence for Estimating Model-oriented Distribution Discrepancy
Neural Information Processing Systems (NeurIPS), 2023
Zhilin Zhao
Longbing Cao
379
2
0
02 Oct 2023
CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Xiaoheng Jiang
Kaiyi Guo
Yang Lu
Feng Yan
Hao Liu
Jiale Cao
Mingliang Xu
Dacheng Tao
MedIm
ViT
UQCV
162
2
0
22 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Computer Vision and Pattern Recognition (CVPR), 2023
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
348
215
0
20 Sep 2023
Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation
Kathia Melbouci
F. Nashashibi
158
0
0
18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
280
9
0
16 Sep 2023
Biased Attention: Do Vision Transformers Amplify Gender Bias More than Convolutional Neural Networks?
British Machine Vision Conference (BMVC), 2023
Abhishek Mandal
Susan Leavy
Suzanne Little
ViT
229
8
0
15 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Neural Networks (Neural Netw.), 2023
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
211
31
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
215
5
0
13 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
IEEE International Conference on Computer Vision (ICCV), 2023
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Tong Lu
Hongdong Li
Zhi Jin
ViT
264
183
0
27 Aug 2023
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Computer Vision and Pattern Recognition (CVPR), 2023
Zitao Wang
Qiguang Miao
Peipei Zhao
Yue Xi
ViT
195
5
0
24 Aug 2023
NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos
ACM Multimedia (ACM MM), 2023
Ziyuan Yang
Sucheng Ren
Zongwei Wu
Nanxuan Zhao
Junle Wang
Jing Qin
Shengfeng He
192
3
0
23 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
IEEE International Conference on Computer Vision (ICCV), 2023
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
218
29
0
22 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
IEEE International Conference on Computer Vision (ICCV), 2023
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
294
11
0
20 Aug 2023
Diverse Cotraining Makes Strong Semi-Supervised Segmentor
IEEE International Conference on Computer Vision (ICCV), 2023
Yijiang Li
Xinjiang Wang
Lihe Yang
Xue Jiang
Wayne Zhang
Ying Gao
202
39
0
18 Aug 2023
Long-Range Grouping Transformer for Multi-View 3D Reconstruction
IEEE International Conference on Computer Vision (ICCV), 2023
Liying Yang
Zhenwei Zhu
Xuxin Lin
Jian Nong
Yanyan Liang
ViT
220
10
0
17 Aug 2023
Revisiting Vision Transformer from the View of Path Ensemble
IEEE International Conference on Computer Vision (ICCV), 2023
Shuning Chang
Pichao Wang
Haowen Luo
Fan Wang
Mike Zheng Shou
ViT
169
7
0
12 Aug 2023
Learning to Generate Training Datasets for Robust Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Marwane Hariat
Olivier Laurent
Rémi Kazmierczak
Shihao Zhang
Andrei Bursuc
Angela Yao
Gianni Franchi
UQCV
300
4
0
01 Aug 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Liu
Songyang Zhang
Jiacheng Chen
Zhaohui Yu
Kai-xiang Chen
Dahua Lin
209
41
0
01 Aug 2023
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer
ACM Multimedia (ACM MM), 2023
Guanyu Xu
Jiawei Hao
Li Shen
Han Hu
Yong Luo
Hui Lin
J. Shen
226
30
0
01 Aug 2023
Partitioned Saliency Ranking with Dense Pyramid Transformers
ACM Multimedia (ACM MM), 2023
Chengxiao Sun
Yan Xu
Jialun Pei
Haopeng Fang
He Tang
ViT
163
6
0
01 Aug 2023
Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network
IEEE International Conference on Computer Vision (ICCV), 2023
Chull Hwan Song
Taebaek Hwang
Jooyoung Yoon
Shunghyun Choi
Y. Gu
207
2
0
25 Jul 2023
On the Effectiveness of Spectral Discriminators for Perceptual Quality Improvement
IEEE International Conference on Computer Vision (ICCV), 2023
Xin Luo
Yunan Zhu
Shunxin Xu
Dong Liu
278
15
0
22 Jul 2023
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks
International Conference on Learning Representations (ICLR), 2023
Leo Zhao
Xueying Ding
B. Prakash
PINN
AI4CE
251
58
0
21 Jul 2023
Towards Building More Robust Models with Frequency Bias
IEEE International Conference on Computer Vision (ICCV), 2023
Qingwen Bu
Dong Huang
Heming Cui
AAML
253
19
0
19 Jul 2023
Deficiency-Aware Masked Transformer for Video Inpainting
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
250
10
0
17 Jul 2023
Complementary Frequency-Varying Awareness Network for Open-Set Fine-Grained Image Recognition
Qiulei Dong
Hong Wang
Qiulei Dong
304
1
0
14 Jul 2023
DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation
ACM Multimedia Asia (MA), 2023
Yi Leng
Qiangjuan Huang
Zhiyuan Wang
Yangyang Liu
Haoyu Zhang
DiffM
185
6
0
12 Jul 2023
Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis
Gongshu Wang
Ning Jiang
Yunxiao Ma
Tiantian Liu
Duanduan Chen
Jinglong Wu
Guoqi Li
Dong Liang
Tianyi Yan
MedIm
230
2
0
08 Jun 2023
Multi-Architecture Multi-Expert Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yunsung Lee
Jin-Young Kim
Hyojun Go
Myeongho Jeong
Shinhyeok Oh
Seungtaek Choi
DiffM
355
39
0
08 Jun 2023
Graph Inductive Biases in Transformers without Message Passing
International Conference on Machine Learning (ICML), 2023
Liheng Ma
Chen Lin
Derek Lim
Adriana Romero Soriano
P. Dokania
Mark Coates
Juil Sock
Ser-Nam Lim
AI4CE
250
150
0
27 May 2023
Dual Path Transformer with Partition Attention
Zhengkai Jiang
Liang Liu
Jiangning Zhang
Yabiao Wang
Mingang Chen
Chengjie Wang
ViT
236
2
0
24 May 2023
Semantic Segmentation using Vision Transformers: A survey
Engineering applications of artificial intelligence (Eng. Appl. Artif. Intell.), 2023
Hans Thisanke
Chamli Deshan
K. Chamith
Sachith Seneviratne
Rajith Vidanaarachchi
Damayanthi Herath
ViT
198
215
0
05 May 2023
Previous
1
2
3
4
5
6
Next