2202.06709
Cited By
How Do Vision Transformers Work?
14 February 2022
Namuk Park
Songkuk Kim
ViT
Papers citing "How Do Vision Transformers Work?"
50 / 236 papers shown
Title
DGMamba: Domain Generalization via Generalized State Space Model
Shaocong Long
Qianyu Zhou
Xiangtai Li
Xuequan Lu
Chenhao Ying
Yuan Luo
Lizhuang Ma
Shuicheng Yan
47
9
0
11 Apr 2024
Playing to Vision Foundation Model's Strengths in Stereo Matching
Chuangwei Liu
Qijun Chen
Rui Fan
25
12
0
09 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
C. L. P. Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
37
4
0
03 Apr 2024
Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
Shihao Zhou
Jinshan Pan
Jinglei Shi
Duosheng Chen
Lishen Qu
Jufeng Yang
VLM
21
3
0
30 Mar 2024
Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration
Shihao Zhou
Duosheng Chen
Jinshan Pan
Jufeng Yang
32
2
0
30 Mar 2024
Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System
Jing Li
Lu Bai
Bi-Hong Yang
Chang Li
Lingfei Ma
Lixin Cui
Edwin R. Hancock
30
1
0
24 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
38
1
0
21 Mar 2024
Spiking Wavelet Transformer
Yuetong Fang
Ziqing Wang
Lingfeng Zhang
Jiahang Cao
Honglei Chen
Renjing Xu
54
4
0
17 Mar 2024
Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution
Jialu Sui
Xianping Ma
Xiaokang Zhang
Man-On Pun
DiffM
21
0
0
17 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Linwei Chen
Lin Gu
Ying Fu
16
21
0
08 Mar 2024
DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Ziqi Gao
Yue Zhang
Xinwen Liu
Kaiyan Li
S. Kevin Zhou
36
1
0
08 Mar 2024
Interactive Multi-Head Self-Attention with Linear Complexity
Hankyul Kang
Ming-Hsuan Yang
Jongbin Ryu
19
1
0
27 Feb 2024
SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging
Meng Lou
Hanning Ying
Xiaoqing Liu
Hong-Yu Zhou
Yuqing Zhang
Yizhou Yu
MedIm
37
7
0
27 Feb 2024
Interpretable Short-Term Load Forecasting via Multi-Scale Temporal Decomposition
Yuqi Jiang
Yan Li
Yize Chen
AI4TS
14
2
0
18 Feb 2024
Architecture Analysis and Benchmarking of 3D U-shaped Deep Learning Models for Thoracic Anatomical Segmentation
Arash Harirpoush
Amir Rasoulian
Marta Kersten-Oertel
Yiming Xiao
3DV
6
0
0
05 Feb 2024
CoBra: Complementary Branch Fusing Class and Semantic Knowledge for Robust Weakly Supervised Semantic Segmentation
Woojung Han
Seil Kang
Kyobin Choo
Seong Jae Hwang
17
0
0
05 Feb 2024
Precise Knowledge Transfer via Flow Matching
Shitong Shao
Zhiqiang Shen
Linrui Gong
Huanran Chen
Xu Dai
21
2
0
03 Feb 2024
Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model
Zihan Zhong
Zhiqiang Tang
Tong He
Haoyang Fang
Chun Yuan
33
40
0
31 Jan 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
34
29
0
29 Jan 2024
MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection
Jianan Li
Shaocong Dong
Lihe Ding
Tingfa Xu
3DPC
19
7
0
22 Jan 2024
Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
Debesh Jha
Rekha Singhal
Ulas Bagci
25
3
0
18 Jan 2024
Efficient generative adversarial networks using linear additive-attention Transformers
Emilio Morales-Juarez
Gibran Fuentes Pineda
21
3
0
17 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
22
4
0
11 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
20
5
0
09 Jan 2024
A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODE
Ikumi Okubo
Keisuke Sugiura
Hiroki Matsutani
15
2
0
05 Jan 2024
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations
SeokHyun Seo
Jinwoo Hong
Jungwoo Chae
Kyungyul Kim
Sangheum Hwang
19
0
0
05 Jan 2024
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie-jin Yang
Yun Gu
33
1
0
13 Dec 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
11
4
0
29 Nov 2023
Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation
Sunandini Sanyal
Ashish Ramayee Asokan
Suvaansh Bhambri
YM Pradyumna
Akshay Ravindra Kulkarni
Jogendra Nath Kundu
R. V. Babu
CML
25
2
0
27 Nov 2023
Dynamic Association Learning of Self-Attention and Convolution in Image Restoration
Kui Jiang
Xuemei Jia
Wenxin Huang
Wenbin Wang
Zheng Wang
Junjun Jiang
20
1
0
09 Nov 2023
SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification
Junyan Lin
Feng Gao
Xiaochen Shi
Junyu Dong
Q. Du
36
40
0
08 Nov 2023
On the Convergence of Encoder-only Shallow Transformers
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
V. Cevher
29
5
0
02 Nov 2023
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
24
16
0
31 Oct 2023
Analyzing Vision Transformers for Image Classification in Class Embedding Space
Martina G. Vilas
Timothy Schaumlöffel
Gemma Roig
ViT
14
23
0
29 Oct 2023
Circuit as Set of Points
Jialv Zou
Xinggang Wang
Jiahao Guo
Wenyu Liu
Qian Zhang
Chang Huang
GNN
3DV
3DPC
15
0
0
26 Oct 2023
Frequency-Aware Transformer for Learned Image Compression
Han Li
Shaohui Li
Wenrui Dai
Chenglin Li
Junni Zou
H. Xiong
ViT
20
26
0
25 Oct 2023
Domain Generalization Using Large Pretrained Models with Mixture-of-Adapters
Gyuseong Lee
Wooseok Jang
Jin Hyeon Kim
Jaewoo Jung
Seungryong Kim
MoE
OOD
17
2
0
17 Oct 2023
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers
Matteo Bastico
David Ryckelynck
Laurent Corté
Yannick Tillier
Etienne Decencière
MedIm
ViT
20
2
0
09 Oct 2023
AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention
Xianming Gu
Lihui Wang
Zeyu Deng
Ying Cao
Xingyu Huang
Y. Zhu
MedIm
14
1
0
09 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
25
4
0
06 Oct 2023
R-divergence for Estimating Model-oriented Distribution Discrepancy
Zhilin Zhao
Longbing Cao
55
1
0
02 Oct 2023
CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Xiaoheng Jiang
Kaiyi Guo
Yang Lu
Feng Yan
Hao Liu
Jiale Cao
Mingliang Xu
Dacheng Tao
MedIm
ViT
UQCV
10
1
0
22 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
25
128
0
20 Sep 2023
Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation
Kathia Melbouci
F. Nashashibi
14
0
0
18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
19
4
0
16 Sep 2023
Biased Attention: Do Vision Transformers Amplify Gender Bias More than Convolutional Neural Networks?
Abhishek Mandal
Susan Leavy
Suzanne Little
ViT
11
5
0
15 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
10
16
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
8
3
0
13 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
26
82
0
27 Aug 2023
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang
Qiguang Miao
Peipei Zhao
Yue Xi
ViT
22
2
0
24 Aug 2023