ResearchTrend.AI
How Do Vision Transformers Work?

14 February 2022
Namuk Park
Songkuk Kim
    ViT

Papers citing "How Do Vision Transformers Work?"

50 / 236 papers shown
DGMamba: Domain Generalization via Generalized State Space Model
Shaocong Long
Qianyu Zhou
Xiangtai Li
Xuequan Lu
Chenhao Ying
Yuan Luo
Lizhuang Ma
Shuicheng Yan
47
9
0
11 Apr 2024
Playing to Vision Foundation Model's Strengths in Stereo Matching
Chuangwei Liu
Qijun Chen
Rui Fan
25
12
0
09 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
C. L. P. Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
37
4
0
03 Apr 2024
Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
Shihao Zhou
Jinshan Pan
Jinglei Shi
Duosheng Chen
Lishen Qu
Jufeng Yang
VLM
21
3
0
30 Mar 2024
Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration
Shihao Zhou
Duosheng Chen
Jinshan Pan
Jufeng Yang
32
2
0
30 Mar 2024
Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System
Jing Li
Lu Bai
Bi-Hong Yang
Chang Li
Lingfei Ma
Lixin Cui
Edwin R. Hancock
30
1
0
24 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
38
1
0
21 Mar 2024
Spiking Wavelet Transformer
Yuetong Fang
Ziqing Wang
Lingfeng Zhang
Jiahang Cao
Honglei Chen
Renjing Xu
54
4
0
17 Mar 2024
Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution
Jialu Sui
Xianping Ma
Xiaokang Zhang
Man-On Pun
DiffM
21
0
0
17 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Linwei Chen
Lin Gu
Ying Fu
16
21
0
08 Mar 2024
DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Ziqi Gao
Yue Zhang
Xinwen Liu
Kaiyan Li
S. Kevin Zhou
36
1
0
08 Mar 2024
Interactive Multi-Head Self-Attention with Linear Complexity
Hankyul Kang
Ming-Hsuan Yang
Jongbin Ryu
19
1
0
27 Feb 2024
SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging
Meng Lou
Hanning Ying
Xiaoqing Liu
Hong-Yu Zhou
Yuqing Zhang
Yizhou Yu
MedIm
37
7
0
27 Feb 2024
Interpretable Short-Term Load Forecasting via Multi-Scale Temporal Decomposition
Yuqi Jiang
Yan Li
Yize Chen
AI4TS
14
2
0
18 Feb 2024
Architecture Analysis and Benchmarking of 3D U-shaped Deep Learning Models for Thoracic Anatomical Segmentation
Arash Harirpoush
Amir Rasoulian
Marta Kersten-Oertel
Yiming Xiao
3DV
6
0
0
05 Feb 2024
CoBra: Complementary Branch Fusing Class and Semantic Knowledge for Robust Weakly Supervised Semantic Segmentation
Woojung Han
Seil Kang
Kyobin Choo
Seong Jae Hwang
17
0
0
05 Feb 2024
Precise Knowledge Transfer via Flow Matching
Shitong Shao
Zhiqiang Shen
Linrui Gong
Huanran Chen
Xu Dai
21
2
0
03 Feb 2024
Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model
Zihan Zhong
Zhiqiang Tang
Tong He
Haoyang Fang
Chun Yuan
33
40
0
31 Jan 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
34
29
0
29 Jan 2024
MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection
Jianan Li
Shaocong Dong
Lihe Ding
Tingfa Xu
3DPC
19
7
0
22 Jan 2024
Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
Debesh Jha
Rekha Singhal
Ulas Bagci
25
3
0
18 Jan 2024
Efficient generative adversarial networks using linear additive-attention Transformers
Emilio Morales-Juarez
Gibran Fuentes Pineda
21
3
0
17 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
22
4
0
11 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
20
5
0
09 Jan 2024
A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODE
Ikumi Okubo
Keisuke Sugiura
Hiroki Matsutani
15
2
0
05 Jan 2024
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations
SeokHyun Seo
Jinwoo Hong
Jungwoo Chae
Kyungyul Kim
Sangheum Hwang
19
0
0
05 Jan 2024
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie-jin Yang
Yun Gu
33
1
0
13 Dec 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
11
4
0
29 Nov 2023
Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation
Sunandini Sanyal
Ashish Ramayee Asokan
Suvaansh Bhambri
YM Pradyumna
Akshay Ravindra Kulkarni
Jogendra Nath Kundu
R. V. Babu
CML
25
2
0
27 Nov 2023
Dynamic Association Learning of Self-Attention and Convolution in Image Restoration
Kui Jiang
Xuemei Jia
Wenxin Huang
Wenbin Wang
Zheng Wang
Junjun Jiang
20
1
0
09 Nov 2023
SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification
Junyan Lin
Feng Gao
Xiaochen Shi
Junyu Dong
Q. Du
36
40
0
08 Nov 2023
On the Convergence of Encoder-only Shallow Transformers
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
V. Cevher
29
5
0
02 Nov 2023
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
24
16
0
31 Oct 2023
Analyzing Vision Transformers for Image Classification in Class Embedding Space
Martina G. Vilas
Timothy Schaumlöffel
Gemma Roig
ViT
14
23
0
29 Oct 2023
Circuit as Set of Points
Jialv Zou
Xinggang Wang
Jiahao Guo
Wenyu Liu
Qian Zhang
Chang Huang
GNN
3DV
3DPC
15
0
0
26 Oct 2023
Frequency-Aware Transformer for Learned Image Compression
Han Li
Shaohui Li
Wenrui Dai
Chenglin Li
Junni Zou
H. Xiong
ViT
20
26
0
25 Oct 2023
Domain Generalization Using Large Pretrained Models with Mixture-of-Adapters
Gyuseong Lee
Wooseok Jang
Jin Hyeon Kim
Jaewoo Jung
Seungryong Kim
MoE
OOD
17
2
0
17 Oct 2023
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers
Matteo Bastico
David Ryckelynck
Laurent Corté
Yannick Tillier
Etienne Decencière
MedIm
ViT
20
2
0
09 Oct 2023
AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention
Xianming Gu
Lihui Wang
Zeyu Deng
Ying Cao
Xingyu Huang
Y. Zhu
MedIm
14
1
0
09 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
25
4
0
06 Oct 2023
R-divergence for Estimating Model-oriented Distribution Discrepancy
Zhilin Zhao
Longbing Cao
55
1
0
02 Oct 2023
CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Xiaoheng Jiang
Kaiyi Guo
Yang Lu
Feng Yan
Hao Liu
Jiale Cao
Mingliang Xu
Dacheng Tao
MedIm
ViT
UQCV
10
1
0
22 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
25
128
0
20 Sep 2023
Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation
Kathia Melbouci
F. Nashashibi
14
0
0
18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
19
4
0
16 Sep 2023
Biased Attention: Do Vision Transformers Amplify Gender Bias More than Convolutional Neural Networks?
Abhishek Mandal
Susan Leavy
Suzanne Little
ViT
11
5
0
15 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
10
16
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
8
3
0
13 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
26
82
0
27 Aug 2023
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang
Qiguang Miao
Peipei Zhao
Yue Xi
ViT
22
2
0
24 Aug 2023