2202.06709
Cited By
How Do Vision Transformers Work?
14 February 2022
Namuk Park
Songkuk Kim
ViT
Papers citing "How Do Vision Transformers Work?"
50 / 236 papers shown
Title
Towards Quantifying the Hessian Structure of Neural Networks
Zhaorui Dong
Yushun Zhang
Z. Luo
Jianfeng Yao
Ruoyu Sun
24
0
0
05 May 2025
CVVNet: A Cross-Vertical-View Network for Gait Recognition
X. Li
Wei Song
Yingda Huang
Wei Meng
Le Chang
CVBM
18
0
0
03 May 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
29
0
0
12 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
32
0
0
31 Mar 2025
Filtering with Time-frequency Analysis: An Adaptive and Lightweight Model for Sequential Recommender Systems Based on Discrete Wavelet Transform
Sheng Lu
Mingxi Ge
Jiuyi Zhang
Wanli Zhu
Guanjin Li
Fangming Gu
AI4TS
56
0
0
30 Mar 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Hui Zhang
Tingwei Gao
Jie Shao
Zuxuan Wu
64
0
0
20 Mar 2025
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras
Beilei Cui
Long Bai
Mobarakol Islam
An-Chi Wang
Z. Ma
...
Feng Li
Zhen Chen
Zhongliang Jiang
Nassir Navab
Hongliang Ren
MedIm
60
0
0
20 Mar 2025
Exposure Bias Reduction for Enhancing Diffusion Transformer Feature Caching
Zhen Zou
Hu Yu
Jie Xiao
Feng Zhao
37
0
0
10 Mar 2025
Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification
Yimin Zhu
Linlin Xu
DiffM
50
1
0
27 Feb 2025
Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation
Tianyang Xu
Jiyong Rao
Xiaoning Song
Zhenhua Feng
Xiao Wu
ViT
62
1
0
25 Feb 2025
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Akiyoshi Tomihari
Issei Sato
ODL
59
0
0
31 Jan 2025
Keypoint Aware Masked Image Modelling
Madhava Krishna
Convin.AI
63
0
0
03 Jan 2025
Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and Metrics
Lukas Klein
Carsten T. Lüth
U. Schlegel
Till J. Bungert
Mennatallah El-Assady
Paul F. Jäger
XAI
ELM
29
1
0
03 Jan 2025
Prompt Categories Cluster for Weakly Supervised Semantic Segmentation
Wangyu Wu
Xianglin Qiu
Siqi Song
Xiaowei Huang
Fei Ma
Jimin Xiao
VLM
62
4
0
18 Dec 2024
Adaptive High-Pass Kernel Prediction for Efficient Video Deblurring
Bo Ji
Angela Yao
68
0
0
02 Dec 2024
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation
S. Ly
Hien Nguyen
72
1
0
28 Nov 2024
Freqformer: Frequency-Domain Transformer for 3-D Visualization and Quantification of Human Retinal Circulation
Lingyun Wang
Bingjie Wang
Jay Chhablani
J. Sahel
Shaohua Pi
MedIm
34
1
0
17 Nov 2024
D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification
Minhee Jang
Juheon Son
Thanaporn Viriyasaranon
Junho Kim
Jang-Hwan Choi
MedIm
26
0
0
17 Nov 2024
Where Do Large Learning Rates Lead Us?
Ildus Sadrtdinov
M. Kodryan
Eduard Pokonechny
E. Lobacheva
Dmitry Vetrov
AI4CE
29
0
0
29 Oct 2024
Depth Attention for Robust RGB Tracking
Yu Liu
Arif Mahmood
Muhammad Haris Khan
VOS
MDE
21
0
0
27 Oct 2024
In Search of the Successful Interpolation: On the Role of Sharpness in CLIP Generalization
Alireza Abdollahpoorrostam
14
0
0
21 Oct 2024
TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant
Guopeng Li
Qiang Wang
K. Yan
Shouhong Ding
Yuan Gao
Gui-Song Xia
21
0
0
16 Oct 2024
CATCH: Channel-Aware multivariate Time Series Anomaly Detection via Frequency Patching
Xingjian Wu
Xiangfei Qiu
Zhengyu Li
Yihang Wang
Jilin Hu
Chenjuan Guo
Hui Xiong
Bin Yang
AI4TS
59
12
0
16 Oct 2024
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Weronika Ormaniec
Felix Dangel
Sidak Pal Singh
29
6
0
14 Oct 2024
Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices
Yiwei Zhao
Ziyun Li
Win-San Khwa
Xiaoyu Sun
Sai Qian Zhang
...
Jorge Gomez
Jae-sun Seo
Phillip B. Gibbons
B. D. Salvo
Chiao Liu
20
1
0
10 Oct 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
Hang Guo
Tao Dai
Zhihao Ouyang
Taolin Zhang
Yaohua Zha
Bin Chen
Shu-Tao Xia
DiffM
27
5
0
08 Oct 2024
Spiking Transformer with Spatial-Temporal Attention
Donghyun Lee
Yuhang Li
Youngeun Kim
Shiting Xiao
Priyadarshini Panda
16
1
0
29 Sep 2024
The Overfocusing Bias of Convolutional Neural Networks: A Saliency-Guided Regularization Approach
David Bertoin
Eduardo Hugo Sanchez
Mehdi Zouitine
Emmanuel Rachelson
18
0
0
25 Sep 2024
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion
Yuchen Guo
Ruoxiang Xu
Rongcheng Li
Zhenghao Wu
Weifeng Su
16
0
0
16 Sep 2024
Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery
Wei Liu
Saurabh Prasad
Melba M. Crawford
28
3
0
14 Sep 2024
STAA: Spatio-Temporal Alignment Attention for Short-Term Precipitation Forecasting
Min Chen
Hao Yang
Shaohan Li
Xiaolin Qin
20
1
0
06 Sep 2024
Do Sharpness-based Optimizers Improve Generalization in Medical Image Analysis?
Mohamed Hassan
Aleksandar Vakanski
Min Xian
AAML
MedIm
36
1
0
07 Aug 2024
Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection
Vincenzo De Rosa
Fabrizio Guillaro
Giovanni Poggi
D. Cozzolino
L. Verdoliva
AAML
52
4
0
28 Jul 2024
SegPoint: Segment Any Point Cloud via Large Language Model
Shuting He
Henghui Ding
Xudong Jiang
Bihan Wen
3DV
MLLM
3DPC
35
17
0
18 Jul 2024
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
Ping Wang
Yulun Zhang
Lishun Wang
Xin Yuan
ViT
26
1
0
16 Jul 2024
Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment
Yujie Zhang
Qi Yang
Ziyu Shan
Yiling Xu
3DPC
19
0
0
13 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
22
5
0
12 Jul 2024
Wavelet Convolutions for Large Receptive Fields
Shahaf E. Finder
Roy Amoyal
Eran Treister
O. Freifeld
ViT
MDE
20
48
0
08 Jul 2024
Segmentation of Non-Small Cell Lung Carcinomas: Introducing DRU-Net and Multi-Lens Distortion
Soroush Oskouei
Marit Valla
André Pedersen
Erik Smistad
V. G. Dale
...
T. Langø
M. Ramnefjell
L. A. Akslen
Gabriel Kiss
H. Sorger
21
0
0
20 Jun 2024
H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
Son Nguyen
Lizhang Chen
Bo Liu
Qiang Liu
20
3
0
14 Jun 2024
Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising
Hao Liang
Chengjie
Kun Li
Xin Tian
21
1
0
13 Jun 2024
RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks
Zhechao Wang
Peirui Cheng
Pengju Tian
Yuchao Wang
Mingxin Chen
Shujing Duan
Zhirui Wang
Xinming Li
Xian Sun
26
2
0
11 Jun 2024
Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control
Dongyoon Hwang
ByungKun Lee
Hojoon Lee
Hyunseung Kim
Jaegul Choo
27
0
0
10 Jun 2024
Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology
Frank Ruis
Alma M. Liezenga
Friso G. Heslinga
Luca Ballan
Thijs A. Eker
Richard J. M. den Hollander
Martin C. van Leeuwen
Judith Dijk
Wyke Huizinga
23
2
0
30 May 2024
Hyperspectral Image Reconstruction for Predicting Chick Embryo Mortality Towards Advancing Egg and Hatchery Industry
Toukir Ahmed
Md Wadud Ahmed
Ocean Monjur
J. Emmert
Girish Chowdhary
Mohammed Kamruzzaman
19
11
0
22 May 2024
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
Beilei Cui
Mobarakol Islam
Long Bai
An-Chi Wang
Hongliang Ren
MedIm
26
14
0
14 May 2024
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
Weiquan Huang
Yifei Shen
Yifan Yang
Mamba
33
4
0
30 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Joey Tianyi Zhou
ViT
27
3
0
21 Apr 2024
CKGConv: General Graph Convolution with Continuous Kernels
Liheng Ma
Soumyasundar Pal
Yitian Zhang
Jiaming Zhou
Yingxue Zhang
Mark J. Coates
37
3
0
21 Apr 2024
Partial Large Kernel CNNs for Efficient Super-Resolution
Dongheon Lee
Seokju Yun
Youngmin Ro
SupR
26
0
0
18 Apr 2024