2202.06709
How Do Vision Transformers Work?
International Conference on Learning Representations (ICLR), 2022
14 February 2022
Namuk Park
Songkuk Kim
ViT
Github (815★)
Papers citing
"How Do Vision Transformers Work?"
50 / 258 papers shown
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Hang Guo
Tao Dai
Zhihao Ouyang
Taolin Zhang
Yaohua Zha
Bin Chen
Shu-Tao Xia
DiffM
219
10
0
08 Oct 2024
Spiking Transformer with Spatial-Temporal Attention
Computer Vision and Pattern Recognition (CVPR), 2024
Donghyun Lee
Yuhang Li
Youngeun Kim
Shiting Xiao
Priyadarshini Panda
414
14
0
29 Sep 2024
The Overfocusing Bias of Convolutional Neural Networks: A Saliency-Guided Regularization Approach
David Bertoin
Eduardo Hugo Sanchez
Mehdi Zouitine
Emmanuel Rachelson
241
1
0
25 Sep 2024
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion
Yuchen Guo
Ruoxiang Xu
Rongcheng Li
Weifeng Su
474
1
0
16 Sep 2024
Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Wei Liu
Saurabh Prasad
Melba M. Crawford
193
9
0
14 Sep 2024
STAA: Spatio-Temporal Alignment Attention for Short-Term Precipitation Forecasting
IEEE Geoscience and Remote Sensing Letters (GRSL), 2024
Min Chen
Hao Yang
Shaohan Li
Xiaolin Qin
139
1
0
06 Sep 2024
Do Sharpness-based Optimizers Improve Generalization in Medical Image Analysis?
IEEE Access (IEEE Access), 2024
Mohamed Hassan
Aleksandar Vakanski
Min Xian
AAML
MedIm
387
3
0
07 Aug 2024
Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection
International Workshop on Information Forensics and Security (WIFS), 2024
Vincenzo De Rosa
Fabrizio Guillaro
Giovanni Poggi
D. Cozzolino
L. Verdoliva
AAML
281
14
0
28 Jul 2024
SegPoint: Segment Any Point Cloud via Large Language Model
Shuting He
Henghui Ding
Xudong Jiang
Bihan Wen
3DV
MLLM
3DPC
246
35
0
18 Jul 2024
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
Ping Wang
Yulun Zhang
Lishun Wang
Xin Yuan
ViT
415
4
0
16 Jul 2024
Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment
Yujie Zhang
Qi Yang
Ziyu Shan
Yiling Xu
3DPC
258
4
0
13 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
298
10
0
12 Jul 2024
Wavelet Convolutions for Large Receptive Fields
Shahaf E. Finder
Roy Amoyal
Eran Treister
Oren Freifeld
ViT
MDE
488
313
0
08 Jul 2024
Learning Dual Transformers for All-In-One Image Restoration from a Frequency Perspective
Zenglin Shi
Tong Su
Pei Liu
Yunpeng Wu
Le Zhang
Meng Wang
ViT
192
0
0
30 Jun 2024
Segmentation of Non-Small Cell Lung Carcinomas: Introducing DRU-Net and Multi-Lens Distortion
Soroush Oskouei
Marit Valla
André Pedersen
Erik Smistad
V. G. Dale
...
T. Langø
M. Ramnefjell
L. A. Akslen
Gabriel Kiss
H. Sorger
149
4
0
20 Jun 2024
H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Son Nguyen
Lizhang Chen
Bo Liu
Qiang Liu
299
7
0
14 Jun 2024
Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising
Hao Liang
Chengjie
Kun Li
Xin Tian
159
3
0
13 Jun 2024
RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks
Zhechao Wang
Peirui Cheng
Pengju Tian
Yuchao Wang
Mingxin Chen
Shujing Duan
Zhirui Wang
Xinming Li
Xian Sun
221
5
0
11 Jun 2024
Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control
International Conference on Machine Learning (ICML), 2024
Dongyoon Hwang
ByungKun Lee
Hojoon Lee
Hyunseung Kim
Jaegul Choo
257
0
0
10 Jun 2024
Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology
Frank Ruis
Alma M. Liezenga
Friso G. Heslinga
Luca Ballan
Thijs A. Eker
Richard J. M. den Hollander
Martin C. van Leeuwen
Judith Dijk
Wyke Huizinga
207
9
0
30 May 2024
Hyperspectral Image Reconstruction for Predicting Chick Embryo Mortality Towards Advancing Egg and Hatchery Industry
Toukir Ahmed
Md Wadud Ahmed
Ocean Monjur
J. Emmert
Girish Chowdhary
Mohammed Kamruzzaman
163
19
0
22 May 2024
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Beilei Cui
Mobarakol Islam
Long Bai
An-Chi Wang
Hongliang Ren
MedIm
246
44
0
14 May 2024
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
Weiquan Huang
Yifei Shen
Yifan Yang
Mamba
150
7
0
30 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
231
8
0
21 Apr 2024
CKGConv: General Graph Convolution with Continuous Kernels
Liheng Ma
Soumyasundar Pal
Yitian Zhang
Jiaming Zhou
Yingxue Zhang
Mark Coates
205
8
0
21 Apr 2024
Partial Large Kernel CNNs for Efficient Super-Resolution
Dongheon Lee
Seokju Yun
Youngmin Ro
SupR
190
6
0
18 Apr 2024
DGMamba: Domain Generalization via Generalized State Space Model
Shaocong Long
Qianyu Zhou
Hefei Ling
Xuequan Lu
Chenhao Ying
Yuan Luo
Lizhuang Ma
Shuicheng Yan
342
18
0
11 Apr 2024
Playing to Vision Foundation Model's Strengths in Stereo Matching
IEEE Transactions on Intelligent Vehicles (TIV), 2024
Chuangwei Liu
Qijun Chen
Rui Fan
256
26
0
09 Apr 2024
Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation
Xianping Ma
Xiaokang Zhang
Xingchen Ding
Man-On Pun
Siwei Ma
147
2
0
06 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
Chong Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
208
7
0
03 Apr 2024
Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
Shihao Zhou
Jinshan Pan
Jinglei Shi
Duosheng Chen
Lishen Qu
Jufeng Yang
VLM
248
23
0
30 Mar 2024
Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration
Shihao Zhou
Duosheng Chen
Jinshan Pan
Jufeng Yang
290
2
0
30 Mar 2024
Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System
Jing Li
Lu Bai
Bi-Hong Yang
Chang Li
Lingfei Ma
Lixin Cui
Edwin R. Hancock
285
2
0
24 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
258
7
0
21 Mar 2024
Spiking Wavelet Transformer
Yuetong Fang
Ziqing Wang
Lingfeng Zhang
Jiahang Cao
Honglei Chen
Renjing Xu
342
16
0
17 Mar 2024
Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution
Jialu Sui
Xianping Ma
Xiaokang Zhang
Man-On Pun
DiffM
184
5
0
17 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Linwei Chen
Lin Gu
Ying Fu
751
81
0
08 Mar 2024
DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Ziqi Gao
Yue Zhang
Xinwen Liu
Kaiyan Li
S. Kevin Zhou
230
1
0
08 Mar 2024
Interactive Multi-Head Self-Attention with Linear Complexity
Hankyul Kang
Ming-Hsuan Yang
Jongbin Ryu
215
3
0
27 Feb 2024
SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging
Meng Lou
Hanning Ying
Xiaoqing Liu
Hong-Yu Zhou
Yuqing Zhang
Yizhou Yu
MedIm
281
21
0
27 Feb 2024
Interpretable Short-Term Load Forecasting via Multi-Scale Temporal Decomposition
Yuqi Jiang
Yan Li
Yize Chen
AI4TS
332
10
0
18 Feb 2024
Architecture Analysis and Benchmarking of 3D U-shaped Deep Learning Models for Thoracic Anatomical Segmentation
IEEE Access (IEEE Access), 2024
Arash Harirpoush
Amir Rasoulian
Marta Kersten-Oertel
Yiming Xiao
3DV
186
2
0
05 Feb 2024
CoBra: Complementary Branch Fusing Class and Semantic Knowledge for Robust Weakly Supervised Semantic Segmentation
Pattern Recognition (Pattern Recogn.), 2024
Woojung Han
Seil Kang
Kyobin Choo
Seong Jae Hwang
631
2
0
05 Feb 2024
Precise Knowledge Transfer via Flow Matching
Shitong Shao
Zhiqiang Shen
Linrui Gong
Huanran Chen
Xu Dai
277
2
0
03 Feb 2024
Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model
Zihan Zhong
Zhiqiang Tang
Tong He
Haoyang Fang
Chun Yuan
281
80
0
31 Jan 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Computer Vision and Pattern Recognition (CVPR), 2024
Seokju Yun
Youngmin Ro
ViT
399
91
0
29 Jan 2024
MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jianan Li
Shaocong Dong
Lihe Ding
Tingfa Xu
3DPC
273
12
0
22 Jan 2024
Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
Debesh Jha
Rekha Singhal
Ulas Bagci
210
3
0
18 Jan 2024
Efficient generative adversarial networks using linear additive-attention Transformers
Emilio Morales-Juarez
Gibran Fuentes Pineda
488
4
0
17 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
IEEE Transactions on Image Processing (TIP), 2024
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
244
7
0
11 Jan 2024