Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2108.03428
Cited By
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
7 August 2021
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Mengwei He
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PSViT: Better Vision Transformer via Token Pooling and Attention Sharing"
21 / 21 papers shown
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
1.2K
2
0
06 May 2025
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
326
2
0
05 Sep 2024
TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based Computing
IEEE Transactions on Emerging Topics in Computing (IEEE TETC), 2024
Abhishek Moitra
Abhiroop Bhattacharjee
Youngeun Kim
Priyadarshini Panda
ViT
235
3
0
22 Aug 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViT
MDE
557
49
0
28 Jul 2024
DiTFastAttn: Attention Compression for Diffusion Transformer Models
Zhihang Yuan
Pu Lu
Hanling Zhang
Xuefei Ning
Linfeng Zhang
Tianchen Zhao
Shengen Yan
Guohao Dai
Yu Wang
341
90
0
12 Jun 2024
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLR
ViT
359
21
0
18 May 2024
MLP Can Be A Good Transformer Learner
Sihao Lin
Pumeng Lyu
Dongrui Liu
Tao Tang
Xiaodan Liang
Andy Song
Xiaojun Chang
ViT
209
25
0
08 Apr 2024
MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory
Neural Information Processing Systems (NeurIPS), 2023
Yinan Liang
Ziwei Wang
Xiuwei Xu
Yansong Tang
Jie Zhou
Jiwen Lu
332
20
0
25 Oct 2023
Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network
IEEE Transactions on Image Processing (IEEE TIP), 2023
Zizhuo Li
Jiayi Ma
348
8
0
04 Jul 2023
Lightweight Monocular Depth Estimation via Token-Sharing Transformer
IEEE International Conference on Robotics and Automation (ICRA), 2023
Dong-Jae Lee
Jae Young Lee
Hyounguk Shon
Eojindl Yi
Yeong-Hun Park
Sung-Jin Cho
Junmo Kim
ViT
MDE
209
6
0
09 Jun 2023
IMP: Iterative Matching and Pose Estimation with Adaptive Pooling
Computer Vision and Pattern Recognition (CVPR), 2023
Fei Xue
Ignas Budvytis
R. Cipolla
375
19
0
28 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Computer Vision and Pattern Recognition (CVPR), 2023
Mingyu Ding
Songlin Yang
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
285
20
0
06 Apr 2023
Effective Vision Transformer Training: A Data-Centric Perspective
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
215
7
0
29 Sep 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
Fan Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
236
37
0
24 Mar 2022
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Yang He
Weihan Liang
Dongyang Zhao
Hong-Yu Zhou
Weifeng Ge
Yizhou Yu
Wenqiang Zhang
ViT
291
61
0
17 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
European Conference on Computer Vision (ECCV), 2022
Boyu Chen
Peixia Li
Mengwei He
Leixian Qiao
Qiuhong Shen
Yue Liu
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
334
295
0
10 Mar 2022
Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped Attention
AAAI Conference on Artificial Intelligence (AAAI), 2021
Sitong Wu
Tianyi Wu
Hao Hao Tan
G. Guo
ViT
274
83
0
28 Dec 2021
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
European Conference on Computer Vision (ECCV), 2021
Zhenglun Kong
Zhaoyang Han
Xiaolong Ma
Xin Meng
Mengshu Sun
...
Geng Yuan
Bin Ren
Minghai Qin
Hao Tang
Yanzhi Wang
ViT
378
209
0
27 Dec 2021
ELSA: Enhanced Local Self-Attention for Vision Transformer
Jingkai Zhou
Pichao Wang
Fan Wang
Qiong Liu
Hao Li
Rong Jin
ViT
289
44
0
23 Dec 2021
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation
Tongkun Xu
Weihua Chen
Pichao Wang
Fan Wang
Hao Li
Rong Jin
ViT
782
289
0
13 Sep 2021
Scaled ReLU Matters for Training Vision Transformers
AAAI Conference on Artificial Intelligence (AAAI), 2021
Pichao Wang
Qingsong Wen
Haowen Luo
Jingkai Zhou
Zhipeng Zhou
Fan Wang
Hao Li
Rong Jin
285
53
0
08 Sep 2021
1
Page 1 of 1