ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.03428
  4. Cited By
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing

PSViT: Better Vision Transformer via Token Pooling and Attention Sharing

7 August 2021
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Mengwei He
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
    ViT
ArXiv (abs)PDFHTML

Papers citing "PSViT: Better Vision Transformer via Token Pooling and Attention Sharing"

21 / 21 papers shown
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
1.2K
2
0
06 May 2025
MVTN: A Multiscale Video Transformer Network for Hand Gesture
  Recognition
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
326
2
0
05 Sep 2024
TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based
  Computing
TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based ComputingIEEE Transactions on Emerging Topics in Computing (IEEE TETC), 2024
Abhishek Moitra
Abhiroop Bhattacharjee
Youngeun Kim
Priyadarshini Panda
ViT
235
3
0
22 Aug 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on
  Small Datasets
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViTMDE
557
49
0
28 Jul 2024
DiTFastAttn: Attention Compression for Diffusion Transformer Models
DiTFastAttn: Attention Compression for Diffusion Transformer Models
Zhihang Yuan
Pu Lu
Hanling Zhang
Xuefei Ning
Linfeng Zhang
Tianchen Zhao
Shengen Yan
Guohao Dai
Yu Wang
341
90
0
12 Jun 2024
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic
  Hand Gesture Recognition
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLRViT
359
21
0
18 May 2024
MLP Can Be A Good Transformer Learner
MLP Can Be A Good Transformer Learner
Sihao Lin
Pumeng Lyu
Dongrui Liu
Tao Tang
Xiaodan Liang
Andy Song
Xiaojun Chang
ViT
209
25
0
08 Apr 2024
MCUFormer: Deploying Vision Transformers on Microcontrollers with
  Limited Memory
MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited MemoryNeural Information Processing Systems (NeurIPS), 2023
Yinan Liang
Ziwei Wang
Xiuwei Xu
Yansong Tang
Jie Zhou
Jiwen Lu
332
20
0
25 Oct 2023
Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural
  Network
Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural NetworkIEEE Transactions on Image Processing (IEEE TIP), 2023
Zizhuo Li
Jiayi Ma
348
8
0
04 Jul 2023
Lightweight Monocular Depth Estimation via Token-Sharing Transformer
Lightweight Monocular Depth Estimation via Token-Sharing TransformerIEEE International Conference on Robotics and Automation (ICRA), 2023
Dong-Jae Lee
Jae Young Lee
Hyounguk Shon
Eojindl Yi
Yeong-Hun Park
Sung-Jin Cho
Junmo Kim
ViTMDE
209
6
0
09 Jun 2023
IMP: Iterative Matching and Pose Estimation with Adaptive Pooling
IMP: Iterative Matching and Pose Estimation with Adaptive PoolingComputer Vision and Pattern Recognition (CVPR), 2023
Fei Xue
Ignas Budvytis
R. Cipolla
375
19
0
28 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed
  Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed AttentionComputer Vision and Pattern Recognition (CVPR), 2023
Mingyu Ding
Songlin Yang
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
285
20
0
06 Apr 2023
Effective Vision Transformer Training: A Data-Centric Perspective
Effective Vision Transformer Training: A Data-Centric Perspective
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
215
7
0
29 Sep 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
Fan Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViTMedIm
236
37
0
24 Mar 2022
Attribute Surrogates Learning and Spectral Tokens Pooling in
  Transformers for Few-shot Learning
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot LearningComputer Vision and Pattern Recognition (CVPR), 2022
Yang He
Weihan Liang
Dongyang Zhao
Hong-Yu Zhou
Weifeng Ge
Yizhou Yu
Wenqiang Zhang
ViT
291
61
0
17 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object
  Tracking
Backbone is All Your Need: A Simplified Architecture for Visual Object TrackingEuropean Conference on Computer Vision (ECCV), 2022
Boyu Chen
Peixia Li
Mengwei He
Leixian Qiao
Qiuhong Shen
Yue Liu
Weihao Gan
Wei Wu
Wanli Ouyang
ViTVOT
334
295
0
10 Mar 2022
Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped
  Attention
Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped AttentionAAAI Conference on Artificial Intelligence (AAAI), 2021
Sitong Wu
Tianyi Wu
Hao Hao Tan
G. Guo
ViT
274
83
0
28 Dec 2021
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
SPViT: Enabling Faster Vision Transformers via Soft Token PruningEuropean Conference on Computer Vision (ECCV), 2021
Zhenglun Kong
Zhaoyang Han
Xiaolong Ma
Xin Meng
Mengshu Sun
...
Geng Yuan
Bin Ren
Minghai Qin
Hao Tang
Yanzhi Wang
ViT
378
209
0
27 Dec 2021
ELSA: Enhanced Local Self-Attention for Vision Transformer
ELSA: Enhanced Local Self-Attention for Vision Transformer
Jingkai Zhou
Pichao Wang
Fan Wang
Qiong Liu
Hao Li
Rong Jin
ViT
289
44
0
23 Dec 2021
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation
Tongkun Xu
Weihua Chen
Pichao Wang
Fan Wang
Hao Li
Rong Jin
ViT
782
289
0
13 Sep 2021
Scaled ReLU Matters for Training Vision Transformers
Scaled ReLU Matters for Training Vision TransformersAAAI Conference on Artificial Intelligence (AAAI), 2021
Pichao Wang
Qingsong Wen
Haowen Luo
Jingkai Zhou
Zhipeng Zhou
Fan Wang
Hao Li
Rong Jin
285
53
0
08 Sep 2021
1
Page 1 of 1