ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.15193
  4. Cited By
Shunted Self-Attention via Multi-Scale Token Aggregation

Shunted Self-Attention via Multi-Scale Token Aggregation

30 November 2021
Sucheng Ren
Daquan Zhou
Shengfeng He
Jiashi Feng
Xinchao Wang
    ViT
ArXivPDFHTML

Papers citing "Shunted Self-Attention via Multi-Scale Token Aggregation"

50 / 86 papers shown
Title
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
60
0
0
06 May 2025
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis
Zhu Zhu
Shuo Jiang
Jingyuan Zheng
Yawen Li
Yifei Chen
Manli Zhao
Weizhong Gu
Feiwei Qin
Jinhu Wang
Gang Yu
MedIm
33
0
0
18 Apr 2025
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention
Yonghao Huang
Leiting Chen
Chuan Zhou
ViT
MedIm
26
0
0
12 Apr 2025
Multi-modal and Multi-view Fundus Image Fusion for Retinopathy Diagnosis via Multi-scale Cross-attention and Shifted Window Self-attention
Multi-modal and Multi-view Fundus Image Fusion for Retinopathy Diagnosis via Multi-scale Cross-attention and Shifted Window Self-attention
Yonghao Huang
Leiting Chen
Chuan Zhou
16
0
0
12 Apr 2025
Mixed-granularity Implicit Representation for Continuous Hyperspectral Compressive Reconstruction
Mixed-granularity Implicit Representation for Continuous Hyperspectral Compressive Reconstruction
Jianan Li
Huan Chen
Wangcai Zhao
Rui Chen
Tingfa Xu
59
0
0
17 Mar 2025
RhythmFormer: Extracting Patterned rPPG Signals based on Periodic Sparse Attention
RhythmFormer: Extracting Patterned rPPG Signals based on Periodic Sparse Attention
Bochao Zou
Zizheng Guo
Jiansheng Chen
Junbao Zhuo
Weiran Huang
Huimin Ma
ViT
AI4TS
105
1
0
21 Feb 2025
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature
  Extraction and Interaction with Low-Resolution Images
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images
Xiangyong Lu
Masanori Suganuma
Takayuki Okatani
67
0
0
03 Dec 2024
Breaking the Low-Rank Dilemma of Linear Attention
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
28
0
0
12 Nov 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing
  Attention
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
23
2
0
11 Oct 2024
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision
  Mamba and Transformer Networks
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
40
5
0
15 Sep 2024
PSTNet: Enhanced Polyp Segmentation with Multi-scale Alignment and
  Frequency Domain Integration
PSTNet: Enhanced Polyp Segmentation with Multi-scale Alignment and Frequency Domain Integration
Wenhao Xu
Rongtao Xu
Changwei Wang
Xiuli Li
Shibiao Xu
Li Guo
32
3
0
13 Sep 2024
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Yonghao Yu
Dongcheng Zhao
Guobin Shen
Yiting Dong
Yi Zeng
40
0
0
11 Sep 2024
Embedding-Free Transformer with Inference Spatial Reduction for
  Efficient Semantic Segmentation
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
28
2
0
24 Jul 2024
MxT: Mamba x Transformer for Image Inpainting
MxT: Mamba x Transformer for Image Inpainting
Shuang Chen
Amir Atapour-Abarghouei
Haozheng Zhang
Hubert P. H. Shum
Mamba
32
2
0
23 Jul 2024
Rethinking Remote Sensing Change Detection With A Mask View
Rethinking Remote Sensing Change Detection With A Mask View
Xiaowen Ma
Zhenkai Wu
Rongrong Lian
Wei Zhang
Siyang Song
27
3
0
21 Jun 2024
Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D
  MRI Scans
Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans
Muthukumar K A
Amit Gurung
Priya Ranjan
27
1
0
09 Jun 2024
ARVideo: Autoregressive Pretraining for Self-Supervised Video
  Representation Learning
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning
Sucheng Ren
Hongru Zhu
Chen Wei
Yijiang Li
Alan L. Yuille
Cihang Xie
AI4TS
VGen
SSL
49
1
0
24 May 2024
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for
  Vision Transformer
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
39
3
0
22 May 2024
Vision Transformer with Sparse Scan Prior
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
36
4
0
22 May 2024
Multi-Scale Representations by Varying Window Attention for Semantic
  Segmentation
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan
Ming Wu
Chuang Zhang
21
12
0
25 Apr 2024
HRVDA: High-Resolution Visual Document Assistant
HRVDA: High-Resolution Visual Document Assistant
Chaohu Liu
Kun Yin
Haoyu Cao
Xinghua Jiang
Xin Li
Yinsong Liu
Deqiang Jiang
Xing Sun
Linli Xu
VLM
35
23
0
10 Apr 2024
Unsegment Anything by Simulating Deformation
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
29
4
0
03 Apr 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
26
86
0
26 Mar 2024
FViT: A Focal Vision Transformer with Gabor Filter
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
47
4
0
17 Feb 2024
MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D
  Object Detection
MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection
Jianan Li
Shaocong Dong
Lihe Ding
Tingfa Xu
3DPC
19
7
0
22 Jan 2024
MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation
MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation
Long Xu
Shanghong Li
Yongquan Chen
Jun Luo
Shiwu Lai
12
0
0
09 Jan 2024
Factorization Vision Transformer: Modeling Long Range Dependency with
  Local Window Cost
Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost
Haolin Qin
Daquan Zhou
Tingfa Xu
Ziyang Bian
Jianan Li
21
9
0
14 Dec 2023
BACTrack: Building Appearance Collection for Aerial Tracking
BACTrack: Building Appearance Collection for Aerial Tracking
Xincong Liu
Tingfa Xu
Ying Wang
Zhinong Yu
Xiaoying Yuan
Haolin Qin
Jianan Li
36
6
0
11 Dec 2023
Advancing Vision Transformers with Group-Mix Attention
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Li Yuan
Jiangliu Wang
Yibing Song
Ping Luo
112
16
0
26 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
31
35
0
30 Oct 2023
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from
  Single-shot Binocular NIR Images
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images
Jiancheng Huang
Donghao Zhou
Shifeng Chen
CVBM
27
2
0
29 Sep 2023
Video Adverse-Weather-Component Suppression Network via Weather
  Messenger and Adversarial Backpropagation
Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation
Yijun Yang
Angelica I. Aviles-Rivero
H. Fu
Ye Liu
Weiming Wang
Lei Zhu
ViT
14
14
0
24 Sep 2023
CINFormer: Transformer network with multi-stage CNN feature injection
  for surface defect segmentation
CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Xiaoheng Jiang
Kaiyi Guo
Yang Lu
Feng Yan
Hao Liu
Jiale Cao
Mingliang Xu
Dacheng Tao
MedIm
ViT
UQCV
13
1
0
22 Sep 2023
RMT: Retentive Networks Meet Vision Transformers
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
30
73
0
20 Sep 2023
Priority-Centric Human Motion Generation in Discrete Latent Space
Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong
Kehong Gong
Dongze Lian
Michael Bi Mi
Xinchao Wang
DiffM
20
49
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
26
20
0
27 Aug 2023
Boosting Semantic Segmentation from the Perspective of Explicit Class
  Embeddings
Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings
Yuhe Liu
Chuanjian Liu
Kai Han
Quan Tang
Zengchang Qin
14
5
0
24 Aug 2023
SG-Former: Self-guided Transformer with Evolving Token Reallocation
SG-Former: Self-guided Transformer with Evolving Token Reallocation
Sucheng Ren
Xingyi Yang
Songhua Liu
Xinchao Wang
ViT
25
39
0
23 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
13
1
0
10 Aug 2023
MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical
  Image Segmentation
MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation
Liang Xu
Mingxi Chen
Yiyu Cheng
Pengfei Shao
Shuwei Shen
Peng Yao
Ronald X. Xu
ViT
28
0
0
27 Jul 2023
RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching
  Detection
RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection
Qichao Ying
Jiaxin Liu
Sheng Li
Haisheng Xu
Zhenxing Qian
Xinpeng Zhang
CVBM
9
7
0
20 Jul 2023
Scale-Aware Modulation Meet Transformer
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
14
65
0
17 Jul 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and
  Outlook of Recent Work
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
21
0
0
02 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
27
27
0
01 Jun 2023
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and
  Progressive Shift
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and Progressive Shift
Gaojie Wu
Weishi Zheng
Yutong Lu
Q. Tian
ViT
38
13
0
07 Apr 2023
TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration
TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration
Kehong Gong
Dongze Lian
Heng Chang
Chuan Guo
Zihang Jiang
X. Zuo
Michael Bi Mi
Xinchao Wang
11
56
0
05 Apr 2023
CNNs with Multi-Level Attention for Domain Generalization
CNNs with Multi-Level Attention for Domain Generalization
Aristotelis Ballas
Christos Diou
OOD
14
6
0
02 Apr 2023
APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud
  Understanding
APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud Understanding
Hengjia Li
Tu Zheng
Zhihao Chi
Zheng Yang
Wenxiao Wang
Boxi Wu
Binbin Lin
Deng Cai
3DPC
30
1
0
31 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
34
117
0
29 Mar 2023
WM-MoE: Weather-aware Multi-scale Mixture-of-Experts for Blind Adverse
  Weather Removal
WM-MoE: Weather-aware Multi-scale Mixture-of-Experts for Blind Adverse Weather Removal
Yulin Luo
Rui Zhao
Xi Wei
Jinwei Chen
Yijie Lu
Shenghao Xie
Tianyu Wang
Ruiqin Xiong
Ming Lu
Shanghang Zhang
18
3
0
24 Mar 2023
12
Next