Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,640 papers shown
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Hai-Tao Zheng
Li Tao
Dun Liang
Haitao Zheng
752
123
0
07 Nov 2021
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
ViT
385
31
0
05 Nov 2021
Hepatic vessel segmentation based on 3D swin-transformer with inductive biased multi-head self-attention
BMC Medical Imaging (BMC Med Imaging), 2021
Mian Wu
Yinling Qian
Xiangyun Liao
Qiong Wang
Pheng-Ann Heng
MedIm
268
45
0
05 Nov 2021
Bootstrap Your Object Detector via Mixed Training
Neural Information Processing Systems (NeurIPS), 2021
Mengde Xu
Zheng Zhang
Fangyun Wei
Yutong Lin
Yue Cao
Stephen Lin
Han Hu
Xiang Bai
ObjD
295
6
0
04 Nov 2021
LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation
Weifu Fu
Congchong Nie
Ting Sun
Jing Liu
Tianliang Zhang
Yong Liu
ISeg
283
5
0
04 Nov 2021
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
359
442
0
03 Nov 2021
STC speaker recognition systems for the NIST SRE 2021
The Speaker and Language Recognition Workshop (SLR), 2021
Anastasia Avdeeva
Aleksei Gusev
Igor Korsunov
Alexander Kozlov
G. Lavrentyeva
...
Andrey Shulipa
Alisa Vinogradova
V. Volokhov
Evgeny Smirnov
Vasily Galyuk
214
18
0
03 Nov 2021
Can Vision Transformers Perform Convolution?
Shanda Li
Xiangning Chen
Di He
Cho-Jui Hsieh
ViT
284
23
0
02 Nov 2021
Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Jiaqi Gu
Hyoukjun Kwon
Dilin Wang
Wei Ye
Meng Li
Yu-Hsin Chen
Liangzhen Lai
Vikas Chandra
David Z. Pan
ViT
366
231
0
01 Nov 2021
Livestock Monitoring with Transformer
British Machine Vision Conference (BMVC), 2021
Bhavesh Tangirala
Ishan Bhandari
Dániel László
D. K. Gupta
R. Thomas
Devanshu Arya
329
11
0
01 Nov 2021
DPNET: Dual-Path Network for Efficient Object Detection with Lightweight Self-Attention
International Conference on Information Photonics (ICIP), 2021
Huiming Shi
Quan Zhou
Yinghao Ni
Xiaofu Wu
Longin Jan Latecki
ObjD
195
11
0
31 Oct 2021
PatchFormer: An Efficient Point Transformer with Patch Attention
Computer Vision and Pattern Recognition (CVPR), 2021
Zhang Cheng
Haocheng Wan
Xinyi Shen
Zizhao Wu
3DPC
563
90
0
30 Oct 2021
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
Neural Information Processing Systems (NeurIPS), 2021
Ji Lin
Wei-Ming Chen
Han Cai
Chuang Gan
Song Han
409
187
0
28 Oct 2021
Blending Anti-Aliasing into Vision Transformer
Neural Information Processing Systems (NeurIPS), 2021
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
267
25
0
28 Oct 2021
Dispensed Transformer Network for Unsupervised Domain Adaptation
Yunxiang Li
Jingxiong Li
Ruilong Dan
Shuai Wang
Kai Jin
...
Qianni Zhang
Huiyu Zhou
Qun Jin
Li Wang
Yaqi Wang
OOD
MedIm
244
5
0
28 Oct 2021
3D Object Tracking with Transformer
British Machine Vision Conference (BMVC), 2021
Yubo Cui
Zheng Fang
Jiayao Shan
Zuoxu Gu
Sifan Zhou
ViT
3DPC
221
75
0
28 Oct 2021
A Survey of Self-Supervised and Few-Shot Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Gabriel Huang
I. Laradji
David Vazquez
Damien Scieur
Pau Rodríguez López
ObjD
308
108
0
27 Oct 2021
GenURL: A General Framework for Unsupervised Representation Learning
Siyuan Li
Zicheng Liu
Z. Zang
Di Wu
Zhiyuan Chen
Stan Z. Li
OOD
3DGS
OffRL
376
13
0
27 Oct 2021
A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization
Ji Woong Yu
Min Young Ha
Bumjoon Seo
Won Bo Lee
88
0
0
27 Oct 2021
Video-based fully automatic assessment of open surgery suturing skills
Adam Goldbraikh
Anne-Lise D. D’Angelo
C. Pugh
S. Laufer
226
34
0
26 Oct 2021
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Jiarong Xing
Leyuan Wang
Shang Zhang
Jack H Chen
Ang Chen
Yibo Zhu
218
57
0
25 Oct 2021
Gophormer: Ego-Graph Transformer for Node Classification
Jianan Zhao
Chaozhuo Li
Qian Wen
Yiqi Wang
Yuming Liu
Hao Sun
Xing Xie
Yanfang Ye
229
96
0
25 Oct 2021
MVT: Multi-view Vision Transformer for 3D Object Recognition
British Machine Vision Conference (BMVC), 2021
Shuo Chen
Tan Yu
Ping Li
ViT
154
58
0
25 Oct 2021
The Efficiency Misnomer
International Conference on Learning Representations (ICLR), 2021
Daoyuan Chen
Liuyi Yao
Dawei Gao
Ashish Vaswani
Yaliang Li
382
115
0
25 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Neural Information Processing Systems (NeurIPS), 2021
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schon
Li Zhang
331
204
0
22 Oct 2021
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution
Yuming Du
Wenqian. Guo
Yang Xiao
Vincent Lepetit
VGen
229
4
0
22 Oct 2021
Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image Classification
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021
Xizhe Xue
Haokui Zhang
Bei Fang
Zongwen Bai
Ying Li
ViT
336
33
0
21 Oct 2021
Vis-TOP: Visual Transformer Overlay Processor
Wei Hu
Dian Xu
Zimeng Fan
Fang Liu
Yanxiang He
BDL
ViT
291
5
0
21 Oct 2021
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs
Journal of Machine Learning Research (JMLR), 2021
Kaichao You
Yong Liu
Ziyang Zhang
Jianmin Wang
Sai Li
Mingsheng Long
465
39
0
20 Oct 2021
Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning
Jianze Wei
Huaibo Huang
Muyi Sun
Yunlong Wang
Min Ren
Ran He
Zhenan Sun
188
8
0
20 Oct 2021
1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021
Yuming Du
Wen Guo
Yang Xiao
Vincent Lepetit
139
10
0
19 Oct 2021
Towards Toxic and Narcotic Medication Detection with Rotated Object Detector
Jiao Peng
Feifan Wang
Zhongqiang Fu
Yiying Hu
Zichen Chen
Xinghan Zhou
Lijun Wang
MedIm
257
2
0
19 Oct 2021
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
403
314
0
18 Oct 2021
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers
Z. Shi
Zhao Meng
Yiran Xing
Yunpu Ma
Roger Wattenhofer
ViT
252
38
0
17 Oct 2021
Towards Language-guided Visual Recognition via Dynamic Convolutions
Gen Luo
Weihao Ye
Xiaoshuai Sun
Yongjian Wu
Yue Gao
Rongrong Ji
ObjD
276
29
0
17 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
701
253
0
16 Oct 2021
COVID-19 Detection in Chest X-ray Images Using Swin-Transformer and Transformer in Transformer
Juntao Jiang
Shuyi Lin
ViT
MedIm
148
14
0
16 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
142
15
0
15 Oct 2021
Receptive Field Broadening and Boosting for Salient Object Detection
Mingcan Ma
Changqun Xia
Chenxi Xie
Xiaowu Chen
Jia Li
ViT
260
3
0
15 Oct 2021
Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation
Hangcheng Dong
Jing-Xiao Liao
Yan Wang
Yixin Chen
Bingguo Liu
Dong Ye
Guodong Liu
712
0
0
14 Oct 2021
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Pei Sun
Yi Jiang
Dongdong Yu
Fucheng Weng
Zehuan Yuan
Ping Luo
Wenyu Liu
Xinggang Wang
VOT
719
2,156
0
13 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
European Conference on Computer Vision (ECCV), 2021
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
462
42
0
12 Oct 2021
Satellite Image Semantic Segmentation
Eric Guérin
Killian Oechslin
Christian Wolf
Benoît Martinez
139
8
0
12 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Chongjian Ge
Youwei Liang
Yibing Song
Jianbo Jiao
Jue Wang
Ping Luo
ViT
155
36
0
11 Oct 2021
Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block
Durvesh Malpure
Onkar Litake
Rajesh S. Ingle
ViT
153
9
0
11 Oct 2021
Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types
Shentong Mo
Xiao Fu
Chenyang Hong
Yizhen Chen
Yuxuan Zheng
Xiangru Tang
Zhiqiang Shen
Eric Xing
Yanyan Lan
AI4CE
189
27
0
11 Oct 2021
Efficient Training of Audio Transformers with Patchout
Interspeech (Interspeech), 2021
Khaled Koutini
Jan Schluter
Hamid Eghbalzadeh
Gerhard Widmer
ViT
600
372
0
11 Oct 2021
Global Vision Transformer Pruning with Hessian-Aware Saliency
Computer Vision and Pattern Recognition (CVPR), 2021
Huanrui Yang
Hongxu Yin
Maying Shen
Pavlo Molchanov
Hai Helen Li
Jan Kautz
ViT
317
86
0
10 Oct 2021
Google Landmark Retrieval 2021 Competition Third Place Solution
Qishen Ha
Bo Liu
Hongwei Zhang
3DV
3DPC
144
2
0
09 Oct 2021
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Xin Liu
B. Hill
Ziheng Jiang
Shwetak N. Patel
Daniel J. McDuff
3DH
MedIm
361
154
0
09 Oct 2021