Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.14030
Cited By
v1
v2 (latest)
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Github (14835★)
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 8,640 papers shown
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Hai-Tao Zheng
Li Tao
Dun Liang
Haitao Zheng
752
123
0
07 Nov 2021
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
ViT
385
31
0
05 Nov 2021
Hepatic vessel segmentation based on 3D swin-transformer with inductive biased multi-head self-attention
BMC Medical Imaging (BMC Med Imaging), 2021
Mian Wu
Yinling Qian
Xiangyun Liao
Qiong Wang
Pheng-Ann Heng
MedIm
268
45
0
05 Nov 2021
Bootstrap Your Object Detector via Mixed Training
Neural Information Processing Systems (NeurIPS), 2021
Mengde Xu
Zheng Zhang
Fangyun Wei
Yutong Lin
Yue Cao
Stephen Lin
Han Hu
Xiang Bai
ObjD
295
6
0
04 Nov 2021
LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation
Weifu Fu
Congchong Nie
Ting Sun
Jing Liu
Tianliang Zhang
Yong Liu
ISeg
283
5
0
04 Nov 2021
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
359
442
0
03 Nov 2021
STC speaker recognition systems for the NIST SRE 2021
The Speaker and Language Recognition Workshop (SLR), 2021
Anastasia Avdeeva
Aleksei Gusev
Igor Korsunov
Alexander Kozlov
G. Lavrentyeva
...
Andrey Shulipa
Alisa Vinogradova
V. Volokhov
Evgeny Smirnov
Vasily Galyuk
214
18
0
03 Nov 2021
Can Vision Transformers Perform Convolution?
Shanda Li
Xiangning Chen
Di He
Cho-Jui Hsieh
ViT
284
23
0
02 Nov 2021
Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Jiaqi Gu
Hyoukjun Kwon
Dilin Wang
Wei Ye
Meng Li
Yu-Hsin Chen
Liangzhen Lai
Vikas Chandra
David Z. Pan
ViT
366
231
0
01 Nov 2021
Livestock Monitoring with Transformer
British Machine Vision Conference (BMVC), 2021
Bhavesh Tangirala
Ishan Bhandari
Dániel László
D. K. Gupta
R. Thomas
Devanshu Arya
329
11
0
01 Nov 2021
DPNET: Dual-Path Network for Efficient Object Detectioj with Lightweight Self-Attention
International Conference on Information Photonics (ICIP), 2021
Huiming Shi
Quan Zhou
Yinghao Ni
Xiaofu Wu
Longin Jan Latecki
ObjD
195
11
0
31 Oct 2021
PatchFormer: An Efficient Point Transformer with Patch Attention
Computer Vision and Pattern Recognition (CVPR), 2021
Zhang Cheng
Haocheng Wan
Xinyi Shen
Zizhao Wu
3DPC
563
90
0
30 Oct 2021
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
Neural Information Processing Systems (NeurIPS), 2021
Ji Lin
Wei-Ming Chen
Han Cai
Chuang Gan
Song Han
409
187
0
28 Oct 2021
Blending Anti-Aliasing into Vision Transformer
Neural Information Processing Systems (NeurIPS), 2021
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
267
25
0
28 Oct 2021
Dispensed Transformer Network for Unsupervised Domain Adaptation
Yunxiang Li
Jingxiong Li
Ruilong Dan
Shuai Wang
Kai Jin
...
Qianni Zhang
Huiyu Zhou
Qun Jin
Li Wang
Yaqi Wang
OOD
MedIm
244
5
0
28 Oct 2021
3D Object Tracking with Transformer
British Machine Vision Conference (BMVC), 2021
Yubo Cui
Zheng Fang
Jiayao Shan
Zuoxu Gu
Sifan Zhou
ViT
3DPC
221
75
0
28 Oct 2021
A Survey of Self-Supervised and Few-Shot Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Gabriel Huang
I. Laradji
David Vazquez
Damien Scieur
Pau Rodríguez López
ObjD
308
108
0
27 Oct 2021
GenURL: A General Framework for Unsupervised Representation Learning
Siyuan Li
Zicheng Liu
Z. Zang
Di Wu
Zhiyuan Chen
Stan Z. Li
OOD
3DGS
OffRL
376
13
0
27 Oct 2021
A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization
Ji Woong Yu
Min Young Ha
Bumjoon Seo
Won Bo Lee
88
0
0
27 Oct 2021
Video-based fully automatic assessment of open surgery suturing skills
Adam Goldbraikh
Anne-Lise D. D’Angelo
C. Pugh
S. Laufer
226
34
0
26 Oct 2021
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Jiarong Xing
Leyuan Wang
Shang Zhang
Jack H Chen
Ang Chen
Yibo Zhu
218
57
0
25 Oct 2021
Gophormer: Ego-Graph Transformer for Node Classification
Jianan Zhao
Chaozhuo Li
Qian Wen
Yiqi Wang
Yuming Liu
Hao Sun
Xing Xie
Yanfang Ye
229
96
0
25 Oct 2021
MVT: Multi-view Vision Transformer for 3D Object Recognition
British Machine Vision Conference (BMVC), 2021
Shuo Chen
Tan Yu
Ping Li
ViT
154
58
0
25 Oct 2021
The Efficiency Misnomer
International Conference on Learning Representations (ICLR), 2021
Daoyuan Chen
Liuyi Yao
Dawei Gao
Ashish Vaswani
Yaliang Li
382
115
0
25 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Neural Information Processing Systems (NeurIPS), 2021
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schon
Li Zhang
331
204
0
22 Oct 2021
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution
Yuming Du
Wenqian. Guo
Yang Xiao
Vincent Lepetit
VGen
229
4
0
22 Oct 2021
Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image Classification
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021
Xizhe Xue
Haokui Zhang
Bei Fang
Zongwen Bai
Ying Li
ViT
336
33
0
21 Oct 2021
Vis-TOP: Visual Transformer Overlay Processor
Wei Hu
Dian Xu
Zimeng Fan
Fang Liu
Yanxiang He
BDL
ViT
291
5
0
21 Oct 2021
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs
Journal of machine learning research (JMLR), 2021
Kaichao You
Yong Liu
Ziyang Zhang
Jianmin Wang
Sai Li
Mingsheng Long
465
39
0
20 Oct 2021
Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning
Jianze Wei
Huaibo Huang
Muyi Sun
Yunlong Wang
Min Ren
Ran He
Zhenan Sun
188
8
0
20 Oct 2021
1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021
Yuming Du
Wen Guo
Yang Xiao
Vincent Lepetit
139
10
0
19 Oct 2021
Towards Toxic and Narcotic Medication Detection with Rotated Object Detector
Jiao Peng
Feifan Wang
Zhongqiang Fu
Yiying Hu
Zichen Chen
Xinghan Zhou
Lijun Wang
MedIm
257
2
0
19 Oct 2021
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
403
314
0
18 Oct 2021
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers
Z. Shi
Zhao Meng
Yiran Xing
Yunpu Ma
Roger Wattenhofer
ViT
252
38
0
17 Oct 2021
Towards Language-guided Visual Recognition via Dynamic Convolutions
Gen Luo
Weihao Ye
Xiaoshuai Sun
Yongjian Wu
Yue Gao
Rongrong Ji
ObjD
276
29
0
17 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
701
253
0
16 Oct 2021
COVID-19 Detection in Chest X-ray Images Using Swin-Transformer and Transformer in Transformer
Juntao Jiang
Shuyi Lin
ViT
MedIm
148
14
0
16 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
142
15
0
15 Oct 2021
Receptive Field Broadening and Boosting for Salient Object Detection
Mingcan Ma
Changqun Xia
Chenxi Xie
Xiaowu Chen
Jia Li
ViT
260
3
0
15 Oct 2021
Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation
Hangcheng Dong
Jing-Xiao Liao
Yan Wang
Yixin Chen
Bingguo Liu
Dong Ye
Guodong Liu
712
0
0
14 Oct 2021
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Pei Sun
Yi Jiang
Dongdong Yu
Fucheng Weng
Zehuan Yuan
Ping Luo
Wenyu Liu
Xinggang Wang
VOT
719
2,156
0
13 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
European Conference on Computer Vision (ECCV), 2021
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
462
42
0
12 Oct 2021
Satellite Image Semantic Segmentation
Eric Guérin
Killian Oechslin
Christian Wolf
Benoît Martinez
139
8
0
12 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Chongjian Ge
Youwei Liang
Yibing Song
Jianbo Jiao
Jue Wang
Ping Luo
ViT
155
36
0
11 Oct 2021
Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block
Durvesh Malpure
Onkar Litake
Rajesh S. Ingle
ViT
153
9
0
11 Oct 2021
Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types
Shentong Mo
Xiao Fu
Chenyang Hong
Yizhen Chen
Yuxuan Zheng
Xiangru Tang
Zhiqiang Shen
Eric Xing
Yanyan Lan
AI4CE
189
27
0
11 Oct 2021
Efficient Training of Audio Transformers with Patchout
Interspeech (Interspeech), 2021
Khaled Koutini
Jan Schluter
Hamid Eghbalzadeh
Gerhard Widmer
ViT
600
372
0
11 Oct 2021
Global Vision Transformer Pruning with Hessian-Aware Saliency
Computer Vision and Pattern Recognition (CVPR), 2021
Huanrui Yang
Hongxu Yin
Maying Shen
Pavlo Molchanov
Hai Helen Li
Jan Kautz
ViT
317
86
0
10 Oct 2021
Google Landmark Retrieval 2021 Competition Third Place Solution
Qishen Ha
Bo Liu
Hongwei Zhang
3DV
3DPC
144
2
0
09 Oct 2021
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Xin Liu
B. Hill
Ziheng Jiang
Shwetak N. Patel
Daniel J. McDuff
3DH
MedIm
361
154
0
09 Oct 2021
Previous
1
2
3
...
166
167
168
...
171
172
173
Next
Page 167 of 173
Page
of 173
Go