Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.14030
Cited By
v1
v2 (latest)
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Github (14835★)
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 8,525 papers shown
ProTo: Program-Guided Transformer for Program-Guided Tasks
Zelin Zhao
Karan Samel
Binghong Chen
Le Song
ViT
LM&Ro
260
32
0
02 Oct 2021
Instance Segmentation Challenge Track Technical Report, VIPriors Workshop at ICCV 2021: Task-Specific Copy-Paste Data Augmentation Method for Instance Segmentation
Jahongir Yunusov
Shohruh Rakhmatov
A. Namozov
Abdulaziz Gaybulayev
Tae-Hyong Kim
ISeg
162
6
0
01 Oct 2021
3rd Place Scheme on Instance Segmentation Track of ICCV 2021 VIPriors Challenges
Pengyu Chen
Wanhua Li
ISeg
309
1
0
01 Oct 2021
GT U-Net: A U-Net Like Group Transformer Network for Tooth Root Segmentation
Yunxiang Li
Shuai Wang
Jun Wang
G. Zeng
Wenjun Liu
Qianni Zhang
Qun Jin
Yaqi Wang
ViT
MedIm
190
62
0
30 Sep 2021
CCTrans: Simplifying and Improving Crowd Counting with Transformer
Ye Tian
Xiangxiang Chu
Hongpeng Wang
ViT
227
102
0
29 Sep 2021
UFO-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song
ViT
325
27
0
29 Sep 2021
Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds
Fangcen Liu
Chenqiang Gao
Fangge Chen
Deyu Meng
W. Zuo
Xinbo Gao
ViT
165
39
0
29 Sep 2021
A Systematic Survey of Deep Learning-based Single-Image Super-Resolution
Juncheng Li
Zehua Pei
Wenjie Li
Guangwei Gao
Longguang Wang
Yingqian Wang
T. Zeng
397
59
0
29 Sep 2021
Localizing Objects with Self-Supervised Transformers and no Labels
Oriane Siméoni
Gilles Puy
Huy V. Vo
Simon Roburin
Spyros Gidaris
Andrei Bursuc
P. Pérez
Renaud Marlet
Jean Ponce
ViT
451
248
0
29 Sep 2021
BiTr-Unet: a CNN-Transformer Combined Network for MRI Brain Tumor Segmentation
Qiran Jia
Hai Shu
ViT
MedIm
236
101
0
25 Sep 2021
Long-Range Transformers for Dynamic Spatiotemporal Forecasting
J. E. Grigsby
Zhe Wang
Nam Nguyen
Yanjun Qi
AI4TS
320
117
0
24 Sep 2021
LGD: Label-guided Self-distillation for Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2021
Peizhen Zhang
Zijian Kang
Tong Yang
Xinming Zhang
N. Zheng
Jian Sun
ObjD
412
37
0
23 Sep 2021
End-to-End Dense Video Grounding via Parallel Regression
Computer Vision and Image Understanding (CVIU), 2021
Fengyuan Shi
Weilin Huang
Limin Wang
375
12
0
23 Sep 2021
DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers
Changlin Li
Guangrun Wang
Bing Wang
Xiaodan Liang
Zhihui Li
Xiaojun Chang
203
9
0
21 Sep 2021
SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction
Zekun Li
Yufan Liu
Bing Li
Weiming Hu
Kebin Wu
Chengwei Peng
ViT
144
24
0
18 Sep 2021
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery
Libo Wang
Rui Li
Ce Zhang
Shenghui Fang
Chenxi Duan
Xiaoliang Meng
P. M. Atkinson
ViT
370
991
0
18 Sep 2021
Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition
Zhengyao Wen
Wen-Long Lin
Tao Wang
Ge Xu
CVBM
444
268
0
15 Sep 2021
MISSFormer: An Effective Medical Image Segmentation Transformer
Xiaohong Huang
Zhifang Deng
Dandan Li
Xueguang Yuan
ViT
MedIm
344
264
0
15 Sep 2021
Anchor DETR: Query Design for Transformer-Based Object Detection
Yingming Wang
Xinming Zhang
Tong Yang
Jian Sun
ViT
209
70
0
15 Sep 2021
Complementary Feature Enhanced Network with Vision Transformer for Image Dehazing
Dong Zhao
Jia Li
Hongyu Li
Longhao Xu
ViT
229
23
0
15 Sep 2021
Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer
Fushun Zhu
Shan Zhao
Peng Wang
Hao Wang
Hua Yan
Shuaicheng Liu
ViT
142
20
0
14 Sep 2021
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation
Tongkun Xu
Weihua Chen
Pichao Wang
Fan Wang
Hao Li
Rong Jin
ViT
601
277
0
13 Sep 2021
Compute and Energy Consumption Trends in Deep Learning Inference
Radosvet Desislavov
Fernando Martínez-Plumed
José Hernández-Orallo
190
167
0
12 Sep 2021
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Chuanxin Tang
Yucheng Zhao
Guangting Wang
Chong Luo
Wenxuan Xie
Wenjun Zeng
MoE
ViT
204
119
0
12 Sep 2021
RobustART: Benchmarking Robustness on Architecture Design and Training Techniques
Shiyu Tang
Yazhe Niu
Yan Wang
Aishan Liu
Jinyang Guo
...
Xianglong Liu
Basel Alomair
Alan Yuille
Juil Sock
Dacheng Tao
VLM
AAML
315
122
0
11 Sep 2021
LibFewShot: A Comprehensive Library for Few-shot Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Wenbin Li
Ziyi
Ziyi Wang
Xuesong Yang
C. Dong
...
Jing Huo
Yinghuan Shi
Lei Wang
Yang Gao
Jiebo Luo
VLM
413
83
0
10 Sep 2021
Is Attention Better Than Matrix Decomposition?
International Conference on Learning Representations (ICLR), 2021
Zhengyang Geng
Meng-Hao Guo
Hongxu Chen
Xia Li
Ke Wei
Zhouchen Lin
294
166
0
09 Sep 2021
ConvMLP: Hierarchical Convolutional MLPs for Vision
Jiachen Li
Ali Hassani
Steven Walton
Humphrey Shi
178
85
0
09 Sep 2021
UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2021
Haonan Wang
Peng Cao
Yuan Liu
Osmar R. Zaiane
MedIm
ViT
417
1,029
0
09 Sep 2021
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zhiqi Li
Wenhai Wang
Enze Xie
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
Tong Lu
ViT
410
175
0
08 Sep 2021
Scaled ReLU Matters for Training Vision Transformers
AAAI Conference on Artificial Intelligence (AAAI), 2021
Pichao Wang
Qingsong Wen
Haowen Luo
Jingkai Zhou
Zhipeng Zhou
Fan Wang
Hao Li
Rong Jin
253
52
0
08 Sep 2021
Axial multi-layer perceptron architecture for automatic segmentation of choroid plexus in multiple sclerosis
bioRxiv (bioRxiv), 2021
Marius Schmidt-Mengin
Vito A G Ricigliano
B. Bodini
E. Morena
A. Colombi
Mariem Hamzaoui
Arya Yazdan Panah
B. Stankoff
O. Colliot
85
16
0
08 Sep 2021
nnFormer: Interleaved Transformer for Volumetric Segmentation
IEEE Transactions on Image Processing (TIP), 2021
Hong-Yu Zhou
J. Guo
Yinghao Zhang
Lequan Yu
Liansheng Wang
Yizhou Yu
ViT
MedIm
547
542
0
07 Sep 2021
Ultra-high Resolution Image Segmentation via Locality-aware Context Fusion and Alternating Local Enhancement
International Journal of Computer Vision (IJCV), 2021
Wenxi Liu
Qi Li
Xin Lin
Weixiang Yang
Shengfeng He
Yuanlong Yu
404
15
0
06 Sep 2021
Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing
Xingjian He
Weining Wang
Zhiyong Xu
Hao Wang
Jie Jiang
Jing Liu
110
2
0
06 Sep 2021
Semantic Segmentation on VSPW Dataset through Aggregation of Transformer Models
Zixuan Chen
Junhong Zou
Xiaotao Wang
ViT
162
2
0
03 Sep 2021
Benchmarking the Robustness of Instance Segmentation Models
Said Fahri Altindis
Yusuf Dalva
Hamza Pehlivan
Aysegül Dündar
VLM
OOD
580
17
0
02 Sep 2021
Searching for Efficient Multi-Stage Vision Transformers
Yi-Lun Liao
S. Karaman
Vivienne Sze
ViT
113
19
0
01 Sep 2021
Unsupervised Person Re-Identification: A Systematic Survey of Challenges and Solutions
Xiangtan Lin
Pengzhen Ren
C. Yeh
Weitong Chen
A. Song
Xiaojun Chang
345
21
0
01 Sep 2021
Hire-MLP: Vision MLP via Hierarchical Rearrangement
Computer Vision and Pattern Recognition (CVPR), 2021
Jianyuan Guo
Yehui Tang
Kai Han
Xinghao Chen
Han Wu
Chao Xu
Chang Xu
Yunhe Wang
282
115
0
30 Aug 2021
LUAI Challenge 2021 on Learning to Understand Aerial Images
Guisong Xia
Jian Ding
Ming Qian
Nan Xue
Jiaming Han
...
Zhuojun Dong
Jie Zhang
Qianyue Bao
Zixiao Zhang
Fang Liu
147
1
0
30 Aug 2021
Exploring and Improving Mobile Level Vision Transformers
Pengguang Chen
Yixin Chen
Shu Liu
Ming-Hsuan Yang
Jiaya Jia
ViT
255
4
0
30 Aug 2021
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP
Yucheng Zhao
Guangting Wang
Chuanxin Tang
Chong Luo
Wenjun Zeng
Zhengjun Zha
175
91
0
30 Aug 2021
Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation
Cam-Ngoan Nguyen
Zuhayr Asad
Yuankai Huo
ViT
MedIm
159
39
0
26 Aug 2021
TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios
Xingkui Zhu
Shuchang Lyu
Xu Wang
Qi Zhao
175
1,573
0
26 Aug 2021
Transformer for Single Image Super-Resolution
Zhisheng Lu
Juncheng Li
Hong Liu
Chao Huang
Linlin Zhang
T. Zeng
ViT
267
465
0
25 Aug 2021
Improving Object Detection by Label Assignment Distillation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Chuong H. Nguyen
Thuy C. Nguyen
Tuan N. Tang
N. Phan
321
54
0
24 Aug 2021
SwinIR: Image Restoration Using Swin Transformer
Christos Sakaridis
Jie Cao
Guolei Sun
Lucas Beerens
Luc Van Gool
Radu Timofte
ViT
479
4,098
0
23 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
270
103
0
20 Aug 2021
Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
Kailai Li
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
193
82
0
20 Aug 2021
Previous
1
2
3
...
165
166
167
...
169
170
171
Next
Page 166 of 171
Page
of 171
Go