2103.14030
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 8,575 papers shown
3D Object Tracking with Transformer
British Machine Vision Conference (BMVC), 2021
Yubo Cui
Zheng Fang
Jiayao Shan
Zuoxu Gu
Sifan Zhou
ViT
3DPC
180
74
0
28 Oct 2021
A Survey of Self-Supervised and Few-Shot Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Gabriel Huang
I. Laradji
David Vazquez
Damien Scieur
Pau Rodríguez López
ObjD
291
106
0
27 Oct 2021
GenURL: A General Framework for Unsupervised Representation Learning
Siyuan Li
Zicheng Liu
Z. Zang
Di Wu
Zhiyuan Chen
Stan Z. Li
OOD
3DGS
OffRL
353
13
0
27 Oct 2021
A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization
Ji Woong Yu
Min Young Ha
Bumjoon Seo
Won Bo Lee
78
0
0
27 Oct 2021
Video-based fully automatic assessment of open surgery suturing skills
Adam Goldbraikh
Anne-Lise D. D’Angelo
C. Pugh
S. Laufer
209
34
0
26 Oct 2021
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Jiarong Xing
Leyuan Wang
Shang Zhang
Jack H Chen
Ang Chen
Yibo Zhu
203
55
0
25 Oct 2021
Gophormer: Ego-Graph Transformer for Node Classification
Jianan Zhao
Chaozhuo Li
Qian Wen
Yiqi Wang
Yuming Liu
Hao Sun
Xing Xie
Yanfang Ye
228
95
0
25 Oct 2021
MVT: Multi-view Vision Transformer for 3D Object Recognition
British Machine Vision Conference (BMVC), 2021
Shuo Chen
Tan Yu
Ping Li
ViT
139
58
0
25 Oct 2021
The Efficiency Misnomer
International Conference on Learning Representations (ICLR), 2021
Mostafa Dehghani
Anurag Arnab
Lucas Beyer
Ashish Vaswani
Yi Tay
315
115
0
25 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Neural Information Processing Systems (NeurIPS), 2021
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schön
Li Zhang
251
197
0
22 Oct 2021
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution
Yuming Du
Wenqian Guo
Yang Xiao
Vincent Lepetit
VGen
209
4
0
22 Oct 2021
Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image Classification
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021
Xizhe Xue
Haokui Zhang
Bei Fang
Zongwen Bai
Ying Li
ViT
314
33
0
21 Oct 2021
Vis-TOP: Visual Transformer Overlay Processor
Wei Hu
Dian Xu
Zimeng Fan
Fang Liu
Yanxiang He
BDL
ViT
252
5
0
21 Oct 2021
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs
Journal of Machine Learning Research (JMLR), 2021
Kaichao You
Yong Liu
Ziyang Zhang
Jianmin Wang
Sai Li
Mingsheng Long
419
38
0
20 Oct 2021
Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning
Jianze Wei
Huaibo Huang
Muyi Sun
Yunlong Wang
Min Ren
Ran He
Zhenan Sun
175
8
0
20 Oct 2021
1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021
Yuming Du
Wen Guo
Yang Xiao
Vincent Lepetit
127
10
0
19 Oct 2021
Towards Toxic and Narcotic Medication Detection with Rotated Object Detector
Jiao Peng
Feifan Wang
Zhongqiang Fu
Yiying Hu
Zichen Chen
Xinghan Zhou
Lijun Wang
MedIm
243
2
0
19 Oct 2021
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
356
308
0
18 Oct 2021
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers
Z. Shi
Zhao Meng
Yiran Xing
Yunpu Ma
Roger Wattenhofer
ViT
212
38
0
17 Oct 2021
Towards Language-guided Visual Recognition via Dynamic Convolutions
Gen Luo
Weihao Ye
Xiaoshuai Sun
Yongjian Wu
Yue Gao
Rongrong Ji
ObjD
248
28
0
17 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
664
243
0
16 Oct 2021
COVID-19 Detection in Chest X-ray Images Using Swin-Transformer and Transformer in Transformer
Juntao Jiang
Shuyi Lin
ViT
MedIm
144
13
0
16 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
133
15
0
15 Oct 2021
Receptive Field Broadening and Boosting for Salient Object Detection
Mingcan Ma
Changqun Xia
Chenxi Xie
Xiaowu Chen
Jia Li
ViT
251
3
0
15 Oct 2021
Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation
Hangcheng Dong
Jing-Xiao Liao
Yan Wang
Yixin Chen
Bingguo Liu
Dong Ye
Guodong Liu
702
0
0
14 Oct 2021
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Pei Sun
Yi Jiang
Dongdong Yu
Fucheng Weng
Zehuan Yuan
Ping Luo
Wenyu Liu
Xinggang Wang
VOT
659
2,069
0
13 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
European Conference on Computer Vision (ECCV), 2021
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
432
41
0
12 Oct 2021
Satellite Image Semantic Segmentation
Eric Guérin
Killian Oechslin
Christian Wolf
Benoît Martinez
137
8
0
12 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Chongjian Ge
Youwei Liang
Yibing Song
Jianbo Jiao
Jue Wang
Ping Luo
ViT
140
36
0
11 Oct 2021
Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block
Durvesh Malpure
Onkar Litake
Rajesh S. Ingle
ViT
138
9
0
11 Oct 2021
Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types
Shentong Mo
Xiao Fu
Chenyang Hong
Yizhen Chen
Yuxuan Zheng
Xiangru Tang
Zhiqiang Shen
Eric Xing
Yanyan Lan
AI4CE
179
26
0
11 Oct 2021
Efficient Training of Audio Transformers with Patchout
Interspeech, 2021
Khaled Koutini
Jan Schlüter
Hamid Eghbalzadeh
Gerhard Widmer
ViT
559
360
0
11 Oct 2021
Global Vision Transformer Pruning with Hessian-Aware Saliency
Computer Vision and Pattern Recognition (CVPR), 2021
Huanrui Yang
Hongxu Yin
Maying Shen
Pavlo Molchanov
Hai Helen Li
Jan Kautz
ViT
234
80
0
10 Oct 2021
Google Landmark Retrieval 2021 Competition Third Place Solution
Qishen Ha
Bo Liu
Hongwei Zhang
3DV
3DPC
133
2
0
09 Oct 2021
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Xin Liu
B. Hill
Ziheng Jiang
Shwetak N. Patel
Daniel J. McDuff
3DH
MedIm
344
144
0
09 Oct 2021
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP
European Conference on Computer Vision (ECCV), 2021
Jihao Liu
Jiaming Song
Guanglu Song
Xin Huang
Yu Liu
ViT
235
38
0
08 Oct 2021
An End-to-End Trainable Video Panoptic Segmentation Method using Transformers
Jeongwon Ryu
Kwangjin Yoon
ViT
108
2
0
08 Oct 2021
ViDT: An Efficient and Effective Fully Transformer-based Object Detector
International Conference on Learning Representations (ICLR), 2021
Hwanjun Song
Deqing Sun
Sanghyuk Chun
Varun Jampani
Dongyoon Han
Byeongho Heo
Wonjae Kim
Ming-Hsuan Yang
353
95
0
08 Oct 2021
Token Pooling in Vision Transformers
D. Marin
Jen-Hao Rick Chang
Anurag Ranjan
Anish K. Prabhu
Mohammad Rastegari
Oncel Tuzel
ViT
359
82
0
08 Oct 2021
Efficient large-scale image retrieval with deep feature orthogonality and Hybrid-Swin-Transformers
Christof Henkel
274
16
0
07 Oct 2021
TranSalNet: Towards perceptually relevant visual saliency prediction
Jianxun Lou
Hanhe Lin
David Marshall
Dietmar Saupe
Hantao Liu
ViT
188
112
0
07 Oct 2021
Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs
Philipp Benz
Soomin Ham
Chaoning Zhang
Adil Karjauv
In So Kweon
AAML
ViT
310
89
0
06 Oct 2021
3rd Place Solution to Google Landmark Recognition Competition 2021
Chengfeng Xu
Weimin Wang
Shuai Liu
Yong Wang
Yuxiang Tang
Tianling Bian
Yanyu Yan
Qi She
Cheng Yang
3DPC
3DV
232
6
0
06 Oct 2021
Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy
Jiehui Xu
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
528
849
0
06 Oct 2021
2nd Place Solution to Google Landmark Recognition Competition 2021
Shubin Dai
3DV
ViT
159
4
0
06 Oct 2021
Ripple Attention for Visual Perception with Sub-quadratic Complexity
Lin Zheng
Huijie Pan
Lingpeng Kong
274
4
0
06 Oct 2021
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta
Mohammad Rastegari
ViT
660
1,979
0
05 Oct 2021
Deep Instance Segmentation with Automotive Radar Detection Points
Tao Huang
Weiyi Xiong
Liping Bai
Yu Xia
Wei Chen
Wanli Ouyang
Bing Zhu
417
70
0
05 Oct 2021
Implicit and Explicit Attention for Zero-Shot Learning
Faisal Alamri
Anjan Dutta
215
9
0
02 Oct 2021
SurvTRACE: Transformers for Survival Analysis with Competing Events
Zifeng Wang
Jimeng Sun
236
87
0
02 Oct 2021