v1v2 (latest)

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021

25 March 2021

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)Github (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,575 papers shown

3D Object Tracking with TransformerBritish Machine Vision Conference (BMVC), 2021

180

28 Oct 2021

A Survey of Self-Supervised and Few-Shot Object DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

David Vazquez

291

106

27 Oct 2021

GenURL: A General Framework for Unsupervised Representation Learning

Siyuan Li

Zicheng Liu

Stan Z. Li

353

27 Oct 2021

A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization

27 Oct 2021

Video-based fully automatic assessment of open surgery suturing skills

Adam Goldbraikh

Anne-Lise D. D’Angelo

C. Pugh

S. Laufer

209

26 Oct 2021

Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance

203

25 Oct 2021

Gophormer: Ego-Graph Transformer for Node Classification

Xing Xie

228

25 Oct 2021

MVT: Multi-view Vision Transformer for 3D Object RecognitionBritish Machine Vision Conference (BMVC), 2021

Shuo Chen

Tan Yu

Ping Li

ViT

139

25 Oct 2021

The Efficiency MisnomerInternational Conference on Learning Representations (ICLR), 2021

315

115

25 Oct 2021

SOFT: Softmax-free Transformer with Linear ComplexityNeural Information Processing Systems (NeurIPS), 2021

Jiachen Lu

Jinghan Yao

Junge Zhang

Hang Xu

Li Zhang

251

197

22 Oct 2021

UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution

209

22 Oct 2021

Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image ClassificationIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021

314

21 Oct 2021

Vis-TOP: Visual Transformer Overlay Processor

252

21 Oct 2021

Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model HubsJournal of machine learning research (JMLR), 2021

Yong Liu

419

20 Oct 2021

Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning

175

20 Oct 2021

1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021

127

19 Oct 2021

Towards Toxic and Narcotic Medication Detection with Rotated Object Detector

243

19 Oct 2021

HRFormer: High-Resolution Transformer for Dense Prediction

Jingdong Wang

356

308

18 Oct 2021

3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers

Roger Wattenhofer

212

17 Oct 2021

Towards Language-guided Visual Recognition via Dynamic Convolutions

Yongjian Wu

248

17 Oct 2021

ASFormer: Transformer for Action Segmentation

Fangqiu Yi

Hongyu Wen

Tingting Jiang

ViT

664

243

16 Oct 2021

COVID-19 Detection in Chest X-ray Images Using Swin-Transformer and Transformer in Transformer

Juntao Jiang

Shuyi Lin

ViT MedIm

144

16 Oct 2021

Detecting Gender Bias in Transformer-based Models: A Case Study on BERT

Hongwu Peng

Caiwen Ding

133

15 Oct 2021

Receptive Field Broadening and Boosting for Salient Object Detection

251

15 Oct 2021

Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation

702

14 Oct 2021

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

659

2,069

13 Oct 2021

StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement LearningEuropean Conference on Computer Vision (ECCV), 2021

432

12 Oct 2021

Satellite Image Semantic Segmentation

137

12 Oct 2021

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

140

11 Oct 2021

Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block

138

11 Oct 2021

Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types

Shentong Mo

179

11 Oct 2021

Efficient Training of Audio Transformers with PatchoutInterspeech (Interspeech), 2021

559

360

11 Oct 2021

Global Vision Transformer Pruning with Hessian-Aware SaliencyComputer Vision and Pattern Recognition (CVPR), 2021

Huanrui Yang

234

10 Oct 2021

Google Landmark Retrieval 2021 Competition Third Place Solution

133

09 Oct 2021

EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals MeasurementIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

344

144

09 Oct 2021

UniNet: Unified Architecture Search with Convolution, Transformer, and MLPEuropean Conference on Computer Vision (ECCV), 2021

235

08 Oct 2021

An End-to-End Trainable Video Panoptic Segmentation Method usingTransformers

Jeongwon Ryu

Kwangjin Yoon

ViT

108

08 Oct 2021

ViDT: An Efficient and Effective Fully Transformer-based Object DetectorInternational Conference on Learning Representations (ICLR), 2021

Ming-Hsuan Yang

353

08 Oct 2021

Token Pooling in Vision Transformers

359

08 Oct 2021

Efficient large-scale image retrieval with deep feature orthogonality and Hybrid-Swin-Transformers

Christof Henkel

274

07 Oct 2021

TranSalNet: Towards perceptually relevant visual saliency prediction

188

112

07 Oct 2021

Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs

310

06 Oct 2021

3rd Place Solution to Google Landmark Recognition Competition 2021

232

06 Oct 2021

Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy

528

849

06 Oct 2021

2nd Place Solution to Google Landmark Recognition Competition 2021

Shubin Dai

3DV ViT

159

06 Oct 2021

Ripple Attention for Visual Perception with Sub-quadratic Complexity

Lin Zheng

Huijie Pan

Lingpeng Kong

274

06 Oct 2021

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer

Sachin Mehta

Mohammad Rastegari

ViT

660

1,979

05 Oct 2021

Deep Instance Segmentation with Automotive Radar Detection Points

Wanli Ouyang

417

05 Oct 2021

Implicit and Explicit Attention for Zero-Shot Learning

Faisal Alamri

Anjan Dutta

215

02 Oct 2021

SurvTRACE: Transformers for Survival Analysis with Competing Events

Zifeng Wang

Jimeng Sun

236

02 Oct 2021