v1v2 (latest)

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021

25 March 2021

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)Github (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,640 papers shown

Are we ready for a new paradigm shift? A Survey on Visual Deep MLP

752

123

07 Nov 2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

385

05 Nov 2021

Hepatic vessel segmentation based on 3D swin-transformer with inductive biased multi-head self-attentionBMC Medical Imaging (BMC Med Imaging), 2021

Pheng-Ann Heng

268

05 Nov 2021

Bootstrap Your Object Detector via Mixed TrainingNeural Information Processing Systems (NeurIPS), 2021

295

04 Nov 2021

LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation

283

04 Nov 2021

An Empirical Study of Training End-to-End Vision-and-Language TransformersComputer Vision and Pattern Recognition (CVPR), 2021

...

Lu Yuan

Zicheng Liu

359

442

03 Nov 2021

STC speaker recognition systems for the NIST SRE 2021The Speaker and Language Recognition Workshop (SLR), 2021

...

214

03 Nov 2021

Can Vision Transformers Perform Convolution?

284

02 Nov 2021

Multi-Scale High-Resolution Vision Transformer for Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2021

366

231

01 Nov 2021

Livestock Monitoring with TransformerBritish Machine Vision Conference (BMVC), 2021

329

01 Nov 2021

DPNET: Dual-Path Network for Efficient Object Detectioj with Lightweight Self-AttentionInternational Conference on Information Photonics (ICIP), 2021

195

31 Oct 2021

PatchFormer: An Efficient Point Transformer with Patch AttentionComputer Vision and Pattern Recognition (CVPR), 2021

563

30 Oct 2021

MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep LearningNeural Information Processing Systems (NeurIPS), 2021

Ji Lin

Wei-Ming Chen

Han Cai

Chuang Gan

Song Han

409

187

28 Oct 2021

Blending Anti-Aliasing into Vision TransformerNeural Information Processing Systems (NeurIPS), 2021

267

28 Oct 2021

Dispensed Transformer Network for Unsupervised Domain Adaptation

...

244

28 Oct 2021

3D Object Tracking with TransformerBritish Machine Vision Conference (BMVC), 2021

221

28 Oct 2021

A Survey of Self-Supervised and Few-Shot Object DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

David Vazquez

308

108

27 Oct 2021

GenURL: A General Framework for Unsupervised Representation Learning

Siyuan Li

Zicheng Liu

Stan Z. Li

376

27 Oct 2021

A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization

27 Oct 2021

Video-based fully automatic assessment of open surgery suturing skills

Adam Goldbraikh

Anne-Lise D. D’Angelo

C. Pugh

S. Laufer

226

26 Oct 2021

Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance

218

25 Oct 2021

Gophormer: Ego-Graph Transformer for Node Classification

Xing Xie

229

25 Oct 2021

MVT: Multi-view Vision Transformer for 3D Object RecognitionBritish Machine Vision Conference (BMVC), 2021

Shuo Chen

Tan Yu

Ping Li

ViT

154

25 Oct 2021

The Efficiency MisnomerInternational Conference on Learning Representations (ICLR), 2021

382

115

25 Oct 2021

SOFT: Softmax-free Transformer with Linear ComplexityNeural Information Processing Systems (NeurIPS), 2021

Jiachen Lu

Jinghan Yao

Junge Zhang

Hang Xu

Li Zhang

331

204

22 Oct 2021

UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution

229

22 Oct 2021

Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image ClassificationIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021

336

21 Oct 2021

Vis-TOP: Visual Transformer Overlay Processor

291

21 Oct 2021

Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model HubsJournal of machine learning research (JMLR), 2021

Yong Liu

465

20 Oct 2021

Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning

188

20 Oct 2021

1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021

139

19 Oct 2021

Towards Toxic and Narcotic Medication Detection with Rotated Object Detector

257

19 Oct 2021

HRFormer: High-Resolution Transformer for Dense Prediction

Jingdong Wang

403

314

18 Oct 2021

3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers

Roger Wattenhofer

252

17 Oct 2021

Towards Language-guided Visual Recognition via Dynamic Convolutions

Yongjian Wu

276

17 Oct 2021

ASFormer: Transformer for Action Segmentation

Fangqiu Yi

Hongyu Wen

Tingting Jiang

ViT

701

253

16 Oct 2021

COVID-19 Detection in Chest X-ray Images Using Swin-Transformer and Transformer in Transformer

Juntao Jiang

Shuyi Lin

ViT MedIm

148

16 Oct 2021

Detecting Gender Bias in Transformer-based Models: A Case Study on BERT

Hongwu Peng

Caiwen Ding

142

15 Oct 2021

Receptive Field Broadening and Boosting for Salient Object Detection

260

15 Oct 2021

Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation

712

14 Oct 2021

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

719

2,156

13 Oct 2021

StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement LearningEuropean Conference on Computer Vision (ECCV), 2021

462

12 Oct 2021

Satellite Image Semantic Segmentation

139

12 Oct 2021

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

155

11 Oct 2021

Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block

153

11 Oct 2021

Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types

Shentong Mo

189

11 Oct 2021

Efficient Training of Audio Transformers with PatchoutInterspeech (Interspeech), 2021

600

372

11 Oct 2021

Global Vision Transformer Pruning with Hessian-Aware SaliencyComputer Vision and Pattern Recognition (CVPR), 2021

Huanrui Yang

317

10 Oct 2021

Google Landmark Retrieval 2021 Competition Third Place Solution

144

09 Oct 2021

EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals MeasurementIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

361

154

09 Oct 2021