ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 2,048 papers shown
Title
Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks
Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks
Chun Bao
Jie Cao
Yaqian Ning
Yang Cheng
Q. Hao
11
1
0
23 Nov 2022
DETRs with Collaborative Hybrid Assignments Training
DETRs with Collaborative Hybrid Assignments Training
Zhuofan Zong
Guanglu Song
Yu Liu
ViT
24
304
0
22 Nov 2022
Efficient Frequency Domain-based Transformers for High-Quality Image
  Deblurring
Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring
Lingshun Kong
Jiangxin Dong
Mingqiang Li
J. Ge
Jin-shan Pan
ViT
22
141
0
22 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Mingg-Ming Cheng
Jiashi Feng
ViT
23
129
0
22 Nov 2022
NeRF-RPN: A general framework for object detection in NeRFs
NeRF-RPN: A general framework for object detection in NeRFs
Benran Hu
Junkai Huang
Yichen Liu
Yu-Wing Tai
Chi-Keung Tang
ObjD
17
59
0
21 Nov 2022
Contrastive Masked Autoencoders for Self-Supervised Video Hashing
Contrastive Masked Autoencoders for Self-Supervised Video Hashing
Yuting Wang
Jinpeng Wang
B. Chen
Ziyun Zeng
Shutao Xia
21
20
0
21 Nov 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for
  Efficient Vision Transformer Training
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
26
22
0
19 Nov 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text
  Spotting
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
23
70
0
19 Nov 2022
TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer
TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer
Zhiyang Dou
Qingxuan Wu
Chu-Hsing Lin
Zeyu Cao
Qiangqiang Wu
Weilin Wan
Taku Komura
Wenping Wang
24
39
0
19 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo
  Matching and Optical Flow
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
15
80
0
18 Nov 2022
Self-Supervised Visual Representation Learning via Residual Momentum
Self-Supervised Visual Representation Learning via Residual Momentum
T. Pham
Axi Niu
Zhang Kang
Sultan Rizky Hikmawan Madjid
Jiajing Hong
Daehyeok Kim
Joshua Tian Jin Tee
Chang-Dong Yoo
SSL
34
6
0
17 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
37
442
0
17 Nov 2022
CPT-V: A Contrastive Approach to Post-Training Quantization of Vision
  Transformers
CPT-V: A Contrastive Approach to Post-Training Quantization of Vision Transformers
N. Frumkin
Dibakar Gope
Diana Marculescu
ViT
MQ
21
1
0
17 Nov 2022
Hypergraph Transformer for Skeleton-based Action Recognition
Hypergraph Transformer for Skeleton-based Action Recognition
Yuxuan Zhou
Zhi-Qi Cheng
C. Li
Yanwen Fang
Yifeng Geng
Xuansong Xie
M. Keuper
ViT
18
52
0
17 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video
  UniFormer
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
20
106
0
17 Nov 2022
Progressive Tree-Structured Prototype Network for End-to-End Image
  Captioning
Progressive Tree-Structured Prototype Network for End-to-End Image Captioning
Pengpeng Zeng
Jinkuan Zhu
Jingkuan Song
Lianli Gao
VLM
22
27
0
17 Nov 2022
Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive
  Survey
Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey
Yuecong Xu
Haozhi Cao
Zhenghua Chen
Xiaoli Li
Lihua Xie
Jianfei Yang
24
14
0
17 Nov 2022
Language-Assisted Deep Learning for Autistic Behaviors Recognition
Language-Assisted Deep Learning for Autistic Behaviors Recognition
Andong Deng
Taojiannan Yang
C. L. P. Chen
Qian Chen
Leslie C. Neely
Sakiko Oyama
16
8
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
14
18
0
16 Nov 2022
Token Turing Machines
Token Turing Machines
Michael S. Ryoo
K. Gopalakrishnan
Kumara Kahatapitiya
Ted Xiao
Kanishka Rao
Austin Stone
Yao Lu
Julian Ibarz
Anurag Arnab
27
21
0
16 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
21
9
0
16 Nov 2022
SWIN-SFTNet : Spatial Feature Expansion and Aggregation using Swin
  Transformer For Whole Breast micro-mass segmentation
SWIN-SFTNet : Spatial Feature Expansion and Aggregation using Swin Transformer For Whole Breast micro-mass segmentation
Sharif Amit Kamran
Khondker Fariha Hossain
Alireza Tavakkoli
G. Bebis
Salah A. Baker
ViT
9
4
0
16 Nov 2022
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization
Jingang Qu
T. Faney
Zehao Wang
Patrick Gallinari
Soleiman Yousef
J. D. Hemptinne
OOD
14
7
0
15 Nov 2022
Masked Reconstruction Contrastive Learning with Information Bottleneck
  Principle
Masked Reconstruction Contrastive Learning with Information Bottleneck Principle
Ziwen Liu
Bonan Li
Congying Han
Tiande Guo
Xuecheng Nie
SSL
27
2
0
15 Nov 2022
Self-supervised remote sensing feature learning: Learning Paradigms,
  Challenges, and Future Works
Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future Works
Chao Tao
Ji Qi
Mingning Guo
Qing Zhu
Haifeng Li
SSL
19
56
0
15 Nov 2022
FedTune: A Deep Dive into Efficient Federated Fine-Tuning with
  Pre-trained Transformers
FedTune: A Deep Dive into Efficient Federated Fine-Tuning with Pre-trained Transformers
Jinyu Chen
Wenchao Xu
Song Guo
Junxiao Wang
Jie M. Zhang
Haozhao Wang
FedML
15
32
0
15 Nov 2022
NAR-Former: Neural Architecture Representation Learning towards Holistic
  Attributes Prediction
NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Yun Yi
Haokui Zhang
Wenze Hu
Nannan Wang
Xiaoyu Wang
AI4TS
AI4CE
19
8
0
15 Nov 2022
Contextual Transformer for Offline Meta Reinforcement Learning
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin
Ye Li
Xidong Feng
Zhaowei Zhang
Xian Hong Wu Fung
Haifeng Zhang
Jun Wang
Yali Du
Yaodong Yang
OffRL
8
6
0
15 Nov 2022
Adaptive Multi-Neighborhood Attention based Transformer for Graph
  Representation Learning
Adaptive Multi-Neighborhood Attention based Transformer for Graph Representation Learning
Gaichao Li
Jinsong Chen
Kun He
10
3
0
15 Nov 2022
PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain
  Adaptative Semantic Segmentation
PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation
Mu Chen
Zhedong Zheng
Yi Yang
Tat-Seng Chua
27
53
0
14 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
17
6
0
14 Nov 2022
BiViT: Extremely Compressed Binary Vision Transformer
BiViT: Extremely Compressed Binary Vision Transformer
Yefei He
Zhenyu Lou
Luoming Zhang
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
ViT
MQ
18
28
0
14 Nov 2022
Deep Learning-enabled Virtual Histological Staining of Biological
  Samples
Deep Learning-enabled Virtual Histological Staining of Biological Samples
Bijie Bai
Xilin Yang
Yuzhu Li
Yijie Zhang
N. Pillar
Aydogan Ozcan
11
149
0
13 Nov 2022
Long-Range Zero-Shot Generative Deep Network Quantization
Long-Range Zero-Shot Generative Deep Network Quantization
Yan Luo
Yangcheng Gao
Zhao Zhang
Haijun Zhang
Mingliang Xu
Meng Wang
MQ
17
9
0
13 Nov 2022
Perceptual Video Coding for Machines via Satisfied Machine Ratio
  Modeling
Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling
Qi Zhang
Shanshe Wang
Xinfeng Zhang
Chuanmin Jia
Jingshan Pan
Siwei Ma
Wen Gao
19
3
0
13 Nov 2022
Multistep feature aggregation framework for salient object detection
Multistep feature aggregation framework for salient object detection
Xiaogang Liu
25
0
0
12 Nov 2022
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion
  and Keyword-to-Caption Augmentation
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
Yusong Wu
K. Chen
Tianyu Zhang
Yuchen Hui
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
CLIP
37
480
0
12 Nov 2022
AU-Aware Vision Transformers for Biased Facial Expression Recognition
AU-Aware Vision Transformers for Biased Facial Expression Recognition
Shuyi Mao
Xinpeng Li
Q. Wu
Xiaojiang Peng
ViT
28
2
0
12 Nov 2022
End-to-End Machine Learning Framework for Facial AU Detection in
  Intensive Care Units
End-to-End Machine Learning Framework for Facial AU Detection in Intensive Care Units
Subhash Nerella
Kia Khezeli
Andrea Davidson
P. Tighe
A. Bihorac
Parisa Rashidi
CVBM
10
4
0
12 Nov 2022
Unifying Flow, Stereo and Depth Estimation
Unifying Flow, Stereo and Depth Estimation
Haofei Xu
Jing Zhang
Jianfei Cai
Hamid Rezatofighi
F. I. F. Richard Yu
Dacheng Tao
Andreas Geiger
MDE
17
191
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
21
324
0
10 Nov 2022
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted
  Decorrelation Regularization
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization
Ting Wang
Zhedong Zheng
Zunjie Zhu
Yuhan Gao
Yi Yang
Chenggang Yan
23
34
0
10 Nov 2022
Training a Vision Transformer from scratch in less than 24 hours with 1
  GPU
Training a Vision Transformer from scratch in less than 24 hours with 1 GPU
Saghar Irandoust
Thibaut Durand
Yunduz Rakhmangulova
Wenjie Zi
Hossein Hajimirsadeghi
ViT
25
6
0
09 Nov 2022
ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision
  Transformer Acceleration with a Linear Taylor Attention
ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention
Jyotikrishna Dass
Shang Wu
Huihong Shi
Chaojian Li
Zhifan Ye
Zhongfeng Wang
Yingyan Lin
15
49
0
09 Nov 2022
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Qiang Chen
Jian Wang
Chuchu Han
Shangang Zhang
Zexian Li
...
Haocheng Feng
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
ViT
VLM
27
44
0
07 Nov 2022
ViT-CX: Causal Explanation of Vision Transformers
ViT-CX: Causal Explanation of Vision Transformers
Weiyan Xie
Xiao-hui Li
Caleb Chen Cao
Nevin L.Zhang
ViT
16
17
0
06 Nov 2022
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
13
1
0
03 Nov 2022
MALUNet: A Multi-Attention and Light-weight UNet for Skin Lesion
  Segmentation
MALUNet: A Multi-Attention and Light-weight UNet for Skin Lesion Segmentation
Jiacheng Ruan
Suncheng Xiang
Mingye Xie
Ting Liu
Yuzhuo Fu
16
132
0
03 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
11
79
0
02 Nov 2022
Attention-based Neural Cellular Automata
Attention-based Neural Cellular Automata
Mattie Tesfaldet
Derek Nowrouzezahrai
C. Pal
ViT
21
17
0
02 Nov 2022
Previous
123...262728...394041
Next