ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 2,188 papers shown
Title
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every
  Detection Box
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
Vision Transformer with Quadrangle Attention
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
19
38
0
27 Mar 2023
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework
  for 3D Object Detection in Bird's-Eye View
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View
Shengchao Zhou
Weizhou Liu
Chen Hu
Shuchang Zhou
Chaoxiang Ma
21
42
0
27 Mar 2023
Learned Image Compression with Mixed Transformer-CNN Architectures
Learned Image Compression with Mixed Transformer-CNN Architectures
Jinming Liu
Heming Sun
J. Katto
10
220
0
27 Mar 2023
Joint Person Identity, Gender and Age Estimation from Hand Images using Deep Multi-Task Representation Learning
Joint Person Identity, Gender and Age Estimation from Hand Images using Deep Multi-Task Representation Learning
N. L. Baisa
CVBM
32
4
0
27 Mar 2023
Global-to-Local Modeling for Video-based 3D Human Pose and Shape
  Estimation
Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation
Xi Shen
Zongxin Yang
Xiaohan Wang
Jianxin Ma
Chang Zhou
Yezhou Yang
ViT
3DH
21
33
0
26 Mar 2023
SDTracker: Synthetic Data Based Multi-Object Tracking
SDTracker: Synthetic Data Based Multi-Object Tracking
Yingda Guan
Zhengyang Feng
Huiying Chang
Kuo Du
Tingting Li
Min Wang
21
0
0
26 Mar 2023
Sector Patch Embedding: An Embedding Module Conforming to The Distortion
  Pattern of Fisheye Image
Sector Patch Embedding: An Embedding Module Conforming to The Distortion Pattern of Fisheye Image
Dian Yang
Jiadong Tang
Yu Gao
Yi Yang
M. Fu
18
1
0
26 Mar 2023
An Evaluation of Memory Optimization Methods for Training Neural
  Networks
An Evaluation of Memory Optimization Methods for Training Neural Networks
Xiaoxuan Liu
Siddharth Jha
Alvin Cheung
21
0
0
26 Mar 2023
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level
  Latencies
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies
Kilian Batzner
Lars Heckler
Rebecca König
30
129
0
25 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded
  Instances on Challenging Videos
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
26
9
0
25 Mar 2023
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object
  Detection
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection
Hwanjun Song
Jihwan Bang
VLM
ObjD
24
14
0
25 Mar 2023
Towards Accurate Post-Training Quantization for Vision Transformer
Towards Accurate Post-Training Quantization for Vision Transformer
Yifu Ding
Haotong Qin
Qing-Yu Yan
Z. Chai
Junjie Liu
Xiaolin K. Wei
Xianglong Liu
MQ
54
67
0
25 Mar 2023
Learned Two-Plane Perspective Prior based Image Resampling for Efficient
  Object Detection
Learned Two-Plane Perspective Prior based Image Resampling for Efficient Object Detection
Anurag Ghosh
Dinesh Reddy Narapureddy
Christoph Mertz
S. Narasimhan
28
4
0
25 Mar 2023
FastViT: A Fast Hybrid Vision Transformer using Structural
  Reparameterization
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
Pavan Kumar Anasosalu Vasu
J. Gabriel
Jeff J. Zhu
Oncel Tuzel
Anurag Ranjan
ViT
26
149
0
24 Mar 2023
PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place
  Recognition via Sliding Windows across the Panoramic View
PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View
Ze Shi
Haowen Shi
Kailun Yang
Zhen-fei Yin
Yining Lin
Kaiwei Wang
24
3
0
24 Mar 2023
PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Hansheng Guo
Juncheng Li
Guangwei Gao
Zhi Li
T. Zeng
ViT
24
12
0
24 Mar 2023
MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing
MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing
Haijin Zeng
Kai Feng
Shaoguang Huang
Jiezhang Cao
Yongyong Chen
Hongyan Zhang
H. Luong
Wilfried Philips
21
1
0
23 Mar 2023
Towards Better Dynamic Graph Learning: New Architecture and Unified
  Library
Towards Better Dynamic Graph Learning: New Architecture and Unified Library
Le Yu
Leilei Sun
Bowen Du
Weifeng Lv
AI4CE
22
96
0
23 Mar 2023
Top-Down Visual Attention from Analysis by Synthesis
Top-Down Visual Attention from Analysis by Synthesis
Baifeng Shi
Trevor Darrell
Xin Eric Wang
17
28
0
23 Mar 2023
From Knowledge Distillation to Self-Knowledge Distillation: A Unified
  Approach with Normalized Loss and Customized Soft Labels
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels
Zhendong Yang
Ailing Zeng
Zhe Li
Tianke Zhang
Chun Yuan
Yu Li
21
72
0
23 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR
  Perception
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
29
10
0
21 Mar 2023
Machine Learning for Brain Disorders: Transformers and Visual
  Transformers
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant
Maika Edberg
Nicolas Dufour
Vicky Kalogeiton
MedIm
ViT
27
1
0
21 Mar 2023
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D
  Object Detection
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
39
190
0
21 Mar 2023
The Multiscale Surface Vision Transformer
The Multiscale Surface Vision Transformer
Simon Dahan
Logan Z. J. Williams
Daniel Rueckert
E. C. Robinson
MedIm
ViT
10
2
0
21 Mar 2023
A High-Frequency Focused Network for Lightweight Single Image
  Super-Resolution
A High-Frequency Focused Network for Lightweight Single Image Super-Resolution
Xiaotian Weng
Yi Chen
Zhichao Zheng
Yanhui Gu
Junsheng Zhou
Yudong Zhang
21
0
0
21 Mar 2023
Human Pose as Compositional Tokens
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
23
47
0
21 Mar 2023
Equiangular Basis Vectors
Equiangular Basis Vectors
Yang Shen
Xuhao Sun
Xiuying Wei
33
7
0
21 Mar 2023
Learning Context-aware Classifier for Semantic Segmentation
Learning Context-aware Classifier for Semantic Segmentation
Zhuotao Tian
Jiequan Cui
Li Jiang
Xiaojuan Qi
Xin Lai
Yixin Chen
Shu Liu
Jiaya Jia
85
22
0
21 Mar 2023
Detecting the open-world objects with the help of the Brain
Detecting the open-world objects with the help of the Brain
Shuailei Ma
Yuefeng Wang
Ying-yu Wei
Peihao Chen
Zhixiang Ye
Jiaqi Fan
Enming Zhang
Thomas H. Li
VLM
ObjD
16
2
0
21 Mar 2023
One-to-Few Label Assignment for End-to-End Dense Detection
One-to-Few Label Assignment for End-to-End Dense Detection
Shuai Li
Minghan Li
Ruihuang Li
Chenhang He
Lei Zhang
25
19
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training
  Efficiency
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
21
3
0
21 Mar 2023
Tracker Meets Night: A Transformer Enhancer for UAV Tracking
Tracker Meets Night: A Transformer Enhancer for UAV Tracking
Junjie Ye
Changhong Fu
Ziang Cao
Shan An
Guang-Zheng Zheng
Bowen Li
32
51
0
20 Mar 2023
Multiscale Audio Spectrogram Transformer for Efficient Audio
  Classification
Multiscale Audio Spectrogram Transformer for Efficient Audio Classification
Wenjie Zhu
M. Omar
35
22
0
19 Mar 2023
Spatio-Temporal AU Relational Graph Representation Learning For Facial
  Action Units Detection
Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection
Zihan Wang
Siyang Song
Cheng Luo
Yuzhi Zhou
Shiling Wu
Weicheng Xie
Linlin Shen
CVBM
31
13
0
19 Mar 2023
Vision Transformer-based Model for Severity Quantification of Lung
  Pneumonia Using Chest X-ray Images
Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
Bouthaina Slika
Fadi Dornaika
H. Merdji
K. Hammoudi
ViT
MedIm
26
0
0
18 Mar 2023
HDformer: A Higher Dimensional Transformer for Diabetes Detection
  Utilizing Long Range Vascular Signals
HDformer: A Higher Dimensional Transformer for Diabetes Detection Utilizing Long Range Vascular Signals
Ella Lan
MedIm
20
1
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image
  Segmentation
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
24
136
0
17 Mar 2023
GNNFormer: A Graph-based Framework for Cytopathology Report Generation
GNNFormer: A Graph-based Framework for Cytopathology Report Generation
Yangqiaoyu Zhou
Kai-Lang Yao
Wusuo Li
MedIm
11
1
0
17 Mar 2023
SwinVFTR: A Novel Volumetric Feature-learning Transformer for 3D OCT Fluid Segmentation
SwinVFTR: A Novel Volumetric Feature-learning Transformer for 3D OCT Fluid Segmentation
Sharif Amit Kamran
Khondker Fariha Hossain
Alireza Tavakkoli
Salah A. Baker
S. Zuckerbrod
ViT
MedIm
11
1
0
16 Mar 2023
Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer
Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer
Shihao Zou
Yuxuan Mu
X. Zuo
Zi-An Wang
Chao Li
Sen Wang
Weixin Si
Li Cheng
3DH
31
15
0
16 Mar 2023
DeDA: Deep Directed Accumulator
DeDA: Deep Directed Accumulator
Hang Zhang
Rongguang Wang
Renjiu Hu
Jinwei Zhang
Jiahao Nick Li
MedIm
19
4
0
15 Mar 2023
Multi Modal Facial Expression Recognition with Transformer-Based Fusion
  Networks and Dynamic Sampling
Multi Modal Facial Expression Recognition with Transformer-Based Fusion Networks and Dynamic Sampling
Jun-Hwa Kim
Namho Kim
C. Won
CVBM
6
8
0
15 Mar 2023
Subjective and Objective Quality Assessment for in-the-Wild Computer
  Graphics Images
Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images
Zicheng Zhang
Wei Sun
Yingjie Zhou
Jun Jia
Zhichao Zhang
Jing Liu
Xiongkuo Min
Guangtao Zhai
23
25
0
14 Mar 2023
CAT: Causal Audio Transformer for Audio Classification
CAT: Causal Audio Transformer for Audio Classification
Xiaoyu Liu
Hanlin Lu
Jianbo Yuan
Xinyu Li
ViT
21
22
0
14 Mar 2023
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
Tao Jiang
Peng Lu
Li Zhang
Ning Ma
Rui Han
Chengqi Lyu
Yining Li
Kai-xiang Chen
3DH
31
155
0
13 Mar 2023
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze
  Panoramic Dental X-rays
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Ibrahim Ethem Hamamci
Sezgin Er
Enis Simsar
Anjany Sekuboyina
M. Gundogar
B. Stadlinger
A. Mehl
Bjoern H. Menze
DiffM
MedIm
10
25
0
11 Mar 2023
Fine-grained Visual Classification with High-temperature Refinement and
  Background Suppression
Fine-grained Visual Classification with High-temperature Refinement and Background Suppression
Po-Yung Chou
Yu-Yung Kao
Cheng-Hung Lin
23
30
0
11 Mar 2023
Semantics-Aware Dynamic Localization and Refinement for Referring Image
  Segmentation
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
31
23
0
11 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set
  Object Detection
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
58
1,804
0
09 Mar 2023
Previous
123...222324...424344
Next