ResearchTrend.AI


Swin Transformer: Hierarchical Vision Transformer using Shifted Windows


25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 2,425 papers shown
Title
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Xiaolong Wang
TTA
ViT
3DGS
31
16
0
11 Jul 2023
Random Position Adversarial Patch for Vision Transformers
Mingzhen Shao
ViT
AAML
23
2
0
09 Jul 2023
Joint Perceptual Learning for Enhancement and Object Detection in Underwater Scenarios
C. Fu
Wanqi Yuan
Jiewen Xiao
Risheng Liu
Xin-Yue Fan
20
0
0
07 Jul 2023
Non-iterative Coarse-to-fine Transformer Networks for Joint Affine and Deformable Image Registration
Mingyuan Meng
Lei Bi
M. Fulham
D. Feng
Jinman Kim
ViT
MedIm
18
24
0
07 Jul 2023
Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression
Ahmed Ghorbel
W. Hamidouche
L. Morin
23
0
0
05 Jul 2023
Compound Attention and Neighbor Matching Network for Multi-contrast MRI Super-resolution
Wenxuan Chen
Sirui Wu
Shuai Wang
Zhong Li
Jia Yang
Huifeng Yao
Xiao-quan Song
SupR
MedIm
30
1
0
05 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking
Yuanyou Xu
Jiahao Li
Zongxin Yang
Yi Yang
Yueting Zhuang
14
1
0
05 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation
Jiahao Li
Yuanyou Xu
Zongxin Yang
Yi Yang
Yueting Zhuang
VOS
33
0
0
05 Jul 2023
MaskBEV: Joint Object Detection and Footprint Completion for Bird's-eye View 3D Point Clouds
William Guimont-Martin
Jean-Michel Fortin
François Pomerleau
Philippe Giguère
3DPC
12
1
0
04 Jul 2023
Spike-driven Transformer
Man Yao
Jiakui Hu
Zhaokun Zhou
Liuliang Yuan
Yonghong Tian
Boxing Xu
Guoqi Li
34
112
0
04 Jul 2023
Separated RoadTopoFormer
Mingjie Lu
Yuanxian Huang
Ji Liu
Jinzhan Peng
Lu Tian
Ashish Sirasao
16
2
0
04 Jul 2023
Tomato Maturity Recognition with Convolutional Transformers
Asim Khan
Taimur Hassan
Muhammad Shafay
Israa Fahmy
N. Werghi
Seneviratne Mudigansalage
Irfan Hussain
ViT
34
24
0
04 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
23
17
0
03 Jul 2023
Unveiling the Potential of Spike Streams for Foreground Occlusion Removal from Densely Continuous Views
Jiyuan Zhang
Shiyan Chen
Yajing Zheng
Zhaofei Yu
Tiejun Huang
3DH
25
3
0
03 Jul 2023
Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation
Deyi Ji
Feng Zhao
Hongtao Lu
ViT
41
15
0
03 Jul 2023
TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching
Khang Truong Giang
Soohwan Song
Sung-Guk Jo
24
3
0
02 Jul 2023
MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications
Mustafa Munir
William Avery
R. Marculescu
ViT
GNN
31
33
0
01 Jul 2023
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency
Yan Wang
Yuhang Li
Ruihao Gong
Aishan Liu
Yanfei Wang
...
Yongqiang Yao
Yunchen Zhang
Tianzi Xiao
F. Yu
Xianglong Liu
AAML
32
0
0
01 Jul 2023
Q-YOLO: Efficient Inference for Real-time Object Detection
Mingze Wang
H. Sun
Jun Shi
Xuhui Liu
Baochang Zhang
Xianbin Cao
ObjD
28
8
0
01 Jul 2023
Long-Tailed Continual Learning For Visual Food Recognition
Jiangpeng He
Luotao Lin
Jack Ma
H. Eicher-Miller
F. Zhu
Fengqing M Zhu
59
14
0
01 Jul 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
32
8
0
30 Jun 2023
Cross Architecture Distillation for Face Recognition
Weisong Zhao
Xiangyu Zhu
Zhixiang He
Xiaoyu Zhang
Zhen Lei
CVBM
15
6
0
26 Jun 2023
SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine Learning
Pu Ren
N. Benjamin Erichson
Shashank Subramanian
Omer San
Z. Lukić
Michael W. Mahoney
36
13
0
24 Jun 2023
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
13
88
0
22 Jun 2023
Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference
Boyan Li
Luziwei Leng
Shuaijie Shen
Kaixuan Zhang
Jianguo Zhang
Jianxing Liao
Ran Cheng
26
7
0
21 Jun 2023
A Comprehensive Study on the Robustness of Image Classification and Object Detection in Remote Sensing: Surveying and Benchmarking
Shaohui Mei
Jiawei Lian
Xiaofei Wang
Yuru Su
Mingyang Ma
Lap-Pui Chau
AAML
18
11
0
21 Jun 2023
MSW-Transformer: Multi-Scale Shifted Windows Transformer Networks for 12-Lead ECG Classification
Ren-Wei Cheng
Zhemin Zhuang
Shuxin Zhuang
Linfu Xie
Jingfeng Guo
MedIm
22
6
0
21 Jun 2023
Balanced Mixture of SuperNets for Learning the CNN Pooling Architecture
Mehraveh Javan
Matthew Toews
M. Pedersoli
31
1
0
21 Jun 2023
Using super-resolution for enhancing visual perception and segmentation performance in veterinary cytology
Jakub Caputa
Maciej Wielgosz
Daria Lukasik
P. Russek
Jakub Grzeszczyk
...
E. Jamro
S. Koryciak
A. Dabrowska-Boruch
Marcin Pietroń
K. Wiatr
11
0
0
20 Jun 2023
Improving Image Captioning Descriptiveness by Ranking and LLM-based Fusion
Simone Bianco
Luigi Celona
Marco Donzella
Paolo Napoletano
31
18
0
20 Jun 2023
CrossKD: Cross-Head Knowledge Distillation for Object Detection
Jiabao Wang
Yuming Chen
Zhaohui Zheng
Xiang Li
Ming-Ming Cheng
Qibin Hou
38
32
0
20 Jun 2023
Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions
Dongshuo Yin
Xueting Han
Bin Li
Hao Feng
Jinghua Bai
VPVLM
26
17
0
16 Jun 2023
End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve
Limeng Qiao
Wenjie Ding
Xi Qiu
Chi Zhang
24
73
0
16 Jun 2023
PaReprop: Fast Parallelized Reversible Backpropagation
Tyler Lixuan Zhu
K. Mangalam
17
1
0
15 Jun 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
34
4
0
15 Jun 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
Anuj Diwan
Eunsol Choi
David F. Harwath
41
0
0
14 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
33
7
0
14 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
22
10
0
14 Jun 2023
UOD: Universal One-shot Detection of Anatomical Landmarks
Heqin Zhu
Quan Quan
Qingsong Yao
Zaiyi Liu
S. Kevin Zhou
ViT
15
10
0
13 Jun 2023
Referring Camouflaged Object Detection
Xuying Zhang
Bo Yin
Zheng Lin
Qibin Hou
Deng-Ping Fan
Ming-Ming Cheng
38
17
0
13 Jun 2023
Transferring Foundation Models for Generalizable Robotic Manipulation
Jiange Yang
Wenhui Tan
Chuhao Jin
Keling Yao
Bei Liu
Jianlong Fu
Ruihua Song
Gangshan Wu
Limin Wang
LM&Ro
47
6
0
09 Jun 2023
Efficient Multi-Task Scene Analysis with RGB-D Transformers
Söhnke Benedikt Fischedick
Daniel Seichter
Robin M. Schmidt
Leonard Rabes
H. Groß
25
9
0
08 Jun 2023
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Hanrong Ye
Dan Xu
ViT
25
10
0
08 Jun 2023
Matte Anything: Interactive Natural Image Matting with Segment Anything Models
J. Yao
Xinggang Wang
Lang Ye
Wenyu Liu
20
38
0
07 Jun 2023
BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata Embedding
Zhihao Yang
Wenyi Lian
Siyuan Lai
30
6
0
06 Jun 2023
Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach
Min Yan
Qianxiong Ning
Qian Wang
25
1
0
06 Jun 2023
Cross-LKTCN: Modern Convolution Utilizing Cross-Variable Dependency for Multivariate Time Series Forecasting
Donghao Luo
Xue Wang
BDL
AI4TS
11
2
0
04 Jun 2023
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling
Yingzhi Zhang
Taiguo Li
C. Li
Xinghong Zhou
32
10
0
01 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
37
28
0
01 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
J. C. V. Gemert
18
9
0
31 May 2023