ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 1,659 papers shown
Title
Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities
Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities
Jueqing Lu
Yuanyuan Qi
Xiaohao Yang
Shujie Zhou
Lan Du
14
0
0
13 May 2025
H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
H3^{\mathbf{3}}3DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
Yiyang Lu
Yufeng Tian
Zhecheng Yuan
X. Wang
Pu Hua
Zhengrong Xue
Huazhe Xu
16
0
0
12 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
H. Zhang
Hongjie Zhang
Y. Huang
Z. Ling
Yu Qiao
Limin Wang
Y. Wang
AI4TS
21
0
0
10 May 2025
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
Jingguo Qu
Xinyang Han
Man-Lik Chui
Yao Pu
Simon Takadiyi Gunda
...
Jing Qin
Ann Dorothy King
Winnie Chiu-Wing Chu
J. Cai
Michael Tin-Cheung Ying
21
0
0
09 May 2025
A review of advancements in low-light image enhancement using deep learning
A review of advancements in low-light image enhancement using deep learning
Fangxue Liu
Lei Fan
37
0
0
09 May 2025
Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet
Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet
Kodai Hirata
Tsuyoshi Okita
ViT
36
0
0
09 May 2025
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
29
0
0
09 May 2025
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Yiming Niu
Jinliang Deng
L. Zhang
Zimu Zhou
Yongxin Tong
AI4TS
21
0
0
09 May 2025
Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition
Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition
Weiyi Zhang
Peranut Chotcomwongse
Yinwen Li
Pusheng Xu
Ruijie Yao
...
Shujun Wang
Yalin Zheng
M. He
Danli Shi
Paisan Ruamviboonsuk
13
0
0
09 May 2025
SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal
SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal
Wenyang Liu
Jianjun Gao
Kim-Hui Yap
36
0
0
08 May 2025
Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection
Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection
Tharindu Fernando
Clinton Fookes
S. Sridharan
Simon Denman
39
0
0
08 May 2025
Adaptive Contextual Embedding for Robust Far-View Borehole Detection
Adaptive Contextual Embedding for Robust Far-View Borehole Detection
Xuesong Liu
Tianyu Hao
Emmett J. Ientilucci
34
0
0
08 May 2025
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
Tom Sander
Moritz Tenthoff
Kay Wohlfarth
Christian Wöhler
19
0
0
08 May 2025
A Simple Detector with Frame Dynamics is a Strong Tracker
A Simple Detector with Frame Dynamics is a Strong Tracker
Chenxu Peng
C. Wang
Minrui Zou
Danyang Li
Z. Yang
Yimian Dai
Ming-Ming Cheng
Xiang Li
40
0
0
08 May 2025
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
71
0
0
08 May 2025
FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration
FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration
Ying Zhang
Shuai Guo
Chenxi Sun
Yuchen Zhu
Jinhai Xiang
MedIm
45
0
0
08 May 2025
GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation
GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation
Chengwei Ye
H. Zhang
Yufei Lin
Kangsheng Wang
Linuo Xu
Shuyan Liu
MedIm
50
0
0
08 May 2025
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Kun Peng
Chaodong Tong
Cong Cao
Hao Peng
Q. Li
Guanlin Wu
Lei Jiang
Yanbing Liu
Philip S. Yu
LMTD
43
0
0
08 May 2025
CM1 - A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
CM1 - A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
Fabian Wolf
Oliver Tüselmann
Arthur Matei
Lukas Hennies
Christoph Rass
Gernot A. Fink
46
0
0
07 May 2025
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective
Songsong Duan
Xi Yang
Nannan Wang
Xinbo Gao
53
0
0
07 May 2025
TS-SNN: Temporal Shift Module for Spiking Neural Networks
TS-SNN: Temporal Shift Module for Spiking Neural Networks
Kairong Yu
Tianqing Zhang
Qi Xu
Gang Pan
Hongwei Wang
60
0
0
07 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
Image Restoration via Multi-domain Learning
Image Restoration via Multi-domain Learning
Xingyu Jiang
Ning Gao
Xiuhui Zhang
Hongkun Dou
Shaowen Fu
Xiaoqing Zhong
H. Li
Yue Deng
ViT
29
0
0
07 May 2025
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
31
0
0
07 May 2025
Vision Graph Prompting via Semantic Low-Rank Decomposition
Vision Graph Prompting via Semantic Low-Rank Decomposition
Zixiang Ai
Zichen Liu
Jiahuan Zhou
48
0
0
07 May 2025
Tetrahedron-Net for Medical Image Registration
Tetrahedron-Net for Medical Image Registration
Jinhai Xiang
Shuai Guo
Qianru Han
Dantong Shi
Xinwei He
Xiang Bai
3DV
52
0
0
07 May 2025
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
Xiao Wang
Jong Youl Choi
Takuya Kurihaya
Isaac Lyngaas
Hong-Jun Yoon
...
Dali Wang
Peter Thornton
Prasanna Balaprakash
M. Ashfaq
Dan Lu
26
0
0
07 May 2025
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
Evangelia Christodoulou
Annika Reinke
Pascaline Andrè
Patrick Godau
P. Kalinowski
...
Amber L. Simpson
A. Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
33
0
0
07 May 2025
Balancing Accuracy, Calibration, and Efficiency in Active Learning with Vision Transformers Under Label Noise
Balancing Accuracy, Calibration, and Efficiency in Active Learning with Vision Transformers Under Label Noise
Moseli Motsóehli
Hope Mogale
Kyungim Baek
38
0
0
07 May 2025
Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision
Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision
Linhan Cao
Wei Sun
Kaiwei Zhang
Yicong Peng
Guangtao Zhai
Xiongkuo Min
50
0
0
06 May 2025
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges
Hao Xu
Arbind Agrahari Baniya
Sam Well
Mohamed Reda Bouadjenek
Richard Dazeley
S. Aryal
AI4TS
22
0
0
06 May 2025
Towards Efficient Benchmarking of Foundation Models in Remote Sensing: A Capabilities Encoding Approach
Towards Efficient Benchmarking of Foundation Models in Remote Sensing: A Capabilities Encoding Approach
Pierre Adorni
M. Pham
Stéphane May
Sébastien Lefèvre
37
0
0
06 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
54
0
0
06 May 2025
DCS-ST for Classification of Breast Cancer Histopathology Images with Limited Annotations
DCS-ST for Classification of Breast Cancer Histopathology Images with Limited Annotations
Liu Suxing
Byungwon Min
27
0
0
06 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
55
0
0
06 May 2025
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
Andrew Caunes
T. Chateau
Vincent Frémont
3DPC
24
0
0
06 May 2025
Database-Agnostic Gait Enrollment using SetTransformers
Database-Agnostic Gait Enrollment using SetTransformers
Nicoleta Basoc
Adrian Cosma
Andy Catruna
Emilian Radoi
SLR
24
0
0
05 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
D. Jiang
Mengmeng Wang
Liuzhuozheng Li
Lei Zhang
Haoyu Wang
Wei Wei
Guang Dai
Yanning Zhang
Jingdong Wang
DiffM
44
0
0
05 May 2025
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
12
0
0
05 May 2025
Token Coordinated Prompt Attention is Needed for Visual Prompting
Token Coordinated Prompt Attention is Needed for Visual Prompting
Zichen Liu
Xu Zou
Gang Hua
Jiahuan Zhou
26
0
0
05 May 2025
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Volodymyr Havrylov
Haiwen Huang
Dan Zhang
Andreas Geiger
46
0
0
04 May 2025
Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Y. Zhang
Zeqiang Lai
T. Zhang
Ying Fu
Chenghu Zhou
40
1
0
04 May 2025
Always Skip Attention
Always Skip Attention
Yiping Ji
Hemanth Saratchandran
Peyman Moghaddam
Simon Lucey
55
0
0
04 May 2025
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images
Siddharth Kothari
Srinivasan Murali
Sankalp Kothari
Ujjwal Verma
Jaya Sreevalsan-Nair
35
0
0
03 May 2025
Accelerating Deep Neural Network Training via Distributed Hybrid Order Optimization
Accelerating Deep Neural Network Training via Distributed Hybrid Order Optimization
Shunxian Gu
Chaoqun You
Bangbang Ren
Lailong Luo
Junxu Xia
Deke Guo
29
0
0
02 May 2025
CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking
CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking
Vladimir Somers
Baptiste Standaert
Victor Joos
Alexandre Alahi
Christophe De Vleeschouwer
VOT
49
0
0
02 May 2025
When Dynamic Data Selection Meets Data Augmentation
When Dynamic Data Selection Meets Data Augmentation
S. M. I. Simon X. Yang
Peng Ye
F. Shen
Dongzhan Zhou
22
0
0
02 May 2025
Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction
Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction
Changjun Li
Runqing Jiang
Zhuo Song
Pengpeng Yu
Ye Zhang
Yulan Guo
MQ
49
0
0
01 May 2025
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Muyi Bao
Shuchang Lyu
Zhaoyang Xu
Huiyu Zhou
Jinchang Ren
Shiming Xiang
X. Li
Guangliang Cheng
Mamba
72
0
0
01 May 2025
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
Tam Minh Nguyen
Ngoc N. Tran
Khai Nguyen
Richard G. Baraniuk
MoE
59
0
0
01 May 2025
1234...323334
Next