ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 1,646 papers shown
Title
SteelBlastQC: Shot-blasted Steel Surface Dataset with Interpretable Detection of Surface Defects
SteelBlastQC: Shot-blasted Steel Surface Dataset with Interpretable Detection of Surface Defects
Irina Ruzavina
Lisa Sophie Theis
Jesse Lemeer
Rutger de Groen
Leo Ebeling
Andrej Hulak
Jouaria Ali
Guangzhi Tang
Rico Mockel
24
0
0
29 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
32
0
0
28 Apr 2025
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
Yulong Guo
Zilun Zhang
Yongheng Shang
Tiancheng Zhao
Shuiguang Deng
Yingchun Yang
Jianwei Yin
68
0
0
28 Apr 2025
Breast Cancer Detection from Multi-View Screening Mammograms with Visual Prompt Tuning
Breast Cancer Detection from Multi-View Screening Mammograms with Visual Prompt Tuning
Han Chen
Anne L. Martel
54
0
0
28 Apr 2025
Leveraging Neural Graph Compilers in Machine Learning Research for Edge-Cloud Systems
Leveraging Neural Graph Compilers in Machine Learning Research for Edge-Cloud Systems
Alireza Furutanpey
Carmen Walser
Philipp Raith
P. Frangoudis
Schahram Dustdar
GNN
NAI
83
0
0
28 Apr 2025
Magnifier: A Multi-grained Neural Network-based Architecture for Burned Area Delineation
Magnifier: A Multi-grained Neural Network-based Architecture for Burned Area Delineation
Daniele Rege Cambrin
Luca Colomba
Paolo Garza
44
0
0
28 Apr 2025
Enhancing breast cancer detection on screening mammogram using self-supervised learning and a hybrid deep model of Swin Transformer and Convolutional Neural Network
Enhancing breast cancer detection on screening mammogram using self-supervised learning and a hybrid deep model of Swin Transformer and Convolutional Neural Network
Han Chen
Anne L. Martel
41
0
0
28 Apr 2025
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
Pin-Chi Pan
Soo-Chang Pei
54
0
0
28 Apr 2025
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
Loc Phuc Truong Nguyen
Hung Truong Thanh Nguyen
Hung Cao
59
0
0
27 Apr 2025
Adaptive Dual-domain Learning for Underwater Image Enhancement
Adaptive Dual-domain Learning for Underwater Image Enhancement
Lingtao Peng
Liheng Bian
22
0
0
27 Apr 2025
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
Alexander Baumann
Leonardo Ayala
S.
Jan Sellner
Alexander Studier-Fischer
Berkin Özdemir
Lena Maier-Hein
Slobodan Ilic
51
0
0
27 Apr 2025
A Large Vision-Language Model based Environment Perception System for Visually Impaired People
A Large Vision-Language Model based Environment Perception System for Visually Impaired People
Zezhou Chen
Zhaoxiang Liu
Kai Wang
Kohou Wang
Shiguo Lian
47
0
0
25 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
M. Keuper
50
0
0
25 Apr 2025
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Brian K. S. Isaac-Medina
T. Breckon
OODD
68
0
0
25 Apr 2025
STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting
STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting
Yunze Deng
Haijun Xiong
Bin Feng
X. Wang
W. Liu
3DGS
42
0
0
25 Apr 2025
A multilevel approach to accelerate the training of Transformers
A multilevel approach to accelerate the training of Transformers
Guillaume Lauga
Maël Chaumette
Edgar Desainte-Maréville
Étienne Lasalle
Arthur Lebeurrier
AI4CE
29
0
0
24 Apr 2025
SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam
SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam
Tue Vo
Lakshay Sharma
Tuan Dinh
Khuong Dinh
T. Nguyen
Trung Phan
Minh Do
Duong Vu
30
0
0
21 Apr 2025
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study
Xin Li
Xijun Wang
Bingchen Li
Kun Yuan
Yizhen Shao
Suhang Yao
Ming-Ting Sun
Chao Zhou
Radu Timofte
Zhibo Chen
43
3
0
21 Apr 2025
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
Guodong Shen
Yuqi Ouyang
Junru Lu
Yixuan Yang
Victor Sanchez
23
1
0
20 Apr 2025
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Jingjing Ren
Wenbo Li
Zhongdao Wang
Haoze Sun
Bangzhen Liu
...
Aoxue Li
Shifeng Zhang
Bin Shao
Yong Guo
Lei Zhu
VGen
34
0
0
20 Apr 2025
NTIRE 2025 Challenge on Image Super-Resolution ($\times$4): Methods and Results
NTIRE 2025 Challenge on Image Super-Resolution (×\times×4): Methods and Results
Zheng Chen
Kai Liu
Jue Gong
J. Wang
Lei Sun
...
Prashant Patil
Santosh Kumar Vipparthi
Subrahmanyam Murala
Bilel Benjdira
Anas M. Ali
SupR
56
0
0
20 Apr 2025
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
35
3
0
18 Apr 2025
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis
Zhu Zhu
Shuo Jiang
Jingyuan Zheng
Yawen Li
Yifei Chen
Manli Zhao
Weizhong Gu
Feiwei Qin
Jinhu Wang
Gang Yu
MedIm
33
0
0
18 Apr 2025
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
26
0
0
18 Apr 2025
A Novel Hybrid Approach for Retinal Vessel Segmentation with Dynamic Long-Range Dependency and Multi-Scale Retinal Edge Fusion Enhancement
A Novel Hybrid Approach for Retinal Vessel Segmentation with Dynamic Long-Range Dependency and Multi-Scale Retinal Edge Fusion Enhancement
Yihao Ouyang
Xunheng Kuang
Mengjia Xiong
Zhida Wang
Yuanquan Wang
36
0
0
18 Apr 2025
FocusNet: Transformer-enhanced Polyp Segmentation with Local and Pooling Attention
FocusNet: Transformer-enhanced Polyp Segmentation with Local and Pooling Attention
Jun Zeng
KC Santosh
Deepak Rajan Nayak
Thomas de Lange
Jonas Varkey
Tyler Berzin
Debesh Jha
ViT
MedIm
31
0
0
18 Apr 2025
HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework
HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework
Shuobin Wei
Zhuang Zhou
Zhengan Lu
Zizhao Yuan
Binghua Su
MDE
42
0
0
18 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
103
0
0
17 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
35
0
0
17 Apr 2025
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
Xin Li
Kun Yuan
B. Li
Fengbin Guan
Yizhen Shao
...
Guohua Zhang
Z. Huang
Y. Deng
Qingmiao Jiang
Lu Chen
53
7
0
17 Apr 2025
Search is All You Need for Few-shot Anomaly Detection
Search is All You Need for Few-shot Anomaly Detection
Qishan Wang
Jia Guo
Shuyong Gao
H. Wang
Li Xiong
J. Hu
Hanqi Guo
Wenqiang Zhang
53
0
0
16 Apr 2025
FACT: Foundation Model for Assessing Cancer Tissue Margins with Mass Spectrometry
FACT: Foundation Model for Assessing Cancer Tissue Margins with Mass Spectrometry
Mohammad Farahmand
A. Jamzad
Fahimeh Fooladgar
Laura Connolly
Martin Kaufmann
Kevin Yi Mi Ren
John Rudan
Doug McKay
Gabor Fichtinger
P. Mousavi
31
0
0
15 Apr 2025
Enhanced Small Target Detection via Multi-Modal Fusion and Attention Mechanisms: A YOLOv5 Approach
Enhanced Small Target Detection via Multi-Modal Fusion and Attention Mechanisms: A YOLOv5 Approach
Xiaoxiao Ma
Junxiong Tong
26
0
0
15 Apr 2025
Evolved Hierarchical Masking for Self-Supervised Learning
Evolved Hierarchical Masking for Self-Supervised Learning
Zhanzhou Feng
Shiliang Zhang
37
0
0
12 Apr 2025
From Visual Explanations to Counterfactual Explanations with Latent Diffusion
From Visual Explanations to Counterfactual Explanations with Latent Diffusion
Tung Luu
Nam Le
Duc Le
Bac Le
DiffM
AAML
FAtt
44
0
0
12 Apr 2025
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Team Seawead
Ceyuan Yang
Zhijie Lin
Yang Zhao
Shanchuan Lin
...
Zuquan Song
Zhenheng Yang
Jiashi Feng
Jianchao Yang
Lu Jiang
DiffM
79
1
0
11 Apr 2025
SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion
SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion
Yuseon Kim
Kyongseok Park
27
0
0
10 Apr 2025
Distilling Textual Priors from LLM to Efficient Image Fusion
Distilling Textual Priors from LLM to Efficient Image Fusion
Ran Zhang
Xuanhua He
Ke Cao
L. Liu
Li Zhang
Man Zhou
Jie Zhang
21
0
0
09 Apr 2025
Learning Optimal Prompt Ensemble for Multi-source Visual Prompt Transfer
Learning Optimal Prompt Ensemble for Multi-source Visual Prompt Transfer
Enming Zhang
Liwen Cao
Yanru Wu
Zijie Zhao
Guan Wang
Yang Li
45
0
0
09 Apr 2025
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
Xiaoxing Hu
Ziyang Gong
Y. Wang
Yuru Jia
Gen Luo
Xue Yang
47
0
0
08 Apr 2025
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
Rupayan Mallick
Sibo Dong
Nataniel Ruiz
Sarah Adel Bargal
DiffM
42
0
0
08 Apr 2025
AVP-AP: Self-supervised Automatic View Positioning in 3D cardiac CT via Atlas Prompting
AVP-AP: Self-supervised Automatic View Positioning in 3D cardiac CT via Atlas Prompting
Xiaolin Fan
Y. Wang
Y. Zhang
Mingkun Bao
Bosen Jia
Dong Lu
Yifan Gu
Jian Cheng
Haogang Zhu
30
0
0
08 Apr 2025
A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions
A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions
Ronghui Zhang
Yuhang Ma
Tengfei Li
Ziyu Lin
Yueying Wu
Junzhou Chen
Lin Zhang
Jia Hu
Tony Z. Qiu
Konghui Guo
34
0
0
08 Apr 2025
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
Bingyang Wang
Kaer Huang
Bin Li
Yiqiang Yan
L. Zhang
Huchuan Lu
You He
VLM
26
0
0
07 Apr 2025
Transformer representation learning is necessary for dynamic multi-modal physiological data on small-cohort patients
Transformer representation learning is necessary for dynamic multi-modal physiological data on small-cohort patients
Bingxu Wang
Kunzhi Cai
Yuqi Zhang
Yachong Guo
Zeyi Zhou
Yachong Guo
Yachong Guo
Wei Wang
Qing Zhou
MedIm
26
0
0
05 Apr 2025
MIMRS: A Survey on Masked Image Modeling in Remote Sensing
MIMRS: A Survey on Masked Image Modeling in Remote Sensing
Shabnam Choudhury
Akhil Vasim
Michael Schmitt
Biplab Banerjee
30
0
0
04 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
59
0
0
03 Apr 2025
Spline-based Transformers
Spline-based Transformers
Prashanth Chandran
Agon Serifi
Markus Gross
Moritz Bächer
36
0
0
03 Apr 2025
ShieldGemma 2: Robust and Tractable Image Content Moderation
ShieldGemma 2: Robust and Tractable Image Content Moderation
Wenjun Zeng
D. Kurniawan
Ryan Mullins
Yuchi Liu
Tamoghna Saha
...
Mani Malek
Hamid Palangi
Joon Baek
Rick Pereira
Karthik Narasimhan
AI4MH
31
0
0
01 Apr 2025
The geomagnetic storm and Kp prediction using Wasserstein transformer
The geomagnetic storm and Kp prediction using Wasserstein transformer
Beibei Li
32
0
0
29 Mar 2025
Previous
12345...313233
Next