ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
v1v2 (latest)

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)Github (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,510 papers shown
HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution
HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution
Chao Yang
Boqian Zhang
Jinghao Xu
Guang Jiang
225
0
0
17 Nov 2025
SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design
SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design
Yunjie Yu
Jingchen Wu
J. Zhu
Chunze Lin
Guibin Chen
DiffM
254
0
0
17 Nov 2025
Segment Anything Across Shots: A Method and Benchmark
Segment Anything Across Shots: A Method and Benchmark
Hengrui Hu
Kaining Ying
Henghui Ding
VOS
333
0
0
17 Nov 2025
End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
Yonghui Yu
Jiahang Cai
Xun Wang
Wenwu Yang
ViT
114
0
0
17 Nov 2025
Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach
Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach
Aishwarya Agarwal
Srikrishna Karanam
Vineet Gandhi
VLM
254
0
0
17 Nov 2025
Semi-Supervised Multi-Task Learning for Interpretable Quality As- sessment of Fundus Images
Semi-Supervised Multi-Task Learning for Interpretable Quality As- sessment of Fundus ImagesBiomedical Signal Processing and Control (BSPC), 2025
Lucas Gabriel Telesco
Danila Nejamkin
Estefanía Mata
Francisco Filizzola
Kevin Wignall
...
María de los Angeles Cenoz
Melissa Thompson
Mercedes Leguía
Ignacio Larrabide
J. Orlando
76
0
0
17 Nov 2025
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation
Yan Gong
J. Lu
Yongsheng Gao
Jie Zhao
X. Zhang
Susanto Rahardja
121
0
0
17 Nov 2025
CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation
CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation
Yu Zhu
Dan Zeng
Shuiwang Li
Qijun Zhao
Qiaomu Shen
Bo Tang
135
0
0
17 Nov 2025
H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction
H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction
Xueyang Li
Zongren Wang
Yuliang Zhang
Zixuan Pan
Yu-Jen Chen
Nishchal Sapkota
Gelei Xu
Danny Chen
Yiyu Shi
162
0
0
17 Nov 2025
MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI
MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI
Malek Al Abed
Sebiha Demir
Anne Groteklaes
Elodie Germani
Shahrooz Faghihroohi
Hemmen Sabir
Shadi Albarqouni
MedIm
335
0
0
17 Nov 2025
Towards 3D Object-Centric Feature Learning for Semantic Scene Completion
Towards 3D Object-Centric Feature Learning for Semantic Scene Completion
Weihua Wang
Yubo Cui
Xiangru Lin
Z. Li
Zheng Fang
3DPC
256
0
0
17 Nov 2025
Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link Prediction
Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link Prediction
Tao Zou
Chengfeng Wu
Tianxi Liao
Junchen Ye
Bowen Du
96
0
0
16 Nov 2025
MaskAnyNet: Rethinking Masked Image Regions as Valuable Information in Supervised Learning
MaskAnyNet: Rethinking Masked Image Regions as Valuable Information in Supervised Learning
Jingshan Hong
Haigen Hu
Huihuang Zhang
Q. Zhou
Zhao Li
125
0
0
16 Nov 2025
SAGE: Saliency-Guided Contrastive Embeddings
SAGE: Saliency-Guided Contrastive Embeddings
Colton R. Crum
A. Czajka
Adam Czajka
135
0
0
16 Nov 2025
Seg-VAR: Image Segmentation with Visual Autoregressive Modeling
Seg-VAR: Image Segmentation with Visual Autoregressive Modeling
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
K. Wang
Hengshuang Zhao
137
0
0
16 Nov 2025
LAYA: Layer-wise Attention Aggregation for Interpretable Depth-Aware Neural Networks
LAYA: Layer-wise Attention Aggregation for Interpretable Depth-Aware Neural Networks
Gennaro Vessio
FAtt
191
0
0
16 Nov 2025
DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection
DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection
Jialiang Shen
Jiyang Zheng
Yunqi Xue
Huajie Chen
Yu Yao
...
Ruiqi Liu
Helin Gong
Yang Yang
Dadong Wang
Tongliang Liu
243
0
0
16 Nov 2025
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Ankita Raj
Chetan Arora
ObjDAAMLVLM
294
0
0
16 Nov 2025
Towards Temporal Fusion Beyond the Field of View for Camera-based Semantic Scene Completion
Towards Temporal Fusion Beyond the Field of View for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Jinnyeong Heo
Yeongin Lee
H. Kim
3DGS
133
0
0
16 Nov 2025
MSLoRA: Multi-Scale Low-Rank Adaptation via Attention Reweighting
MSLoRA: Multi-Scale Low-Rank Adaptation via Attention Reweighting
Xu Yang
Gady Agam
127
0
0
16 Nov 2025
DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging
DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging
Huimin Cheng
Xiaowei Yu
Shushan Wu
Luyang Fang
Chao-Yang Cao
Jing Zhang
Tianming Liu
Dajiang Zhu
Wenxuan Zhong
Ping Ma
MedIm
113
0
0
15 Nov 2025
Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors
Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors
Abhishek Sebastian
141
0
0
15 Nov 2025
MTMed3D: A Multi-Task Transformer-Based Model for 3D Medical Imaging
MTMed3D: A Multi-Task Transformer-Based Model for 3D Medical Imaging
Fan Li
Arun Iyengar
Lanyu Xu
MedImViT
124
0
0
15 Nov 2025
AGGRNet: Selective Feature Extraction and Aggregation for Enhanced Medical Image Classification
AGGRNet: Selective Feature Extraction and Aggregation for Enhanced Medical Image Classification
Ansh Makwe
Akansh Agrawal
Prateek Jain
Akshan Agrawal
Priyanka Bagade
109
0
0
15 Nov 2025
A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates
A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates
Wei-Cheng Lee
Francesco Orabona
123
19
0
14 Nov 2025
Spatial Reasoning in Multimodal Large Language Models: A Survey of Tasks, Benchmarks and Methods
Weichen Liu
Qiyao Xue
Haoming Wang
Xiangyu Yin
Boyuan Yang
Wei Gao
114
1
0
14 Nov 2025
Feature Quality and Adaptability of Medical Foundation Models: A Comparative Evaluation for Radiographic Classification and Segmentation
Feature Quality and Adaptability of Medical Foundation Models: A Comparative Evaluation for Radiographic Classification and Segmentation
Frank Li
Theo Dapamede
Mohammadreza Chavoshi
Young Seok Jeon
Bardia Khosravi
...
Janice Newsome
S. Purkayastha
Imon Banerjee
Hari M. Trivedi
J. Gichoya
AI4CE
109
0
0
12 Nov 2025
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
Jeongho Min
Dongyoung Kim
J. Lee
206
0
0
12 Nov 2025
Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
Longhua Li
Lei Qi
Xin Geng
ViT
128
0
0
12 Nov 2025
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
Duc Nguyen
Huu Binh Ta
Nhuan Le Duc
T. Nguyen
T. Tran
MoE
452
0
0
12 Nov 2025
MPCM-Net: Multi-scale network integrates partial attention convolution with Mamba for ground-based cloud image segmentation
MPCM-Net: Multi-scale network integrates partial attention convolution with Mamba for ground-based cloud image segmentation
Penghui Niu
Jiashuai She
Taotao Cai
Yajuan Zhang
Ping Zhang
Junhua Gu
Jianxin Li
81
0
0
12 Nov 2025
CSF-Net: Context-Semantic Fusion Network for Large Mask Inpainting
CSF-Net: Context-Semantic Fusion Network for Large Mask Inpainting
Chae-Yeon Heo
Yeong-Jun Cho
122
0
0
11 Nov 2025
How Modality Shapes Perception and Reasoning: A Study of Error Propagation in ARC-AGI
Bo Wen
Chen Wang
Erhan Bilal
52
0
0
11 Nov 2025
Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving
Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving
Jian Wang
Lijun He
Yixing Yong
Haixia Bi
Fan Li
AAML
294
0
0
11 Nov 2025
The Impact of Longitudinal Mammogram Alignment on Breast Cancer Risk Assessment
The Impact of Longitudinal Mammogram Alignment on Breast Cancer Risk Assessment
Solveig Thrun
Stine Hansen
Zijun Sun
Nele Blum
Suaiba Amina Salahuddin
...
Kristoffer Wickstrøm
Elisabeth Wetzer
Robert Jenssen
M. Stille
Michael C. Kampffmeyer
90
0
0
11 Nov 2025
H-Model: Dynamic Neural Architectures for Adaptive Processing
H-Model: Dynamic Neural Architectures for Adaptive Processing
Dmytro Hospodarchuk
98
0
0
11 Nov 2025
From Noise to Latent: Generating Gaussian Latents for INR-Based Image Compression
From Noise to Latent: Generating Gaussian Latents for INR-Based Image Compression
Chaoyi Lin
Yaojun Wu
Yue Li
Junru Li
Kai Zhang
Li Zhang
183
0
0
11 Nov 2025
Distributed Zero-Shot Learning for Visual Recognition
Distributed Zero-Shot Learning for Visual Recognition
Zhi Chen
Yadan Luo
Zi-Rui Huang
Jingjing Li
Sen Wang
Xin Yu
FedML
212
0
0
11 Nov 2025
Range Asymmetric Numeral Systems-Based Lightweight Intermediate Feature Compression for Split Computing of Deep Neural Networks
Range Asymmetric Numeral Systems-Based Lightweight Intermediate Feature Compression for Split Computing of Deep Neural Networks
Mingyu Sung
Suhwan Im
Vikas Palakonda
Jae-Mo Kang
101
0
0
11 Nov 2025
A Circular Argument : Does RoPE need to be Equivariant for Vision?
A Circular Argument : Does RoPE need to be Equivariant for Vision?
Chase van de Geijn
Timo Lüddecke
Polina Turishcheva
Alexander S. Ecker
160
2
0
11 Nov 2025
Rethinking Explanation Evaluation under the Retraining Scheme
Rethinking Explanation Evaluation under the Retraining Scheme
Yi Cai
Thibaud Ardoin
Mayank Gulati
Gerhard Wunder
LRM
124
0
0
11 Nov 2025
Cross Modal Fine-Grained Alignment via Granularity-Aware and Region-Uncertain Modeling
Cross Modal Fine-Grained Alignment via Granularity-Aware and Region-Uncertain Modeling
Jiale Liu
Haoming Zhou
Yishu Zhu
Bingzhi Chen
Yuncheng Jiang
166
0
0
11 Nov 2025
Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion
Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion
June Moh Goo
Zichao Zeng
Jan Boehm
87
0
0
10 Nov 2025
REOcc: Camera-Radar Fusion with Radar Feature Enrichment for 3D Occupancy Prediction
REOcc: Camera-Radar Fusion with Radar Feature Enrichment for 3D Occupancy Prediction
Chaehee Song
Sanmin Kim
H. Jeong
Juyeb Shin
Joonhee Lim
Dongsuk Kum
144
0
0
10 Nov 2025
Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
Hui Sun
Long Lv
Pingping Zhang
Tongdan Tang
Feng Tian
Weibing Sun
Huchuan Lu
Mamba
455
0
0
10 Nov 2025
QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations
QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations
Zhixiong Zhao
Haomin Li
Fangxin Liu
Yuncheng Lu
Zongwu Wang
Tao Yang
Li Jiang
Haibing Guan
263
2
0
10 Nov 2025
Beyond Boundaries: Leveraging Vision Foundation Models for Source-Free Object Detection
Beyond Boundaries: Leveraging Vision Foundation Models for Source-Free Object Detection
Huizai Yao
Sicheng Zhao
Pengteng Li
Yi Cui
Shuo Lu
Weiyu Guo
Yunfan Lu
Ziyang Chen
Hui Xiong
VLM
114
0
0
10 Nov 2025
LeCoT: revisiting network architecture for two-view correspondence pruning
LeCoT: revisiting network architecture for two-view correspondence pruning
Luanyuan Dai
Xiaoyu Du
Jinhui Tang
3DV3DPC
185
0
0
10 Nov 2025
MirrorMamba: Towards Scalable and Robust Mirror Detection in Videos
MirrorMamba: Towards Scalable and Robust Mirror Detection in Videos
Rui Song
Jiaying Lin
Rynson W. H. Lau
Mamba
229
0
0
10 Nov 2025
Anatomy-Aware Lymphoma Lesion Detection in Whole-Body PET/CT
Anatomy-Aware Lymphoma Lesion Detection in Whole-Body PET/CT
Simone Bendazzoli
A. Tzortzakakis
Andreas Abrahamsson
Björn Engelbrekt Wahlin
Orjan Smedby
Maria Holstensson
R. Moreno
ViTMedIm
362
1
0
10 Nov 2025
Previous
123456...169170171
Next
Page 3 of 171
Pageof 171