Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT
ArXiv (abs) | PDF | HTML | HuggingFace (5 upvotes) | GitHub (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,530 papers shown
Understanding What Is Not Said: Referring Remote Sensing Image Segmentation with Scarce Expressions
Kai Ye
Bowen Liu
Jianghang Lin
Jiayi Ji
Pingyang Dai
Liujuan Cao
84
0
0
26 Oct 2025
Simplifying Knowledge Transfer in Pretrained Models
Siddharth Jain
Shyamgopal Karthik
Vineet Gandhi
163
0
0
25 Oct 2025
Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation
Jeongin Kim
Wonho Bae
YouLee Han
Giyeong Oh
Youngjae Yu
Danica J. Sutherland
Junhyug Noh
DiffM
150
0
0
25 Oct 2025
Efficient Large-Deformation Medical Image Registration via Recurrent Dynamic Correlation
Tianran Li
Marius Staring
Yuchuan Qiao
MedIm
137
2
0
25 Oct 2025
Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need
Yongchuan Cui
Peng Liu
H. Zhang
OOD
140
0
0
25 Oct 2025
FrameShield: Adversarially Robust Video Anomaly Detection
Mojtaba Nafez
Mobina Poulaei
Nikan Vasei
Bardia Soltani Moakhar
Mohammad Sabokrou
M. Rohban
AAML
176
0
0
24 Oct 2025
S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
Orest Kupyn
Hirokatsu Kataoka
Christian Rupprecht
128
1
0
24 Oct 2025
Relieving the Over-Aggregating Effect in Graph Transformers
Junshu Sun
Wanxing Chang
Chenxue Yang
Qingming Huang
Shuhui Wang
146
0
0
24 Oct 2025
MAGIC-Flow: Multiscale Adaptive Conditional Flows for Generation and Interpretable Classification
Luca Caldera
Giacomo Bottacini
Lara Cavinato
OOD, MedIm
188
0
0
24 Oct 2025
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts
Yanguang Sun
Jiawei Lian
Jian Yang
Lei Luo
123
1
0
24 Oct 2025
AutoOpt: A Dataset and a Unified Framework for Automating Optimization Problem Solving
Ankur Sinha
Shobhit Arora
Dhaval Pujara
129
1
0
24 Oct 2025
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu
Yang Xiao
Wenjing Jia
Guangwei Gao
Guo-Jun Qi
Chia-Wen Lin
Mamba
222
0
0
24 Oct 2025
LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression (Technical Report)
Guozhong Li
Muhannad Alhumaidi
Spiros Skiadopoulos
Panos Kalnis
152
0
0
24 Oct 2025
Dynamic Semantic-Aware Correlation Modeling for UAV Tracking
Xinyu Zhou
Tongxin Pan
Lingyi Hong
Pinxue Guo
Haijing Guo
Zhaoyu Chen
Kaixun Jiang
Wenqiang Zhang
80
0
0
24 Oct 2025
Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling
Jinhee Kim
Jae Jun An
Kang Eun Jeon
Jong Hwan Ko
MQ
202
0
0
23 Oct 2025
Memory Constrained Dynamic Subnetwork Update for Transfer Learning
Ael Quélennec
Pavlo Mozharovskyi
Van-Tam Nguyen
Enzo Tartaglione
99
0
0
23 Oct 2025
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
Hao Yu
H. G. Chen
Yan Jiang
Wei Peng
Zhaodong Sun
Samuel Kaski
Guoying Zhao
156
0
0
23 Oct 2025
GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs
Guanghao Zheng
Bowen Shi
Mingxing Xu
Ruoyu Sun
Peisen Zhao
...
Wenrui Dai
Junni Zou
Hongkai Xiong
Xiaopeng Zhang
Qi Tian
VLM
163
0
0
23 Oct 2025
SutureBot: A Precision Framework & Benchmark For Autonomous End-to-End Suturing
Jesse Haworth
Juo-Tung Chen
Nigel Nelson
Ji Woong Kim
Masoud Moghani
Chelsea Finn
A. Krieger
182
2
0
23 Oct 2025
DARE: A Deformable Adaptive Regularization Estimator for Learning-Based Medical Image Registration
Ahsan Raza Siyal
Markus Haltmeier
R. Steiger
Malik Galijasevic
E. Gizewski
A. E. Grams
OOD, MedIm, CML
187
1
0
22 Oct 2025
SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion
Xiaozhi Li
Huijun Di
Jian Li
Feng Liu
Wei Liang
198
1
0
22 Oct 2025
Seabed-Net: A multi-task network for joint bathymetry estimation and seabed classification from remote sensing imagery in shallow waters
P. Agrafiotis
Tim Siebert
120
0
0
22 Oct 2025
Study of Training Dynamics for Memory-Constrained Fine-Tuning
Ael Quélennec
Nour Hezbri
Pavlo Mozharovskyi
Van-Tam Nguyen
Enzo Tartaglione
106
1
0
22 Oct 2025
Guiding diffusion models to reconstruct flow fields from sparse data
Marc Amorós-Trepat
Luis Medrano-Navarro
Qiang Liu
Luca Guastoni
Nils Thuerey
DiffM, AI4CE
213
3
0
22 Oct 2025
Matrix-Free Least Squares Solvers: Values, Gradients, and What to Do With Them
Hrittik Roy
Søren Hauberg
Nicholas Krämer
155
1
0
22 Oct 2025
FutrTrack: A Camera-LiDAR Fusion Transformer for 3D Multiple Object Tracking
Martha Teiko Teye
Ori Maoz
Matthias Rottmann
206
0
0
22 Oct 2025
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields
Woo Jae Kim
Kyu Beom Han
Y. Cho
Youngju Na
Junsik Jung
Sooel Son
Sung-eui Yoon
AAML
166
0
0
22 Oct 2025
ProLAP: Probabilistic Language-Audio Pre-Training
Toranosuke Manabe
Yuchi Ishikawa
Hokuto Munakata
Tatsuya Komatsu
139
0
0
21 Oct 2025
Integrated representational signatures strengthen specificity in brains and models
Jialin Wu
Shreya Saha
Yiqing Bo
Meenakshi Khosla
89
0
0
21 Oct 2025
Detection and Simulation of Urban Heat Islands Using a Fine-Tuned Geospatial Foundation Model for Microclimate Impact Prediction
Jannis Fleckenstein
David Kreismann
Tamara Rosemary Govindasamy
Thomas Brunschwiler
Etienne Vos
Mattia Rigotti
AI4CE
69
0
0
21 Oct 2025
A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition
Peiqin Zhuang
Wenlong Zhang
Yichao Wu
Ding Liang
Luping Zhou
Yali Wang
Wanli Ouyang
213
0
0
21 Oct 2025
Learning Task-Agnostic Representations through Multi-Teacher Distillation
Philippe Formont
Maxime Darrin
Banafsheh Karimian
Jackie Chi Kit Cheung
Eric Granger
Ismail Ben Ayed
Mohammadhadi Shateri
Pablo Piantanida
165
0
0
21 Oct 2025
Δt-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction
Zhengbo Zhou
Dooman Arefan
M. Zuley
Shandong Wu
Mamba
198
0
0
21 Oct 2025
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
Kyo Kuroki
Yasuyuki Okoshi
Thiem Van Chu
Kazushi Kawamura
Masato Motomura
MQ
225
0
0
21 Oct 2025
ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters
Zhiwei Hao
Jianyuan Guo
Li Shen
Kai Han
Yehui Tang
Han Hu
Yunhe Wang
234
0
0
21 Oct 2025
UltraGen: High-Resolution Video Generation with Hierarchical Attention
Teng Hu
Jiangning Zhang
Zihan Su
Ran Yi
DiffM, VGen
210
5
0
21 Oct 2025
SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
Wenxun Wang
Shuchang Zhou
Wenyu Sun
Peiqin Sun
Y. Liu
138
40
0
20 Oct 2025
Accelerating Vision Transformers with Adaptive Patch Sizes
Rohan Choudhury
JungEun Kim
Jeongseok Lee
Eunho Yang
László A. Jeni
Kishore Venkateshan
ViT
123
1
0
20 Oct 2025
Rethinking PCA Through Duality
Jan Quan
Johan A. K. Suykens
Panagiotis Patrinos
100
0
0
20 Oct 2025
M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception
U.V.B.L Udugama
G. Vosselman
F. Nex
137
0
0
20 Oct 2025
ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification
Athanasios Angelakis
Amne Mousa
Micah L. A. Heldeweg
Laurens A. Biesheuvel
Mark A. Haaksma
Jasper M. Smit
Pieter R. Tuinman
Paul W. G. Elbers
MedIm
97
0
0
20 Oct 2025
Facial Expression-based Parkinson's Disease Severity Diagnosis via Feature Fusion and Adaptive Class Balancing
Yintao Zhou
Wei Huang
Zhengyu Li
Jing Huang
Meng Pang
CVBM
222
0
0
20 Oct 2025
ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification
Akhila Kambhatla
Taminul Islam
Khaled R Ahmed
ViT
161
0
0
19 Oct 2025
UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid
Tianyang Dou
Ming Li
J. Qin
Xuan Liao
J. Zhong
Armin Gruen
Mengyi Deng
ViT
191
0
0
19 Oct 2025
Beyond RGB: Leveraging Vision Transformers for Thermal Weapon Segmentation
Akhila Kambhatla
Ahmed R Khaled
ViT
82
0
0
19 Oct 2025
ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification
Yahia Battach
Abdulwahab Felemban
Faizan Farooq Khan
Yousef Radwan
Xiang Li
Fabio Marchese
Sara Beery
Burton H. Jones
Francesca Benzoni
Mohamed Elhoseiny
148
0
0
19 Oct 2025
BARL: Bilateral Alignment in Representation and Label Spaces for Semi-Supervised Volumetric Medical Image Segmentation
Shujian Gao
Y Samuel Wang
Zekuan Yu
117
0
0
19 Oct 2025
Efficient High-Accuracy PDEs Solver with the Linear Attention Neural Operator
Ming Zhong
Zhenya Yan
AI4CE
120
0
0
19 Oct 2025
Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
Chenxu Li
Zhicai Wang
Yuan Sheng
Xingyu Zhu
Y. Hao
Xiang Wang
AAML
205
0
0
19 Oct 2025
EDVD-LLaMA: Explainable Deepfake Video Detection via Multimodal Large Language Model Reasoning
Haoran Sun
Chen Cai
Huiping Zhuang
Kong Aik Lee
Lap-Pui Chau
Yi Wang
123
0
0
18 Oct 2025