Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT
ArXiv (abs) | PDF | HTML | HuggingFace (5 upvotes) | GitHub (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,530 papers shown
Good Representation, Better Explanation: Role of Convolutional Neural Networks in Transformer-Based Remote Sensing Image Captioning
Swadhin Das
Saarthak Gupta
Kamal Kumar
Raksha Sharma
201
2
0
22 Feb 2025
TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba
Xiuwei Chen
Sihao Lin
Xiao Dong
Sihao Lin
Meng Cao
Jiawei Han
Yina Zhuang
J. N. Han
Hang Xu
Xiaodan Liang
Mamba
370
4
0
21 Feb 2025
Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Rongzhao He
Weihao Zheng
Leilei Zhao
Ying Wang
Dalin Zhu
Dan Wu
Bin Hu
Mamba
734
3
0
21 Feb 2025
Tight Clusters Make Specialized Experts
International Conference on Learning Representations (ICLR), 2025
Stefan K. Nielsen
R. Teo
Laziz U. Abdullaev
Tan M. Nguyen
MoE
467
6
0
21 Feb 2025
Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning
Transportation Research Record (TRR), 2023
Yongqi Dong
Xingmin Lu
Ruohan Li
Wei Song
B. Arem
Haneen Farah
ViT
394
2
0
21 Feb 2025
Dissecting Human Body Representations in Deep Networks Trained for Person Identification
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Thomas M. Metz
Matthew Q. Hill
Blake Myers
Veda Nandan Gandi
Rahul Chilakapati
A. O’Toole
CVBM
3DH
260
3
0
21 Feb 2025
Myna: Masking-Based Contrastive Learning of Musical Representations
Ori Yonay
Tracy Hammond
Tianbao Yang
AAML
372
0
0
20 Feb 2025
Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition
Tianyi Shang
Zhenyu Li
Pengjie Xu
Jinwei Qiao
Gang Chen
Zihan Ruan
Weijun Hu
436
4
0
20 Feb 2025
Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
491
1
0
20 Feb 2025
Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields
Computer Vision and Pattern Recognition (CVPR), 2023
Taewoo Kim
Yujeong Chae
Hyun-Kurl Jang
Kuk-Jin Yoon
372
45
0
20 Feb 2025
Variance Reduction Methods Do Not Need to Compute Full Gradients: Improved Efficiency through Shuffling
Daniil Medyakov
Gleb Molodtsov
S. Chezhegov
Alexey Rebrikov
Aleksandr Beznosikov
454
1
0
20 Feb 2025
Quantifying Memorization and Parametric Response Rates in Retrieval-Augmented Vision-Language Models
Peter Carragher
Abhinand Jha
R Raghav
Kathleen M. Carley
RALM
395
0
0
19 Feb 2025
A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song
Haoqiang Lin
Haokun Wen
Bohan Hou
Mingzhu Xu
Liqiang Nie
482
10
0
19 Feb 2025
MaxSup: Overcoming Representation Collapse in Label Smoothing
Yuxuan Zhou
Heng Li
Zhi-Qi Cheng
Xudong Yan
Yifei Dong
Mario Fritz
Margret Keuper
511
0
0
18 Feb 2025
RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals
Jaemu Heo
Eldor Fozilov
Hyunmin Song
Taehwan Kim
123
1
0
18 Feb 2025
Unsupervised Structural-Counterfactual Generation under Domain Shift
Krishn Vishwas Kher
Lokesh Venkata Siva Maruthi Badisa
Saksham Mittal
Kusampudi Venkata Datta Sri Harsha
Chitneedi Geetha Sowmya
SakethaNath Jagarlapudi
OOD
CML
249
0
0
17 Feb 2025
Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization
Yuanze Xu
Ming Dai
Wenxiao Cai
Wankou Yang
267
3
0
17 Feb 2025
ProMRVL-CAD: Proactive Dialogue System with Multi-Round Vision-Language Interactions for Computer-Aided Diagnosis
Xueshen Li
Xinlong Hou
Ziyi Huang
Yu Gan
LM&MA
MedIm
222
0
0
15 Feb 2025
NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing
Shutong Zhang
357
1
0
15 Feb 2025
Harnessing Vision Models for Time Series Analysis: A Survey
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Jingchao Ni
Ziming Zhao
ChengAo Shen
Hanghang Tong
Dongjin Song
Wei Cheng
Dongsheng Luo
Haifeng Chen
AI4TS
505
16
0
13 Feb 2025
CFIRSTNET: Comprehensive Features for Static IR Drop Estimation with Neural Network
International Conference on Computer Aided Design (ICCAD), 2024
Yu-Tung Liu
Yu-Hao Cheng
Shao-Yu Wu
Hung-Ming Chen
171
3
0
13 Feb 2025
Towards Virtual Clinical Trials of Radiology AI with Conditional Generative Modeling
Benjamin Killeen
Bohua Wan
Aditya V. Kulkarni
Nathan G. Drenkow
Michael Oberst
Paul H. Yi
Mathias Unberath
MedIm
281
1
0
13 Feb 2025
CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
IEEE International Conference on Robotics and Automation (ICRA), 2025
Chenghao Zhang
Lubin Fan
Shen Cao
Bojian Wu
Jieping Ye
478
0
0
13 Feb 2025
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding
Konstantin Berestizshevsky
Renzo Andri
Lukas Cavigelli
442
2
0
12 Feb 2025
Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection
Zhiyong Yang
Kehan Wang
Yuhang Ming
Yong Peng
Han Yang
Qiong Chen
Wanzeng Kong
330
0
0
12 Feb 2025
Color-Quality Invariance for Robust Medical Image Segmentation
Ravi Shah
Atsushi Fukuda
Q. H. Cap
OOD
435
0
0
11 Feb 2025
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer
ACM Multimedia (MM), 2024
Wenxi Li
Yuchen Guo
Jilai Zheng
Haozhe Lin
Chao Ma
Lu Fang
Yunbo Wang
ViT
472
5
0
11 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
343
0
0
11 Feb 2025
The Value of Information in Human-AI Decision-making
Ziyang Guo
Yifan Wu
Jason D. Hartline
Jessica Hullman
FAtt
703
9
0
10 Feb 2025
KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yue Zhu
Haiwen Diao
Shang Gao
Long Chen
Huchuan Lu
513
1
0
10 Feb 2025
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing
Sicen Guo
Tianyou Wen
Chuang-Wei Liu
Qijun Chen
Rui Fan
423
1
0
10 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
449
9
0
10 Feb 2025
Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures
Yaoxin Yang
Peng Ye
Weihao Lin
Kangcong Li
Yan Wen
Jia Hao
Tao Chen
323
0
0
10 Feb 2025
Learning Clustering-based Prototypes for Compositional Zero-shot Learning
International Conference on Learning Representations (ICLR), 2025
Hongyu Qu
Jianan Wei
Xiangbo Shu
Wenguan Wang
VLM
462
17
0
10 Feb 2025
Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Blake Myers
Matthew Q. Hill
Veda Nandan Gandi
Thomas M. Metz
A. O’Toole
188
2
0
10 Feb 2025
Cell Nuclei Detection and Classification in Whole Slide Images with Transformers
Oscar Pina
Eduard Dorca
Verónica Vilaplana
158
0
0
10 Feb 2025
Linear Attention Modeling for Learned Image Compression
Computer Vision and Pattern Recognition (CVPR), 2025
Donghui Feng
Zhengxue Cheng
Shen Wang
Ronghua Wu
Hongwei Hu
Guo Lu
Li Song
749
7
0
09 Feb 2025
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations
Computer Vision and Pattern Recognition (CVPR), 2025
Krishna Sri Ipsit Mantri
Carola-Bibiane Schönlieb
Bruno Ribeiro
Chaim Baskin
Moshe Eliasof
482
5
0
09 Feb 2025
AI-Driven HSI: Multimodality, Fusion, Challenges, and the Deep Learning Revolution
David S. Bhatti
Yougin Choi
Rahman S M Wahidur
Maleeka Bakhtawar
Sumin Kim
Surin Lee
Yongtae Lee
Heung-No Lee
337
5
0
09 Feb 2025
Contrastive Representation Distillation via Multi-Scale Feature Decoupling
Cuipeng Wang
Tieyuan Chen
377
1
0
09 Feb 2025
A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation
Canxuan Gang
MedIm
ViT
272
1
0
08 Feb 2025
Drone Detection and Tracking with YOLO and a Rule-based Method
Purbaditya Bhattacharya
Patrick Nowak
361
2
0
07 Feb 2025
Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
Feng Wang
Yaodong Yu
Guoyizhe Wei
Wei Shao
Yuyin Zhou
Alan Yuille
Cihang Xie
ViT
406
19
0
06 Feb 2025
L2GNet: Optimal Local-to-Global Representation of Anatomical Structures for Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
N. Dasu
Rekha Singhal
KC Santosh
Debesh Jha
173
0
0
06 Feb 2025
Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free
Gian Mario Favero
Parham Saremi
Emily Kaczmarek
Brennan Nichyporuk
Tal Arbel
DiffM
MedIm
322
5
0
06 Feb 2025
Improving Adversarial Robustness via Phase and Amplitude-aware Prompting
Yibo Xu
Dawei Zhou
Decheng Liu
N. Wang
AAML
267
0
0
06 Feb 2025
All-in-One Image Compression and Restoration
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Huimin Zeng
Jiacheng Li
Ziqiang Zheng
Zhiwei Xiong
342
2
0
05 Feb 2025
Edge Attention Module for Object Classification
Santanu Roy
Ashvath Suresh
Archit Gupta
242
1
0
05 Feb 2025
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning
International Conference on Learning Representations (ICLR), 2025
Zhekai Du
Yinjie Min
Jingjing Li
Ke Lu
Changliang Zou
Liuhua Peng
Tingjin Chu
Mingming Gong
910
3
0
05 Feb 2025
Exploiting Ensemble Learning for Cross-View Isolated Sign Language Recognition
The Web Conference (WWW), 2025
Fei Wang
Kun Li
Yiqi Nie
Zhangling Duan
Peng Zou
Zhikai Wu
Longji Xu
Yanyan Wei
SLR
466
10
0
04 Feb 2025