ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
v1v2 (latest)

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)Github (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,530 papers shown
Particle Trajectory Representation Learning with Masked Point Modeling
Particle Trajectory Representation Learning with Masked Point Modeling
Sam Young
Yeon-jae Jwa
Kazuhiro Terao
3DPC
346
3
0
04 Feb 2025
MATCNN: Infrared and Visible Image Fusion Method Based on Multi-scale CNN with Attention Transformer
MATCNN: Infrared and Visible Image Fusion Method Based on Multi-scale CNN with Attention TransformerIEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2025
Jingjing Liu
Li Zhang
Xiaoyang Zeng
Wanquan Liu
Jing Zhang
287
11
0
04 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
650
4
0
04 Feb 2025
A Framework for Double-Blind Federated Adaptation of Foundation Models
A Framework for Double-Blind Federated Adaptation of Foundation Models
Nurbek Tastan
Karthik Nandakumar
FedML
322
0
0
03 Feb 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model AdaptationInternational Conference on Learning Representations (ICLR), 2025
Can Jin
Ying Li
Mingyu Zhao
Shiyu Zhao
Zhenting Wang
Xiaoxiao He
Ligong Han
Tong Che
Dimitris N. Metaxas
VPVLMVLM
1.1K
8
0
02 Feb 2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as ExpertsInternational Conference on Learning Representations (ICLR), 2025
Divya J. Bajpai
M. Hanawal
426
3
0
02 Feb 2025
Contrastive Forward-Forward: A Training Algorithm of Vision Transformer
Contrastive Forward-Forward: A Training Algorithm of Vision TransformerNeural Networks (NN), 2025
Hossein Aghagolzadeh
Mehdi Ezoji
ViT
464
2
0
01 Feb 2025
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic EncodingTowards Autonomous Robotic Systems (TAROS), 2025
Jingming Xia
Guanqun Cao
Guang Ma
Yiben Luo
Qinzhao Li
John Oyekan
MDE
313
0
0
01 Feb 2025
Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion
Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion
Binchi Zhang
Zaiyi Zheng
Zhengzhang Chen
Wenlin Yao
667
6
0
01 Feb 2025
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Shenghao Fu
Q. Yang
Qijie Mo
Junkai Yan
Xihan Wei
Jingke Meng
Xiaohua Xie
Wei-Shi Zheng
MLLMObjDVLM
453
33
0
31 Jan 2025
Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation
Ground Awareness in Deep Learning for Large Outdoor Point Cloud SegmentationVISIGRAPP (VISIGRAPP), 2025
Kevin Qiu
Dimitri Bulatov
Dorota Iwaszczuk
3DPC
269
2
0
30 Jan 2025
VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback
VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human FeedbackMachine Learning with Applications (MLWA), 2025
Sayeh Gholipour Picha
D. Chanti
A. Caplier
MedIm
338
1
0
29 Jan 2025
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
Yoojin Jang
Junsu Kim
H. Kim
Eun-ki Lee
Eun-sol Kim
Seungryul Baek
Jaejun Yoo
187
0
0
28 Jan 2025
State-space models are accurate and efficient neural operators for dynamical systems
State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu
Nazanin Ahmadi Daryakenari
Qianli Shen
Kenji Kawaguchi
George Karniadakis
MambaAI4CE
496
26
0
28 Jan 2025
MultiPDENet: PDE-embedded Learning with Multi-time-stepping for Accelerated Flow Simulation
Qi Wang
Yuan Mi
Jian Shu
Yi Zhang
Ruizhi Chengze
Hongsheng Liu
J. Wen
Hao Sun
AI4CE
302
5
0
28 Jan 2025
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled VideosAAAI Conference on Artificial Intelligence (AAAI), 2025
Yingying Jiao
Zhigang Wang
Sifan Wu
Shaojing Fan
Zhenguang Liu
Zhuoyue Xu
Zheqi Wu
400
4
0
28 Jan 2025
V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection
Sichao Wang
Chuang Zhang
Ming Yuan
Qing Xu
Lei He
Jianqiang Wang
358
4
0
28 Jan 2025
Collective Intelligence for 2D Push Manipulations with Mobile Robots
Collective Intelligence for 2D Push Manipulations with Mobile RobotsIEEE Robotics and Automation Letters (RA-L), 2022
So Kuroki
T. Matsushima
Jumpei Arima
Hiroki Furuta
Yutaka Matsuo
S. Gu
Yujin Tang
447
5
0
28 Jan 2025
Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact ReductionIEEE Transactions on Medical Imaging (IEEE TMI), 2025
Chenglong Ma
Zilong Li
Yongqian Li
Jing Han
Junping Zhang
Yi Zhang
Jiannan Liu
Hongming Shan
220
3
0
28 Jan 2025
MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis
MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis
Mai A. Shaaban
Adnan Khan
Mohammad Yaqub
LM&MA
422
5
0
28 Jan 2025
Prion-ViT: Prions-Inspired Vision Transformers for Temperature prediction with Specklegrams
Prion-ViT: Prions-Inspired Vision Transformers for Temperature prediction with Specklegrams
Abhishek Sebastian
Pragna R
Sonaa Rajagopal
Muralikrishnan Mani
310
2
0
28 Jan 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjDVLM
1.0K
0
0
28 Jan 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
iFormer: Integrating ConvNet and Transformer for Mobile ApplicationInternational Conference on Learning Representations (ICLR), 2025
Chuanyang Zheng
ViT
397
4
0
26 Jan 2025
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised DataInternational Conference on Learning Representations (ICLR), 2025
Jiajie Li
Brian R Quaranto
Chenhui Xu
Ishan Mishra
Ruiyang Qin
Dancheng Liu
Peter C W Kim
Jinjun Xiong
435
2
0
25 Jan 2025
PolaFormer: Polarity-aware Linear Attention for Vision TransformersInternational Conference on Learning Representations (ICLR), 2025
Weikang Meng
Yadan Luo
Xin Li
Shihong Deng
Zheng Zhang
1.1K
36
0
25 Jan 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Rethinking Encoder-Decoder Flow Through Shared StructuresIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
251
0
0
24 Jan 2025
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation ModelsIEEE International Symposium on Biomedical Imaging (ISBI), 2025
Jakob Krogh Petersen
Valdemar Licht
Mads Nielsen
Asbjørn Munk
VLM
190
0
0
23 Jan 2025
FreEformer: Frequency Enhanced Transformer for Multivariate Time Series Forecasting
FreEformer: Frequency Enhanced Transformer for Multivariate Time Series ForecastingInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Wenzhen Yue
Yixiao Liu
Xianghua Ying
Bowei Xing
Ruohao Guo
Ji Shi
AI4TS
209
13
0
23 Jan 2025
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Ella Koresh
Ronit D. Gross
Yuval Meir
Yarden Tzach
Tal Halevi
Ido Kanter
ViT
383
8
0
22 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Parallel Sequence Modeling via Generalized Spatial Propagation NetworkComputer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
847
3
0
21 Jan 2025
Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2
Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2Engineering Reports (ER), 2025
Md. Rakibul Islam
Md. Zahid Hossain
Mustofa Ahmed
Most. Sharmin Sultana Samu
LM&MAMedIm
239
3
0
21 Jan 2025
Towards Accurate Unified Anomaly Segmentation
Towards Accurate Unified Anomaly SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Wenxin Ma
Qingsong Yao
Xiang Zhang
Zhelong Huang
Zihang Jiang
S. Kevin Zhou
367
11
0
21 Jan 2025
A margin-based replacement for cross-entropy loss
A margin-based replacement for cross-entropy loss
Michael W. Spratling
Heiko H. Schütt
319
0
0
21 Jan 2025
Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation
Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector AggregationIEEE International Symposium on Biomedical Imaging (ISBI), 2025
R. Gupta
Shounak Das
Ardhendu Sekhar
Amit Sethi
163
0
0
21 Jan 2025
Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis
Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial AnalysisAnnual International Computer Software and Applications Conference (COMPSAC), 2025
Hongjun Liu
Changwei Song
Jiaqi Qiang
Jianqiang Li
Hui Pan
Lin Lu
Xiao Long
Qing Zhao
Jiuzuo Huang
Shi Chen
MedIm
84
1
0
21 Jan 2025
TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism
TFLOP: Table Structure Recognition Framework with Layout Pointer MechanismInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Minsoo Khang
Teakgyu Hong
LMTD
314
6
0
21 Jan 2025
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
Branislava Jankovic
Sabina Jangirova
Waseem Ullah
Latif U. Khan
Mohsen Guizani
391
5
0
21 Jan 2025
A Survey on Memory-Efficient Transformer-Based Model Training in AI for Science
A Survey on Memory-Efficient Transformer-Based Model Training in AI for Science
Kaiyuan Tian
Linbo Qiao
Baihui Liu
Gongqingjian Jiang
Shanshan Li
Dongsheng Li
383
0
0
21 Jan 2025
A generalizable 3D framework and model for self-supervised learning in medical imaging
A generalizable 3D framework and model for self-supervised learning in medical imaging
Tony Xu
Sepehr Hosseini
Chris Anderson
Anthony Rinaldi
Rahul G. Krishnan
Anne L. Martel
Maged Goubran
MedIm
343
11
0
20 Jan 2025
Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images
Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional ImagesIEEE transactions on multimedia (TMM), 2025
Jiebin Yan
Jiale Rao
Xuelin Liu
Yuming Fang
Yifan Zuo
Weide Liu
181
9
0
20 Jan 2025
ACE: Anatomically Consistent Embeddings in Composition and Decomposition
ACE: Anatomically Consistent Embeddings in Composition and DecompositionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Ziyu Zhou
Haozhe Luo
M. Taher
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
MedIm
358
0
0
20 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
199
7
0
20 Jan 2025
Elucidating the Design Space of Dataset Condensation
Elucidating the Design Space of Dataset CondensationNeural Information Processing Systems (NeurIPS), 2024
Shitong Shao
Zikai Zhou
Huanran Chen
Zhiqiang Shen
DD
755
24
0
20 Jan 2025
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural NetworksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Michael Schwingshackl
Fabio Francisco Oberweger
Markus Murschitz
266
1
0
20 Jan 2025
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRIIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
N. Shah
Ayan Kashyap
Shirish S. Karande
Vineet Gandhi
240
1
0
20 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in MedicineIEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CELM&MAVLM
780
75
0
17 Jan 2025
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
FutureDepth: Learning to Predict the Future Improves Video Depth EstimationEuropean Conference on Computer Vision (ECCV), 2024
R. Yasarla
Manish Kumar Singh
Hong Cai
Yunxiao Shi
Jisoo Jeong
Yinhao Zhu
Shizhong Han
Risheek Garrepalli
Fatih Porikli
MDE
516
12
0
17 Jan 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
457
8
0
17 Jan 2025
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
MAMo: Leveraging Memory and Attention for Monocular Video Depth EstimationIEEE International Conference on Computer Vision (ICCV), 2023
R. Yasarla
H. Cai
Jisoo Jeong
Y. Shi
Risheek Garrepalli
Fatih Porikli
MDE
601
27
0
17 Jan 2025
Unified Face Matching and Physical-Digital Spoofing Attack Detection
Unified Face Matching and Physical-Digital Spoofing Attack Detection
Arun Kunwar
Ajita Rattani
CVBMAAML
289
0
0
17 Jan 2025
Previous
123...363738...169170171
Next
Page 37 of 171
Pageof 171