ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
v1v2 (latest)

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
    ViT
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)Github (14835★)

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,519 papers shown
Dual Degradation-Inspired Deep Unfolding Network for Low-Light Image Enhancement
Dual Degradation-Inspired Deep Unfolding Network for Low-Light Image Enhancement
Huake Wang
Xingsong Hou
Xiaoyang Yan
Kaibing Zhang
Xiangyong Cao
Xueming Qian
519
3
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language TasksNeural Information Processing Systems (NeurIPS), 2024
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLMVLMLRM
850
119
0
03 Jan 2025
Boosting Adversarial Transferability with Spatial Adversarial Alignment
Boosting Adversarial Transferability with Spatial Adversarial Alignment
Zhaoyu Chen
Haijing Guo
Kaixun Jiang
Jiyuan Fu
Xinyu Zhou
Jinjie Wei
Hao Tang
Yue Liu
Wenqiang Zhang
AAML
361
1
0
02 Jan 2025
STARFormer: A Novel Spatio-Temporal Aggregation Reorganization Transformer of FMRI for Brain Disorder Diagnosis
STARFormer: A Novel Spatio-Temporal Aggregation Reorganization Transformer of FMRI for Brain Disorder DiagnosisNeural Networks (NN), 2024
Wenhao Dong
Yuchen Ren
Weiming Zeng
Lei Chen
Hongjie Yan
W. Siok
Nizhuan Wang
282
2
0
31 Dec 2024
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
From Generalist to Specialist: A Survey of Large Language Models for ChemistryInternational Conference on Computational Linguistics (COLING), 2024
Yang Han
Ziping Wan
Lu Chen
Kai Yu
Xin Chen
LM&MA
268
6
0
31 Dec 2024
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion MambaAAAI Conference on Artificial Intelligence (AAAI), 2024
Andong Lu
Wanyu Wang
Chenglong Li
Jin Tang
Bin Luo
Mamba
300
12
0
31 Dec 2024
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
Mohamed Djilani
Salah Ghamizi
Maxime Cordy
415
1
0
31 Dec 2024
Unlocking adaptive digital pathology through dynamic feature learning
Unlocking adaptive digital pathology through dynamic feature learning
Jiawen Li
Tian Guan
Qingxin Xia
Yanjie Wang
Xitong Ling
...
Xiu-Wu Bian
Liang Luo
Lingchuan Guo
Chao He
Yonghong He
AI4CE
210
1
0
31 Dec 2024
Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability
Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted TransferabilityIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Hui Zeng
Sanshuai Cui
Biwei Chen
Anjie Peng
AAML
322
0
0
31 Dec 2024
VMamba: Visual State Space Model
VMamba: Visual State Space ModelNeural Information Processing Systems (NeurIPS), 2024
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
1.1K
1,554
0
31 Dec 2024
A Contrastive Pretrain Model with Prompt Tuning for Multi-center Medication Recommendation
A Contrastive Pretrain Model with Prompt Tuning for Multi-center Medication Recommendation
Qiang Liu
Zhaopeng Qiu
Xiangyu Zhao
X. Wu
Zijian Zhang
Tong Xu
Feng Tian
424
7
0
31 Dec 2024
Combating Label Noise With A General Surrogate Model For Sample Selection
Combating Label Noise With A General Surrogate Model For Sample SelectionInternational Journal of Computer Vision (IJCV), 2023
Chao Liang
Linchao Zhu
Humphrey Shi
Yi Yang
VLMNoLa
285
4
0
31 Dec 2024
MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning
MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning
Chunpu Liu
Guanglei Yang
Wangmeng Zuo
Tianyi Zan
MDE
343
0
0
31 Dec 2024
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Jiangming Wang
DiffM
453
1
0
31 Dec 2024
Open-Set Object Detection By Aligning Known Class Representations
Open-Set Object Detection By Aligning Known Class RepresentationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Hiran Sarkar
Vishal M. Chudasama
N. Onoe
Pankaj Wasnik
Vineeth N. Balasubramanian
ObjD
208
8
0
31 Dec 2024
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
Yuchen Ren
Xianrui Li
Yunheng Li
Yanzhe Zhang
Yimian Dai
Qibin Hou
Ming-Ming Cheng
Zhiqiang Wang
533
20
0
30 Dec 2024
PTQ4VM: Post-Training Quantization for Visual Mamba
PTQ4VM: Post-Training Quantization for Visual MambaIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jun-gyu Jin
Changhun Lee
Seonggon Kim
Eunhyeok Park
MQMamba
334
7
0
29 Dec 2024
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Jingbo Lin
Zhilu Zhang
Wenbo Li
Renjing Pei
Hang Xu
Hongzhi Zhang
Wangmeng Zuo
618
6
0
28 Dec 2024
Towards Visual Grounding: A Survey
Towards Visual Grounding: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
985
31
0
28 Dec 2024
Optimizing Local-Global Dependencies for Accurate 3D Human Pose
  Estimation
Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation
Guangsheng Xu
Guoyi Zhang
Lejia Ye
Shuwei Gan
Xiaohu Zhang
Xia Yang
180
1
0
27 Dec 2024
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
352
0
0
27 Dec 2024
Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach
Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach
Eric Hirsch
Christian Friedrich
428
2
0
27 Dec 2024
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for
  Unseen Entities
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities
Jiawei Yu
Xiang Geng
Yongqian Li
Mengxin Ren
Wei Tang
...
Zhibin Lan
Hao Fei
Hao Yang
Shujian Huang
Jinsong Su
216
1
0
26 Dec 2024
DRDM: A Disentangled Representations Diffusion Model for Synthesizing
  Realistic Person Images
DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images
Enbo Huang
Yuan Zhang
Faliang Huang
Guangyu Zhang
Wenshu Fan
DiffM
195
0
0
25 Dec 2024
Advancing Deformable Medical Image Registration with Multi-axis
  Cross-covariance Attention
Advancing Deformable Medical Image Registration with Multi-axis Cross-covariance Attention
Mingyuan Meng
M. Fulham
Lei Bi
Jinman Kim
OODViTMedIm
188
0
0
24 Dec 2024
Cross-View Referring Multi-Object Tracking
Cross-View Referring Multi-Object TrackingAAAI Conference on Artificial Intelligence (AAAI), 2024
Sijia Chen
En Yu
Wenbing Tao
VOT
342
8
0
23 Dec 2024
Detail-Preserving Latent Diffusion for Stable Shadow Removal
Detail-Preserving Latent Diffusion for Stable Shadow RemovalComputer Vision and Pattern Recognition (CVPR), 2024
Jiamin Xu
Yuxin Zheng
Zelong Li
Chi-Yin Wang
Renshu Gu
Wenyuan Xu
Gang Xu
DiffM
157
7
0
23 Dec 2024
Personalized Large Vision-Language Models
Personalized Large Vision-Language Models
Chau Pham
Hoang Phan
David Doermann
Yunjie Tian
VLM
325
7
0
23 Dec 2024
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized
  Images
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Risa Shinoda
Kuniaki Saito
Shohei Tanaka
Tosho Hirasawa
Yoshitaka Ushiku
180
3
0
23 Dec 2024
PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging
PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging
Mattias Paul Heinrich
3DPC
250
1
0
23 Dec 2024
A Conditional Diffusion Model for Electrical Impedance Tomography Image
  Reconstruction
A Conditional Diffusion Model for Electrical Impedance Tomography Image ReconstructionIEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2024
Shuaikai Shi
Ruiyuan Kang
P. Liatsis
MedImDiffM
294
5
0
22 Dec 2024
ImagineMap: Enhanced HD Map Construction with SD Maps
ImagineMap: Enhanced HD Map Construction with SD Maps
Yishen Ji
Zhiqi Li
Tong Lu
321
1
0
22 Dec 2024
Adaptive Dataset Quantization
Adaptive Dataset QuantizationAAAI Conference on Artificial Intelligence (AAAI), 2024
Muquan Li
Dongyang Zhang
Qiang Dong
Xiurui Xie
Ke Qin
DDMQ
382
3
0
22 Dec 2024
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation
  via Hierarchical Modality Selection
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Jiazhou Zhou
Lin Wang
Xuming Hu
332
12
0
22 Dec 2024
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
Tien-Yu Chi
Hung-Yueh Chiang
Chi-Chih Chang
N. Huang
Kai-Chiang Wu
254
1
0
21 Dec 2024
ImagePiece: Content-aware Re-tokenization for Efficient Image
  Recognition
ImagePiece: Content-aware Re-tokenization for Efficient Image RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2024
Seungdong Yoa
Seungjun Lee
Hyeseung Cho
Bumsoo Kim
Woohyung Lim
ViT
219
1
0
21 Dec 2024
Sensitive Image Classification by Vision Transformers
Sensitive Image Classification by Vision TransformersIEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
Hanxian He
Campbell Wilson
Thanh Thi Nguyen
Janis Dalins
ViT
323
1
0
21 Dec 2024
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Mingliang Xu
Yuyao Zhou
Yuxin Zhang
Shen Li
Shen Li
Jiayi Ji
Zhanpeng Zeng
Rongrong Ji
MQ
841
0
0
21 Dec 2024
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks
Yaming Zhang
Chenqiang Gao
Fangcen Liu
Junjie Guo
Lan Wang
Xinggan Peng
Deyu Meng
562
0
0
21 Dec 2024
Segmentation of arbitrary features in very high resolution remote
  sensing imagery
Segmentation of arbitrary features in very high resolution remote sensing imagery
Henry Cording
Yves Plancherel
Pablo Brito-Parada
312
1
0
20 Dec 2024
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Zhiqiang Tang
Zihan Zhong
Tong He
Gerald Friedland
384
4
0
19 Dec 2024
TRecViT: A Recurrent Video Transformer
TRecViT: A Recurrent Video Transformer
Viorica Patraucean
Xu He
Joseph Heyward
Chuhan Zhang
Mehdi S. M. Sajjadi
...
Ross Goroshin
Yutian Chen
Simon Osindero
João Carreira
Razvan Pascanu
ViT
177
2
0
18 Dec 2024
Evidential Deep Learning for Probabilistic Modelling of Extreme Storm
  Events
Evidential Deep Learning for Probabilistic Modelling of Extreme Storm Events
Ayush Khot
Xihaier Luo
Ai Kagawa
Shinjae Yoo
EDLBDL
332
3
0
18 Dec 2024
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal
  Large Language Models
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
Cong Wei
Yujie Zhong
Haoxian Tan
Yingsen Zeng
Yong Liu
Zheng Zhao
Yujiu Yang
MLLMVLMVOS
293
12
0
18 Dec 2024
Data-Efficient Inference of Neural Fluid Fields via SciML Foundation
  Model
Data-Efficient Inference of Neural Fluid Fields via SciML Foundation Model
Yuqiu Liu
Jingxuan Xu
Mauricio Soroco
Yunchao Wei
Wuyang Chen
AI4CE
419
2
0
18 Dec 2024
Navigating limitations with precision: A fine-grained ensemble approach
  to wrist pathology recognition on a limited x-ray dataset
Navigating limitations with precision: A fine-grained ensemble approach to wrist pathology recognition on a limited x-ray datasetInternational Conference on Information Photonics (ICIP), 2024
Ammar Ahmed
Ali Shariq Imran
M. Ullah
Zenun Kastrati
Sher Muhammad Daudpota
322
3
0
18 Dec 2024
Distilled Pooling Transformer Encoder for Efficient Realistic Image
  Dehazing
Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing
Le-Anh Tran
Dong-Chul Park
ViT
238
8
0
18 Dec 2024
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for
  Image Manipulation Localization
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationAAAI Conference on Artificial Intelligence (AAAI), 2024
Xuekang Zhu
Xiaochen Ma
Lei Su
Zhuohang Jiang
Bo Du
Xiwen Wang
Zeyu Lei
Wentao Feng
Chi-Man Pun
Jizhe Zhou
AI4CE
289
25
0
18 Dec 2024
Robust Tracking via Mamba-based Context-aware Token Learning
Robust Tracking via Mamba-based Context-aware Token LearningAAAI Conference on Artificial Intelligence (AAAI), 2024
Jinxia Xie
Bineng Zhong
Qihua Liang
Ning Li
Zhiyi Mo
Shuxiang Song
Mamba
249
19
0
18 Dec 2024
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
Massimiliano Viola
Kevin Qu
Nando Metzger
Bingxin Ke
Alexander Becker
Konrad Schindler
Anton Obukhov
VLMMDE
773
20
0
18 Dec 2024
Previous
123...383940...169170171
Next
Page 39 of 171
Pageof 171