Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.14030
Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 1,659 papers shown
Title
Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities
Jueqing Lu
Yuanyuan Qi
Xiaohao Yang
Shujie Zhou
Lan Du
14
0
0
13 May 2025
H
3
^{\mathbf{3}}
3
DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
Yiyang Lu
Yufeng Tian
Zhecheng Yuan
X. Wang
Pu Hua
Zhengrong Xue
Huazhe Xu
16
0
0
12 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
H. Zhang
Hongjie Zhang
Y. Huang
Z. Ling
Yu Qiao
Limin Wang
Y. Wang
AI4TS
21
0
0
10 May 2025
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
Jingguo Qu
Xinyang Han
Man-Lik Chui
Yao Pu
Simon Takadiyi Gunda
...
Jing Qin
Ann Dorothy King
Winnie Chiu-Wing Chu
J. Cai
Michael Tin-Cheung Ying
21
0
0
09 May 2025
A review of advancements in low-light image enhancement using deep learning
Fangxue Liu
Lei Fan
37
0
0
09 May 2025
Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet
Kodai Hirata
Tsuyoshi Okita
ViT
36
0
0
09 May 2025
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
29
0
0
09 May 2025
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Yiming Niu
Jinliang Deng
L. Zhang
Zimu Zhou
Yongxin Tong
AI4TS
21
0
0
09 May 2025
Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition
Weiyi Zhang
Peranut Chotcomwongse
Yinwen Li
Pusheng Xu
Ruijie Yao
...
Shujun Wang
Yalin Zheng
M. He
Danli Shi
Paisan Ruamviboonsuk
13
0
0
09 May 2025
SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal
Wenyang Liu
Jianjun Gao
Kim-Hui Yap
36
0
0
08 May 2025
Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection
Tharindu Fernando
Clinton Fookes
S. Sridharan
Simon Denman
39
0
0
08 May 2025
Adaptive Contextual Embedding for Robust Far-View Borehole Detection
Xuesong Liu
Tianyu Hao
Emmett J. Ientilucci
34
0
0
08 May 2025
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
Tom Sander
Moritz Tenthoff
Kay Wohlfarth
Christian Wöhler
19
0
0
08 May 2025
A Simple Detector with Frame Dynamics is a Strong Tracker
Chenxu Peng
C. Wang
Minrui Zou
Danyang Li
Z. Yang
Yimian Dai
Ming-Ming Cheng
Xiang Li
40
0
0
08 May 2025
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
71
0
0
08 May 2025
FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration
Ying Zhang
Shuai Guo
Chenxi Sun
Yuchen Zhu
Jinhai Xiang
MedIm
45
0
0
08 May 2025
GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation
Chengwei Ye
H. Zhang
Yufei Lin
Kangsheng Wang
Linuo Xu
Shuyan Liu
MedIm
50
0
0
08 May 2025
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Kun Peng
Chaodong Tong
Cong Cao
Hao Peng
Q. Li
Guanlin Wu
Lei Jiang
Yanbing Liu
Philip S. Yu
LMTD
43
0
0
08 May 2025
CM1 - A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
Fabian Wolf
Oliver Tüselmann
Arthur Matei
Lukas Hennies
Christoph Rass
Gernot A. Fink
46
0
0
07 May 2025
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective
Songsong Duan
Xi Yang
Nannan Wang
Xinbo Gao
53
0
0
07 May 2025
TS-SNN: Temporal Shift Module for Spiking Neural Networks
Kairong Yu
Tianqing Zhang
Qi Xu
Gang Pan
Hongwei Wang
60
0
0
07 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
Image Restoration via Multi-domain Learning
Xingyu Jiang
Ning Gao
Xiuhui Zhang
Hongkun Dou
Shaowen Fu
Xiaoqing Zhong
H. Li
Yue Deng
ViT
29
0
0
07 May 2025
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
31
0
0
07 May 2025
Vision Graph Prompting via Semantic Low-Rank Decomposition
Zixiang Ai
Zichen Liu
Jiahuan Zhou
48
0
0
07 May 2025
Tetrahedron-Net for Medical Image Registration
Jinhai Xiang
Shuai Guo
Qianru Han
Dantong Shi
Xinwei He
Xiang Bai
3DV
52
0
0
07 May 2025
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
Xiao Wang
Jong Youl Choi
Takuya Kurihaya
Isaac Lyngaas
Hong-Jun Yoon
...
Dali Wang
Peter Thornton
Prasanna Balaprakash
M. Ashfaq
Dan Lu
26
0
0
07 May 2025
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
Evangelia Christodoulou
Annika Reinke
Pascaline Andrè
Patrick Godau
P. Kalinowski
...
Amber L. Simpson
A. Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
33
0
0
07 May 2025
Balancing Accuracy, Calibration, and Efficiency in Active Learning with Vision Transformers Under Label Noise
Moseli Motsóehli
Hope Mogale
Kyungim Baek
38
0
0
07 May 2025
Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision
Linhan Cao
Wei Sun
Kaiwei Zhang
Yicong Peng
Guangtao Zhai
Xiongkuo Min
50
0
0
06 May 2025
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges
Hao Xu
Arbind Agrahari Baniya
Sam Well
Mohamed Reda Bouadjenek
Richard Dazeley
S. Aryal
AI4TS
22
0
0
06 May 2025
Towards Efficient Benchmarking of Foundation Models in Remote Sensing: A Capabilities Encoding Approach
Pierre Adorni
M. Pham
Stéphane May
Sébastien Lefèvre
37
0
0
06 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
54
0
0
06 May 2025
DCS-ST for Classification of Breast Cancer Histopathology Images with Limited Annotations
Liu Suxing
Byungwon Min
27
0
0
06 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
55
0
0
06 May 2025
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
Andrew Caunes
T. Chateau
Vincent Frémont
3DPC
24
0
0
06 May 2025
Database-Agnostic Gait Enrollment using SetTransformers
Nicoleta Basoc
Adrian Cosma
Andy Catruna
Emilian Radoi
SLR
24
0
0
05 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
D. Jiang
Mengmeng Wang
Liuzhuozheng Li
Lei Zhang
Haoyu Wang
Wei Wei
Guang Dai
Yanning Zhang
Jingdong Wang
DiffM
44
0
0
05 May 2025
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
12
0
0
05 May 2025
Token Coordinated Prompt Attention is Needed for Visual Prompting
Zichen Liu
Xu Zou
Gang Hua
Jiahuan Zhou
26
0
0
05 May 2025
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Volodymyr Havrylov
Haiwen Huang
Dan Zhang
Andreas Geiger
46
0
0
04 May 2025
Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Y. Zhang
Zeqiang Lai
T. Zhang
Ying Fu
Chenghu Zhou
40
1
0
04 May 2025
Always Skip Attention
Yiping Ji
Hemanth Saratchandran
Peyman Moghaddam
Simon Lucey
55
0
0
04 May 2025
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images
Siddharth Kothari
Srinivasan Murali
Sankalp Kothari
Ujjwal Verma
Jaya Sreevalsan-Nair
35
0
0
03 May 2025
Accelerating Deep Neural Network Training via Distributed Hybrid Order Optimization
Shunxian Gu
Chaoqun You
Bangbang Ren
Lailong Luo
Junxu Xia
Deke Guo
29
0
0
02 May 2025
CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking
Vladimir Somers
Baptiste Standaert
Victor Joos
Alexandre Alahi
Christophe De Vleeschouwer
VOT
49
0
0
02 May 2025
When Dynamic Data Selection Meets Data Augmentation
S. M. I. Simon X. Yang
Peng Ye
F. Shen
Dongzhan Zhou
22
0
0
02 May 2025
Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction
Changjun Li
Runqing Jiang
Zhuo Song
Pengpeng Yu
Ye Zhang
Yulan Guo
MQ
49
0
0
01 May 2025
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Muyi Bao
Shuchang Lyu
Zhaoyang Xu
Huiyu Zhou
Jinchang Ren
Shiming Xiang
X. Li
Guangliang Cheng
Mamba
72
0
0
01 May 2025
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
Tam Minh Nguyen
Ngoc N. Tran
Khai Nguyen
Richard G. Baraniuk
MoE
59
0
0
01 May 2025
1
2
3
4
...
32
33
34
Next