Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.14030
Cited By
v1
v2 (latest)
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Github (14835★)
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 8,530 papers shown
Particle Trajectory Representation Learning with Masked Point Modeling
Sam Young
Yeon-jae Jwa
Kazuhiro Terao
3DPC
346
3
0
04 Feb 2025
MATCNN: Infrared and Visible Image Fusion Method Based on Multi-scale CNN with Attention Transformer
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2025
Jingjing Liu
Li Zhang
Xiaoyang Zeng
Wanquan Liu
Jing Zhang
287
11
0
04 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
650
4
0
04 Feb 2025
A Framework for Double-Blind Federated Adaptation of Foundation Models
Nurbek Tastan
Karthik Nandakumar
FedML
322
0
0
03 Feb 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
International Conference on Learning Representations (ICLR), 2025
Can Jin
Ying Li
Mingyu Zhao
Shiyu Zhao
Zhenting Wang
Xiaoxiao He
Ligong Han
Tong Che
Dimitris N. Metaxas
VPVLM
VLM
1.1K
8
0
02 Feb 2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
International Conference on Learning Representations (ICLR), 2025
Divya J. Bajpai
M. Hanawal
426
3
0
02 Feb 2025
Contrastive Forward-Forward: A Training Algorithm of Vision Transformer
Neural Networks (NN), 2025
Hossein Aghagolzadeh
Mehdi Ezoji
ViT
464
2
0
01 Feb 2025
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
Towards Autonomous Robotic Systems (TAROS), 2025
Jingming Xia
Guanqun Cao
Guang Ma
Yiben Luo
Qinzhao Li
John Oyekan
MDE
313
0
0
01 Feb 2025
Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion
Binchi Zhang
Zaiyi Zheng
Zhengzhang Chen
Wenlin Yao
667
6
0
01 Feb 2025
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Shenghao Fu
Q. Yang
Qijie Mo
Junkai Yan
Xihan Wei
Jingke Meng
Xiaohua Xie
Wei-Shi Zheng
MLLM
ObjD
VLM
453
33
0
31 Jan 2025
Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation
VISIGRAPP (VISIGRAPP), 2025
Kevin Qiu
Dimitri Bulatov
Dorota Iwaszczuk
3DPC
269
2
0
30 Jan 2025
VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback
Machine Learning with Applications (MLWA), 2025
Sayeh Gholipour Picha
D. Chanti
A. Caplier
MedIm
338
1
0
29 Jan 2025
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
Yoojin Jang
Junsu Kim
H. Kim
Eun-ki Lee
Eun-sol Kim
Seungryul Baek
Jaejun Yoo
187
0
0
28 Jan 2025
State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu
Nazanin Ahmadi Daryakenari
Qianli Shen
Kenji Kawaguchi
George Karniadakis
Mamba
AI4CE
496
26
0
28 Jan 2025
MultiPDENet: PDE-embedded Learning with Multi-time-stepping for Accelerated Flow Simulation
Qi Wang
Yuan Mi
Jian Shu
Yi Zhang
Ruizhi Chengze
Hongsheng Liu
J. Wen
Hao Sun
AI4CE
302
5
0
28 Jan 2025
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yingying Jiao
Zhigang Wang
Sifan Wu
Shaojing Fan
Zhenguang Liu
Zhuoyue Xu
Zheqi Wu
400
4
0
28 Jan 2025
V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection
Sichao Wang
Chuang Zhang
Ming Yuan
Qing Xu
Lei He
Jianqiang Wang
358
4
0
28 Jan 2025
Collective Intelligence for 2D Push Manipulations with Mobile Robots
IEEE Robotics and Automation Letters (RA-L), 2022
So Kuroki
T. Matsushima
Jumpei Arima
Hiroki Furuta
Yutaka Matsuo
S. Gu
Yujin Tang
447
5
0
28 Jan 2025
Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
IEEE Transactions on Medical Imaging (IEEE TMI), 2025
Chenglong Ma
Zilong Li
Yongqian Li
Jing Han
Junping Zhang
Yi Zhang
Jiannan Liu
Hongming Shan
220
3
0
28 Jan 2025
MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis
Mai A. Shaaban
Adnan Khan
Mohammad Yaqub
LM&MA
422
5
0
28 Jan 2025
Prion-ViT: Prions-Inspired Vision Transformers for Temperature prediction with Specklegrams
Abhishek Sebastian
Pragna R
Sonaa Rajagopal
Muralikrishnan Mani
310
2
0
28 Jan 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjD
VLM
1.0K
0
0
28 Jan 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
International Conference on Learning Representations (ICLR), 2025
Chuanyang Zheng
ViT
397
4
0
26 Jan 2025
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
International Conference on Learning Representations (ICLR), 2025
Jiajie Li
Brian R Quaranto
Chenhui Xu
Ishan Mishra
Ruiyang Qin
Dancheng Liu
Peter C W Kim
Jinjun Xiong
435
2
0
25 Jan 2025
PolaFormer: Polarity-aware Linear Attention for Vision Transformers
International Conference on Learning Representations (ICLR), 2025
Weikang Meng
Yadan Luo
Xin Li
Shihong Deng
Zheng Zhang
1.1K
36
0
25 Jan 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
251
0
0
24 Jan 2025
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
Jakob Krogh Petersen
Valdemar Licht
Mads Nielsen
Asbjørn Munk
VLM
190
0
0
23 Jan 2025
FreEformer: Frequency Enhanced Transformer for Multivariate Time Series Forecasting
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Wenzhen Yue
Yixiao Liu
Xianghua Ying
Bowei Xing
Ruohao Guo
Ji Shi
AI4TS
209
13
0
23 Jan 2025
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Ella Koresh
Ronit D. Gross
Yuval Meir
Yarden Tzach
Tal Halevi
Ido Kanter
ViT
383
8
0
22 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Computer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
847
3
0
21 Jan 2025
Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2
Engineering Reports (ER), 2025
Md. Rakibul Islam
Md. Zahid Hossain
Mustofa Ahmed
Most. Sharmin Sultana Samu
LM&MA
MedIm
239
3
0
21 Jan 2025
Towards Accurate Unified Anomaly Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Wenxin Ma
Qingsong Yao
Xiang Zhang
Zhelong Huang
Zihang Jiang
S. Kevin Zhou
367
11
0
21 Jan 2025
A margin-based replacement for cross-entropy loss
Michael W. Spratling
Heiko H. Schütt
319
0
0
21 Jan 2025
Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
R. Gupta
Shounak Das
Ardhendu Sekhar
Amit Sethi
163
0
0
21 Jan 2025
Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis
Annual International Computer Software and Applications Conference (COMPSAC), 2025
Hongjun Liu
Changwei Song
Jiaqi Qiang
Jianqiang Li
Hui Pan
Lin Lu
Xiao Long
Qing Zhao
Jiuzuo Huang
Shi Chen
MedIm
84
1
0
21 Jan 2025
TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Minsoo Khang
Teakgyu Hong
LMTD
314
6
0
21 Jan 2025
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
Branislava Jankovic
Sabina Jangirova
Waseem Ullah
Latif U. Khan
Mohsen Guizani
391
5
0
21 Jan 2025
A Survey on Memory-Efficient Transformer-Based Model Training in AI for Science
Kaiyuan Tian
Linbo Qiao
Baihui Liu
Gongqingjian Jiang
Shanshan Li
Dongsheng Li
383
0
0
21 Jan 2025
A generalizable 3D framework and model for self-supervised learning in medical imaging
Tony Xu
Sepehr Hosseini
Chris Anderson
Anthony Rinaldi
Rahul G. Krishnan
Anne L. Martel
Maged Goubran
MedIm
343
11
0
20 Jan 2025
Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images
IEEE transactions on multimedia (TMM), 2025
Jiebin Yan
Jiale Rao
Xuelin Liu
Yuming Fang
Yifan Zuo
Weide Liu
181
9
0
20 Jan 2025
ACE: Anatomically Consistent Embeddings in Composition and Decomposition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Ziyu Zhou
Haozhe Luo
M. Taher
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
MedIm
358
0
0
20 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
199
7
0
20 Jan 2025
Elucidating the Design Space of Dataset Condensation
Neural Information Processing Systems (NeurIPS), 2024
Shitong Shao
Zikai Zhou
Huanran Chen
Zhiqiang Shen
DD
755
24
0
20 Jan 2025
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Michael Schwingshackl
Fabio Francisco Oberweger
Markus Murschitz
266
1
0
20 Jan 2025
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
N. Shah
Ayan Kashyap
Shirish S. Karande
Vineet Gandhi
240
1
0
20 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
IEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
780
75
0
17 Jan 2025
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
European Conference on Computer Vision (ECCV), 2024
R. Yasarla
Manish Kumar Singh
Hong Cai
Yunxiao Shi
Jisoo Jeong
Yinhao Zhu
Shizhong Han
Risheek Garrepalli
Fatih Porikli
MDE
516
12
0
17 Jan 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
457
8
0
17 Jan 2025
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
IEEE International Conference on Computer Vision (ICCV), 2023
R. Yasarla
H. Cai
Jisoo Jeong
Y. Shi
Risheek Garrepalli
Fatih Porikli
MDE
601
27
0
17 Jan 2025
Unified Face Matching and Physical-Digital Spoofing Attack Detection
Arun Kunwar
Ajita Rattani
CVBM
AAML
289
0
0
17 Jan 2025
Previous
1
2
3
...
36
37
38
...
169
170
171
Next
Page 37 of 171
Page
of 171
Go