Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.14030
Cited By
v1
v2 (latest)
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
IEEE International Conference on Computer Vision (ICCV), 2021
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Github (14835★)
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 8,509 papers shown
Neural Collapse Inspired Knowledge Distillation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shuoxi Zhang
Zijian Song
Kun He
431
1
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
342
2
0
16 Dec 2024
HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Sucheng Ren
Xiaomeng Li
MedIm
273
11
0
16 Dec 2024
Multilabel Classification for Lung Disease Detection: Integrating Deep Learning and Natural Language Processing
Maria Efimovich
Jayden Lim
Vedant Mehta
Ethan Poon
233
2
0
16 Dec 2024
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Yunxiang Fu
Meng Lou
Yizhou Yu
667
21
0
16 Dec 2024
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Peirong Zhang
Lianwen Jin
201
2
0
16 Dec 2024
Towards Context-aware Convolutional Network for Image Restoration
Knowledge-Based Systems (KBS), 2024
Fangwei Hao
Ji Du
Weiyun Liang
Jing Xu
Xiaoxuan Xu
SupR
327
2
0
15 Dec 2024
Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation
Yujie Zhang
Bingyang Cui
Qi Yang
Zhu Li
Yiling Xu
378
5
0
15 Dec 2024
Mask Enhanced Deeply Supervised Prostate Cancer Detection on B-mode Micro-Ultrasound
Lichun Zhang
Steve Zhou
Moon Hyung Choi
Jeong Hoon Lee
Shengtian Sang
...
Wei Shao
Ahmed N. El Kaffas
Richard E. Fan
G. Sonn
M. Rusu
MedIm
267
0
0
14 Dec 2024
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Mustafa Munir
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
341
6
0
14 Dec 2024
Video Representation Learning with Joint-Embedding Predictive Architectures
Katrina Drozdov
Ravid Shwartz-Ziv
Yann LeCun
AI4TS
365
7
0
14 Dec 2024
Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yucong Meng
Zhiwei Yang
Yonghong Shi
Zhijian Song
275
6
0
14 Dec 2024
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
IEEE Transactions on Image Processing (TIP), 2024
Yang Yang
Wenjuan Xi
Luping Zhou
Jinhui Tang
298
7
0
14 Dec 2024
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zhangbin Li
Jinxing Zhou
Jing Zhang
Shengeng Tang
Kun Li
Dan Guo
313
13
0
14 Dec 2024
Memory Efficient Matting with Adaptive Token Routing
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yiheng Lin
Yihan Hu
Chenyi Zhang
Ting Liu
Xiaochao Qu
Luoqi Liu
Yao Zhao
Y. X. Wei
443
0
0
14 Dec 2024
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
1.0K
0
0
14 Dec 2024
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jingyu Zhang
Yilei Wang
Lang Qian
Yang Liu
Zengwen Li
Sudong Jiang
Maolin Liu
Liang Song
442
3
0
14 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
889
7
0
14 Dec 2024
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
Zhangjun Zhou
Yiping Li
Chunlin Zhong
Jianuo Huang
Jialun Pei
He Tang
He Tang
460
0
0
14 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Liang Luo
...
Liang Li
Houcheng Su
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
300
0
0
13 Dec 2024
Coherent 3D Scene Diffusion From a Single RGB Image
Neural Information Processing Systems (NeurIPS), 2024
Manuel Dahnert
Angela Dai
Norman Muller
Matthias Nießner
259
3
0
13 Dec 2024
Deep Learning for Spectrum Prediction in Cognitive Radio Networks: State-of-the-Art, New Opportunities, and Challenges
Guangliang Pan
David K. Y. Yau
Bo Zhou
Qihui Wu
133
7
0
13 Dec 2024
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Computer Vision and Pattern Recognition (CVPR), 2024
Hongjie Wang
Chih-Yao Ma
Yen-Cheng Liu
Ji Hou
Tao Xu
...
Peizhao Zhang
Tingbo Hou
Peter Vajda
N. Jha
Xiaoliang Dai
LMTD
VGen
VLM
DiffM
426
27
0
13 Dec 2024
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
Silin Cheng
Yuanpei Liu
Kai Han
EDL
356
0
0
12 Dec 2024
On the effectiveness of Rotation-Equivariance in U-Net: A Benchmark for Image Segmentation
Robin Ghyselinck
Valentin Delchevalerie
Bruno Dumas
Benoit Frénay
347
0
0
12 Dec 2024
Cross-View Completion Models are Zero-shot Correspondence Estimators
Computer Vision and Pattern Recognition (CVPR), 2024
Honggyu An
J. Kim
Seonghoon Park
Jaewoo Jung
Jisang Han
Sunghwan Hong
Seungryong Kim
3DV
351
18
0
12 Dec 2024
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ali Mollaahmadi Dehaghi
Reza Razavi
Mohammad Moshirpour
328
3
0
12 Dec 2024
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation
Neural Information Processing Systems (NeurIPS), 2024
Jiaming Lv
Haoyuan Yang
P. Li
395
16
0
11 Dec 2024
PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Kartik Narayan
Nithin Gopalakrishnan Nair
Jennifer Xu
Rama Chellappa
Vishal M. Patel
CVBM
CLL
240
5
0
10 Dec 2024
PVP: Polar Representation Boost for 3D Semantic Occupancy Prediction
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Yujing Xue
Jiaxiang Liu
Jiawei Du
Qiufeng Wang
MDE
421
0
0
10 Dec 2024
Repetitive Action Counting with Hybrid Temporal Relation Modeling
IEEE transactions on multimedia (IEEE TMM), 2024
Kun Li
Xinge Peng
Dan Guo
Xun Yang
Meng Wang
246
29
0
10 Dec 2024
Toward Non-Invasive Diagnosis of Bankart Lesions with Deep Learning
Sahil Sethi
Sai Reddy
Mansi Sakarvadia
Jordan Serotte
Darlington Nwaudo
Nicholas Maassen
Lewis Shi
214
5
0
09 Dec 2024
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
Computer Vision and Pattern Recognition (CVPR), 2024
Xinyu Wang
Yu Jin
Wentao Wu
Wei Zhang
Lin Zhu
Bo Jiang
Yonghong Tian
270
18
0
09 Dec 2024
Bridging the Divide: Reconsidering Softmax and Linear Attention
Neural Information Processing Systems (NeurIPS), 2024
Dongchen Han
Yifan Pu
Zhuofan Xia
Yizeng Han
Xuran Pan
Xiu Li
Jiwen Lu
Shiji Song
Gao Huang
289
35
0
09 Dec 2024
A Lightweight U-like Network Utilizing Neural Memory Ordinary Differential Equations for Slimming the Decoder
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Quansong He
Xiaojun Yao
Jun Wu
Zhang Yi
Tao He
241
6
0
09 Dec 2024
MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery
Qinfeng Zhu
Yuan-Sheng Fang
Lei Fan
Mamba
180
1
0
09 Dec 2024
A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Ruoxin Wang
Tianyi Tang
Haiming Du
Yuxuan Cheng
Yu Wang
Lingjie Yang
Xiaohui Duan
Yunfang Yu
Yu Zhou
Donglong Chen
315
1
0
08 Dec 2024
Epistemic Uncertainty for Generated Image Detection
Jun Nie
Yonggang Zhang
Tongliang Liu
Yiu-ming Cheung
Bo Han
Xinmei Tian
UQCV
309
1
0
08 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
287
0
0
06 Dec 2024
ARTeFACT: Benchmarking Segmentation Models on Diverse Analogue Media Damage
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
D. Ivanova
Marco Aversa
Paul Henderson
John Williamson
265
0
0
05 Dec 2024
Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Yuzhen Du
Teng Hu
Jing Zhang
Ran Yi Chengming Xu
Xiaobin Hu
Kai WU
Donghao Luo
Yun Wang
Lizhuang Ma
372
1
0
05 Dec 2024
Frequency-Adaptive Low-Latency Object Detection Using Events and Frames
Haitian Zhang
Xiangyuan Wang
Chang Xu
Xinya Wang
Fang Xu
Huai Yu
Lei Yu
Wen Yang
ObjD
515
0
0
05 Dec 2024
HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution
Computer Vision and Pattern Recognition (CVPR), 2024
Yuxuan Jiang
Ho Man Kwan
Tianhao Peng
Ge Gao
Fan Zhang
Xiaoqing Zhu
Joel Sole
David Bull
SupR
291
9
0
04 Dec 2024
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Youssof Nawar
Nouran Soliman
Moustafa Wassel
Mohamed ElHabebe
Noha Adly
Marwan Torki
Ahmed Elmassry
Islam Ahmed
MedIm
293
0
0
04 Dec 2024
Gesture Classification in Artworks Using Contextual Image Features
Azhar Hussian
Mathias Zinnen
Thi My Hang Tran
Andreas Maier
Vincent Christlein
304
1
0
04 Dec 2024
End-to-end Triple-domain PET Enhancement: A Hybrid Denoising-and-reconstruction Framework for Reconstructing Standard-dose PET Images from Low-dose PET Sinograms
C. Jiang
Mianxin Liu
Kaicong Sun
Dinggang Shen
MedIm
327
1
0
04 Dec 2024
Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Maxime Fontana
Michael W. Spratling
Miaojing Shi
MoE
VLM
353
0
0
04 Dec 2024
CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Runjian Chen
Han Zhang
Avinash Ravichandran
Wenqi Shao
Alex Wong
Ping Luo
Ping Luo
3DPC
455
1
0
04 Dec 2024
Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning
Xiping Hu
C. J. Li
QiXiang Ye
Tong Zhang
MoE
258
1
0
03 Dec 2024
ProbPose: A Probabilistic Approach to 2D Human Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2024
Miroslav Purkrábek
Jiri Matas
3DH
338
7
0
03 Dec 2024
Previous
1
2
3
...
39
40
41
...
169
170
171
Next