2111.09883
Cited By
v1
v2 (latest)
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
ViT
Github (14834★)
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 931 papers shown
Title
MamT$^4$: Multi-view Attention Networks for Mammography Cancer Classification
Annual International Computer Software and Applications Conference (COMPSAC), 2024
Alisher Ibragimov
Sofya Senotrusova
Arsenii Litvinov
E. Ushakov
E. Karpulevich
Yury Markin
137
1
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
110
0
0
31 Oct 2024
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jia Fu
Xiao Zhang
Sepideh Pashami
Fatemeh Rahimian
Anders Holst
DiffM
AAML
281
1
0
31 Oct 2024
Context-Aware Token Selection and Packing for Enhanced Vision Transformer
Tianyi Zhang
B. Li
Jae-sun Seo
Yu Cao
175
1
0
31 Oct 2024
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Adrian Iordache
B. Alexe
Radu Tudor Ionescu
299
2
0
29 Oct 2024
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection
Jia Wei
Yun Li
Xiaomao Fan
Wenjun Ma
Meiyu Qiu
Hongyu Chen
Wenbin Lei
139
0
0
29 Oct 2024
Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust
Xiaofeng Lei
Yih-Chung Tham
Jocelyn Hui Lin Goh
Yangqin Feng
Yang Bai
Z. Soh
Rick Siow Mong Goh
Xinxing Xu
Yong Liu
Ching-Yu Cheng
87
0
0
27 Oct 2024
PESFormer: Boosting Macro- and Micro-expression Spotting with Direct Timestamp Encoding
Wang-Wang Yu
Kai-Fu Yang
Xiangrui Hu
Jingwen Jiang
Hong-Mei Yan
Yong-Jie Li
180
0
0
24 Oct 2024
FIPER: Factorized Features for Robust Image Super-Resolution and Compression
Yang-Che Sun
Cheng Yu Yeo
Ernie Chu
Jun-Cheng Chen
Yu-Lun Liu
SupR
567
0
0
23 Oct 2024
LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices
Chuntao Ding
Xu Cao
Jianhang Xie
Linlin Fan
Shangguang Wang
Zhichao Lu
295
11
0
22 Oct 2024
Test-time Adversarial Defense with Opposite Adversarial Path and High Attack Time Cost
Cheng-Han Yeh
Kuanchun Yu
Chun-Shien Lu
DiffM
AAML
524
1
0
22 Oct 2024
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?
Neural Information Processing Systems (NeurIPS), 2024
Lingao Xiao
Yang He
DD
336
11
0
21 Oct 2024
D-SarcNet: A Dual-stream Deep Learning Framework for Automatic Analysis of Sarcomere Structures in Fluorescently Labeled hiPSC-CMs
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Huyen Le
Khiet Dang
N. H. Nguyen
Mai Tran
Hieu Pham
42
0
0
19 Oct 2024
Towards Zero-Shot Camera Trap Image Categorization
Jiří Vyskočil
Lukas Picek
VLM
120
3
0
16 Oct 2024
Transformer based super-resolution downscaling for regional reanalysis: Full domain vs tiling approaches
Antonio Pérez
Mario Santa Cruz
Daniel San Martín
José Manuel Gutiérrez
97
3
0
16 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Asian Conference on Computer Vision (ACCV), 2024
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
204
9
0
11 Oct 2024
HorGait: A Hybrid Model for Accurate Gait Recognition in LiDAR Point Cloud Planar Projections
IEEE Access, 2024
Jiaxing Hao
Yanxi Wang
Zhigang Chang
Hongmin Gao
Zihao Cheng
Chen Wu
Xin Zhao
Peiye Fang
Rachmat Muwardi
ViT
251
0
0
11 Oct 2024
Hespi: A pipeline for automatically detecting information from hebarium specimen sheets
Robert Turnbull
Emily Fitzgerald
Karen Thompson
Joanne L. Birch
113
3
0
11 Oct 2024
When Graph meets Multimodal: Benchmarking and Meditating on Multimodal Attributed Graphs Learning
Hao Yan
Xuefei Liu
Zhigang Yu
Jun Yin
Ruochen Liu
Peiyan Zhang
Weihao Han
Mingzheng Li
Zhengxin Zeng
182
0
0
11 Oct 2024
IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior
Jingyi Xu
Siwei Tu
Weidong Yang
Shuhao Li
Keyi Liu
Yeqi Luo
Lipeng Ma
Ben Fei
Junlin Wu
DiffM
AI4Cl
162
2
0
10 Oct 2024
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
Frontiers in Plant Science (Front. Plant Sci.), 2024
Ang He
Ximei Wu
Xing Xu
Jing Chen
Xiaobin Guo
Sheng Xu
159
5
0
09 Oct 2024
GLRT-Based Metric Learning for Remote Sensing Object Retrieval
Linping Zhang
Yu Liu
Xueqian Wang
Gang Li
You He
198
0
0
08 Oct 2024
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Fang Gao
XueTao Li
Jiabao Wang
Shengheng Ma
Jun Yu
112
0
0
08 Oct 2024
Rank Matters: Understanding and Defending Model Inversion Attacks via Low-Rank Feature Filtering
Hongyao Yu
Yixiang Qiu
Hao Fang
Tianqu Zhuang
Sijin Yu
Shu-Tao Xia
Ke Xu
K. Xu
259
4
0
08 Oct 2024
MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization
Yunlong Zhao
Xiaoheng Deng
Xiu Su
Hongyan Xu
Xiuxing Li
Yijing Liu
Shan You
FedML
DD
240
1
0
07 Oct 2024
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
European Conference on Computer Vision (ECCV), 2024
Chiao-An Yang
Ziwei Liu
Raymond A. Yeh
144
1
0
01 Oct 2024
CBAM-SwinT-BL: Small Rail Surface Defect Detection Method Based on Swin Transformer with Block Level CBAM Enhancement
IEEE Access, 2024
Jiayi Zhao
Alison Wun-lam Yeung
Ali Muhammad
Songjiang Lai
Vincent To-Yee NG
209
10
0
30 Sep 2024
Universal Medical Image Representation Learning with Compositional Decoders
Kaini Wang
Ling Yang
Siping Zhou
Guangquan Zhou
Wentao Zhang
Bin Cui
Shuo Li
SSL
MedIm
270
1
0
30 Sep 2024
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation
Neural Information Processing Systems (NeurIPS), 2024
Xu Zhang
Peiyao Guo
Ming Lu
Zhan Ma
237
8
0
29 Sep 2024
Exploring Token Pruning in Vision State Space Models
Neural Information Processing Systems (NeurIPS), 2024
Zheng Zhan
Zhenglun Kong
Yifan Gong
Yushu Wu
Zichong Meng
...
Xuan Shen
Stratis Ioannidis
Wei Niu
Pu Zhao
Yanzhi Wang
367
22
0
27 Sep 2024
Cottention: Linear Transformers With Cosine Attention
Gabriel Mongaras
Trevor Dohm
Eric C. Larson
164
2
0
27 Sep 2024
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting
International Conference on Learning Representations (ICLR), 2024
Nian Ran
Peng Xiao
Yue Wang
Wesley Shi
Jianxin Lin
Qi Meng
Richard Allmendinger
AI4Cl
382
4
0
27 Sep 2024
MALPOLON: A Framework for Deep Species Distribution Modeling
Théo Larcher
Lukás Picek
Benjamin Deneu
Titouan Lorieul
Maximilien Servajean
Alexis Joly
GP
129
1
0
26 Sep 2024
HydraViT: Stacking Heads for a Scalable ViT
Neural Information Processing Systems (NeurIPS), 2024
Janek Haberer
A. Hojjat
Olaf Landsiedel
191
6
0
26 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
IEEE International Conference on Robotics and Automation (ICRA), 2024
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
911
5
0
23 Sep 2024
Fake It till You Make It: Curricular Dynamic Forgery Augmentations towards General Deepfake Detection
European Conference on Computer Vision (ECCV), 2024
Yuzhen Lin
Wentang Song
Bin Li
Yuezun Li
Jiangqun Ni
Han Chen
Qiushi Li
176
33
0
22 Sep 2024
Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification
Fatema Jannat
Sina Gholami
Jennifer I. Lim
Theodore Leng
Minhaj Nur Alam
Hamed Tabkhi
121
4
0
17 Sep 2024
InfoDisent: Explainability of Image Classification Models by Information Disentanglement
Łukasz Struski
Dawid Rymarczyk
Jacek Tabor
344
2
0
16 Sep 2024
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
International Conference on 3D Vision (3DV), 2024
Vitor Campagnolo Guizilini
P. Tokmakov
Achal Dave
Rares Andrei Ambrus
DiffM
268
5
0
15 Sep 2024
LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation
Qiyuan Wang
Shang Zhao
Zikang Xu
S Kevin Zhou
384
0
0
14 Sep 2024
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
Asian Conference on Computer Vision (ACCV), 2024
Denis Zavadski
Damjan Kalšan
Carsten Rother
DiffM
MDE
281
8
0
13 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
518
8
0
12 Sep 2024
Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU
Zhenyu Ning
Jieru Zhao
Qihao Jin
Wenchao Ding
Minyi Guo
61
17
0
11 Sep 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
International Conference on Machine Learning and Applications (ICMLA), 2024
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
324
0
0
10 Sep 2024
Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Fan Zhang
Lingling Li
Licheng Jiao
Xu Liu
Fang Liu
Shuyuan Yang
B. Hou
ObjD
245
1
0
09 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Neural Information Processing Systems (NeurIPS), 2024
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
269
9
0
06 Sep 2024
SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation
Yi Tian
Juan Andrade-Cetto
179
2
0
06 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
311
3
0
04 Sep 2024
Cross-domain Multi-step Thinking: Zero-shot Fine-grained Traffic Sign Recognition in the Wild
Knowledge-Based Systems (KBS), 2024
Yaozong Gan
Guang Li
Ren Togo
Keisuke Maeda
Takahiro Ogawa
Miki Haseyama
287
1
0
03 Sep 2024
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation
Alberto Bacchin
Davide Allegro
Stefano Ghidoni
Emanuele Menegatti
190
1
0
02 Sep 2024