2111.09883
Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
ViT
Github (14834★)
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 931 papers shown
Title
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
P. Singh
Jacopo Cirrone
SSL
221
0
0
27 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
351
4
0
25 Jan 2023
Connecting metrics for shape-texture knowledge in computer vision
Tiago Gaspar Oliveira
Tiago Marques
Arlindo L. Oliveira
81
0
0
25 Jan 2023
ClimaX: A foundation model for weather and climate
International Conference on Machine Learning (ICML), 2023
Tung Nguyen
Johannes Brandstetter
Ashish Kapoor
Jayesh K. Gupta
Aditya Grover
AI4Cl
AI4CE
537
360
0
24 Jan 2023
Zorro: the masked multimodal transformer
Adrià Recasens
Jason Lin
João Carreira
Drew Jaegle
Luyu Wang
...
Pauline Luc
Antoine Miech
Lucas Smaira
Ross Hemsley
Andrew Zisserman
207
23
0
23 Jan 2023
Autonomous Rendezvous with Non-cooperative Target Objects with Swarm Chasers and Observers
Trupti Mahendrakar
Steven Holmberg
A. Ekblad
Emma Conti
Ryan T. White
M. Wilde
Isaac Silver
65
8
0
22 Jan 2023
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Zhiqi Lin
Youshan Miao
Guodong Liu
Xiaoxiang Shi
Quanlu Zhang
...
Xu Cao
Cheng-Wu Li
Mao Yang
Lintao Zhang
Lidong Zhou
111
7
0
21 Jan 2023
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Zhijian Liu
Xinyu Yang
Haotian Tang
Shang Yang
Song Han
236
102
0
20 Jan 2023
CSwin2SR: Circular Swin2SR for Compressed Image Super-Resolution
International Conference on Artificial Intelligence Circuits and Systems (ICAICS), 2023
Honggui Li
M. Trocan
Mohamad Sawan
Dimitri Galayko
107
4
0
20 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Computer Vision and Pattern Recognition (CVPR), 2023
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
140
14
0
17 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
504
329
0
13 Jan 2023
1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track
Yilu Guo
Xing-Jian Shi
Weijie Chen
Shicai Yang
Di Xie
Shiliang Pu
Yueting Zhuang
3DGS
130
1
0
12 Jan 2023
Vision Transformers Are Good Mask Auto-Labelers
Computer Vision and Pattern Recognition (CVPR), 2023
Shiyi Lan
Xitong Yang
Zhiding Yu
Zuxuan Wu
J. Álvarez
Anima Anandkumar
ISeg
ViT
MedIm
184
23
0
10 Jan 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
IEEE International Conference on Computer Vision (ICCV), 2023
Jia Ning
Chen Li
Zheng Zhang
Zigang Geng
Jingdong Sun
Kun He
Han Hu
286
58
0
05 Jan 2023
Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution
IEEE International Conference on Data Engineering (ICDE), 2023
Yan Li
Xin Lu
Haoyi Xiong
Jian Tang
Jian Su
Bo Jin
Dejing Dou
AI4TS
155
39
0
05 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Computer Vision and Pattern Recognition (CVPR), 2023
Sucheng Ren
Fangyun Wei
Zheng Zhang
Han Hu
280
48
0
03 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
IEEE International Conference on Computer Vision (ICCV), 2023
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
287
188
0
03 Jan 2023
Edge Enhanced Image Style Transfer via Transformers
International Conference on Multimedia Retrieval (ICMR), 2023
Chi Zhang
Jun Yang
Zaiyan Dai
Peng-Xia Cao
169
15
0
02 Jan 2023
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
195
14
0
29 Dec 2022
Local Learning on Transformers via Feature Reconstruction
P. Pathak
Jingwei Zhang
Dimitris Samaras
ViT
279
6
0
29 Dec 2022
Boosting Out-of-Distribution Detection with Multiple Pre-trained Models
Feng Xue
Zi He
Chuanlong Xie
Falong Tan
Zhenguo Li
OODD
320
7
0
24 Dec 2022
Reversible Column Networks
International Conference on Learning Representations (ICLR), 2022
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
242
85
0
22 Dec 2022
SLGTformer: An Attention-Based Approach to Sign Language Recognition
Neil Song
Yu Xiang
SLR
170
7
0
21 Dec 2022
Universal Object Detection with Large Vision Model
International Journal of Computer Vision (IJCV), 2022
Feng-Huei Lin
Wenze Hu
Yaowei Wang
Yonghong Tian
Guangming Lu
Fanglin Chen
Yong-mei Xu
Xiaoyu Wang
VLM
ObjD
271
9
0
19 Dec 2022
Analysis and application of multispectral data for water segmentation using machine learning
Shubham Gupta
D. Uma
R. Hebbar
133
1
0
16 Dec 2022
Attentive Mask CLIP
IEEE International Conference on Computer Vision (ICCV), 2022
Yifan Yang
Weiquan Huang
Yixuan Wei
Houwen Peng
Xinyang Jiang
...
Fangyun Wei
Yin Wang
Han Hu
Lili Qiu
Yuqing Yang
CLIP
VLM
151
32
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
326
249
0
15 Dec 2022
FlexiViT: One Model for All Patch Sizes
Computer Vision and Pattern Recognition (CVPR), 2022
Lucas Beyer
Pavel Izmailov
Alexander Kolesnikov
Mathilde Caron
Simon Kornblith
Xiaohua Zhai
Matthias Minderer
Michael Tschannen
Ibrahim Alabdulmohsin
Filip Pavetić
VLM
351
135
0
15 Dec 2022
Vision Transformers are Parameter-Efficient Audio-Visual Learners
Computer Vision and Pattern Recognition (CVPR), 2022
Yan-Bo Lin
Yi-Lin Sung
Jie Lei
Joey Tianyi Zhou
Gedas Bertasius
288
106
0
15 Dec 2022
What do Vision Transformers Learn? A Visual Exploration
Amin Ghiasi
Hamid Kazemi
Eitan Borgnia
Steven Reich
Manli Shu
Micah Goldblum
A. Wilson
Tom Goldstein
ViT
239
77
0
13 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Zhang
Guoli Song
Jie Chen
260
1
0
10 Dec 2022
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Jishnu Mukhoti
Tsung-Yu Lin
Omid Poursaeed
Rui Wang
Ashish Shah
Juil Sock
Ser-Nam Lim
VLM
231
116
0
09 Dec 2022
Spurious Features Everywhere -- Large-Scale Detection of Harmful Spurious Features in ImageNet
IEEE International Conference on Computer Vision (ICCV), 2022
Yannic Neuhaus
Maximilian Augustin
Valentyn Boreiko
Matthias Hein
AAML
283
39
0
09 Dec 2022
Mitigation of Spatial Nonstationarity with Vision Transformers
Computational Geosciences (Comput. Geosci.), 2022
Lei Liu
Javier E. Santos
Maša Prodanović
Michael J. Pyrcz
104
7
0
09 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
International Conference on Machine Learning (ICML), 2022
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM
DiffM
205
59
0
07 Dec 2022
ResFormer: Scaling ViTs with Multi-Resolution Training
Computer Vision and Pattern Recognition (CVPR), 2022
Rui Tian
Zuxuan Wu
Qiuju Dai
Hang-Rui Hu
Yu Qiao
Yu-Gang Jiang
ViT
203
51
0
01 Dec 2022
Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing
Neural Information Processing Systems (NeurIPS), 2022
Nataniel Ruiz
Sarah Adel Bargal
Cihang Xie
Kate Saenko
Stan Sclaroff
ViT
141
7
0
29 Nov 2022
Transferability Estimation Based On Principal Gradient Expectation
Huiyan Qi
Lechao Cheng
Yue Yu
Yue Yu
Haijun Shan
Zunlei Feng
Yueping Jiang
206
4
0
29 Nov 2022
Metal-conscious Embedding for CBCT Projection Inpainting
IEEE International Symposium on Biomedical Imaging (ISBI), 2022
F. Fan
Yangkong Wang
L. Ritschl
R. Biniazan
M. Beister
Björn Kreher
Yixing Huang
Steffen Kappler
Andreas Maier
MedIm
64
1
0
29 Nov 2022
Class Adaptive Network Calibration
Computer Vision and Pattern Recognition (CVPR), 2022
Bingyuan Liu
Jérôme Rony
Adrian Galdran
Jose Dolz
Ismail Ben Ayed
172
13
0
28 Nov 2022
UperFormer: A Multi-scale Transformer-based Decoder for Semantic Segmentation
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2022
Jing Xu
W. Shi
Pan Gao
Zhengwei Wang
Qizhu Li
ViT
87
1
0
25 Nov 2022
DETRs with Collaborative Hybrid Assignments Training
IEEE International Conference on Computer Vision (ICCV), 2022
Zhuofan Zong
Guanglu Song
Yu Liu
ViT
525
501
0
22 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
215
209
0
22 Nov 2022
N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution
Computer Vision and Pattern Recognition (CVPR), 2022
Haram Choi
Jeong-Sik Lee
Jihoon Yang
ViT
191
128
0
21 Nov 2022
Crowdsensing-based Road Damage Detection Challenge (CRDDC-2022)
Deeksha M. Arya
Hiroya Maeda
S. Ghosh
Durga Toshniwal
Hiroshi Omata
Takehiro Kashiyama
Osaka University of Economics
124
64
0
21 Nov 2022
Blind Knowledge Distillation for Robust Image Classification
Timo Kaiser
Lukas Ehmann
Christoph Reinders
Bodo Rosenhahn
NoLa
132
14
0
21 Nov 2022
EHSNet: End-to-End Holistic Learning Network for Large-Size Remote Sensing Image Semantic Segmentation
Wei Chen
Yansheng Li
Bo Dang
Yongjun Zhang
172
3
0
21 Nov 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Computer Vision and Pattern Recognition (CVPR), 2022
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
309
98
0
19 Nov 2022
A survey on knowledge-enhanced multimodal learning
Artificial Intelligence Review (Artif Intell Rev), 2022
Maria Lymperaiou
Giorgos Stamou
437
21
0
19 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
IEEE International Conference on Computer Vision (ICCV), 2022
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
378
151
0
18 Nov 2022