Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09133
Cited By
v1
v2 (latest)
Masked Feature Prediction for Self-Supervised Visual Pre-Training
16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked Feature Prediction for Self-Supervised Visual Pre-Training"
50 / 498 papers shown
Audiovisual Masked Autoencoders
IEEE International Conference on Computer Vision (ICCV), 2022
Mariana-Iuliana Georgescu
Eduardo Fonseca
Radu Tudor Ionescu
Mario Lucic
Cordelia Schmid
Anurag Arnab
SSL
317
56
0
09 Dec 2022
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Xiyang Dai
Lu Yuan
Yu-Gang Jiang
VGen
326
120
0
08 Dec 2022
Group Generalized Mean Pooling for Vision Transformer
ByungSoo Ko
Han-Gyu Kim
Byeongho Heo
Sangdoo Yun
Sanghyuk Chun
Geonmo Gu
Wonjae Kim
ViT
295
3
0
08 Dec 2022
SimVTP: Simple Video Text Pre-training with Masked Autoencoders
Yue Ma
Tianyu Yang
Yin Shan
Xiu Li
169
30
0
07 Dec 2022
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
Computer Vision and Pattern Recognition (CVPR), 2022
A. Piergiovanni
Weicheng Kuo
A. Angelova
ViT
239
69
0
06 Dec 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang
Kunchang Li
Yizhuo Li
Yinan He
Bingkun Huang
...
Junting Pan
Jiashuo Yu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
454
446
0
06 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
329
23
0
05 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
104
16
0
03 Dec 2022
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation
Computer Vision and Pattern Recognition (CVPR), 2022
Lukas Hoyer
Dengxin Dai
Haoran Wang
Luc Van Gool
396
323
0
02 Dec 2022
Multi-scale Transformer Network with Edge-aware Pre-training for Cross-Modality MR Image Synthesis
IEEE Transactions on Medical Imaging (IEEE TMI), 2022
Yonghao Li
Tao Zhou
Kelei He
Yi Zhou
Dinggang Shen
ViT
MedIm
343
49
0
02 Dec 2022
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval
Fangxun Shu
Biaolong Chen
Yue Liao
Shuwen Xiao
Wenyu Sun
Xiaobo Li
Yousong Zhu
Jinqiao Wang
Si Liu
CLIP
185
13
0
02 Dec 2022
Scaling Language-Image Pre-training via Masking
Computer Vision and Pattern Recognition (CVPR), 2022
Yanghao Li
Haoqi Fan
Ronghang Hu
Christoph Feichtenhofer
Kaiming He
CLIP
VLM
375
393
0
01 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Sepehr Sameni
Simon Jenni
Paolo Favaro
312
4
0
30 Nov 2022
Self-Supervised Learning based on Heat Equation
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Xiyang Dai
Lu Yuan
Zicheng Liu
Youzuo Lin
146
6
0
23 Nov 2022
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yunjie Tian
Lingxi Xie
Jihao Qiu
Jianbin Jiao
Yaowei Wang
Qi Tian
Qixiang Ye
ViT
196
20
0
23 Nov 2022
LoopDA: Constructing Self-loops to Adapt Nighttime Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Fengyi Shen
Zador Pataki
A. Gurram
Ziyuan Liu
He Wang
Alois Knoll
144
9
0
21 Nov 2022
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
IEEE International Conference on Computer Vision (ICCV), 2022
Yuanze Lin
Chen Wei
Huiyu Wang
Alan Yuille
Cihang Xie
3DGS
305
17
0
21 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
IEEE International Conference on Computer Vision (ICCV), 2022
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
436
156
0
18 Nov 2022
CAE v2: Context Autoencoder with CLIP Target
Xinyu Zhang
Jiahui Chen
Junkun Yuan
Qiang Chen
Jian Wang
...
Jimin Pi
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
VLM
CLIP
276
25
0
17 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
224
155
0
17 Nov 2022
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
Computer Vision and Pattern Recognition (CVPR), 2022
W. G. C. Bandara
Naman Patel
A. Gholami
Mehdi Nikkhah
M. Agrawal
Vishal M. Patel
239
54
0
16 Nov 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2022
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
326
229
0
16 Nov 2022
Stare at What You See: Masked Image Modeling without Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2022
Hongwei Xue
Shiyang Feng
Hongyang Li
Yu Qiao
Hao Sun
Houqiang Li
Jiebo Luo
183
38
0
16 Nov 2022
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022
Yin-Dong Zheng
Guo Chen
Jiahao Wang
Tong Lu
Liming Wang
178
1
0
16 Nov 2022
Masked Reconstruction Contrastive Learning with Information Bottleneck Principle
Ziwen Liu
Bonan li
Congying Han
Tiande Guo
Xuecheng Nie
SSL
149
2
0
15 Nov 2022
Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future Works
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Chao Tao
Ji Qi
Mingning Guo
Qing Zhu
Haifeng Li
SSL
240
76
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Computer Vision and Pattern Recognition (CVPR), 2022
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
604
898
0
14 Nov 2022
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding
Computer Vision and Pattern Recognition (CVPR), 2022
Zijiao Chen
Jiaxin Qing
Tiange Xiang
Wan Lin Yue
J. Zhou
DiffM
MedIm
336
200
0
13 Nov 2022
Demystify Self-Attention in Vision Transformers from a Semantic Perspective: Analysis and Application
Leijie Wu
Song Guo
Yaohong Ding
Junxiao Wang
Wenchao Xu
Richard Yi Da Xu
Jiewei Zhang
150
3
0
13 Nov 2022
MARLIN: Masked Autoencoder for facial video Representation LearnINg
Computer Vision and Pattern Recognition (CVPR), 2022
Zhixi Cai
Shreya Ghosh
Kalin Stefanov
Abhinav Dhall
Jianfei Cai
Hamid Rezatofighi
Reza Haffari
Munawar Hayat
ViT
CVBM
244
93
0
12 Nov 2022
Attention-based Neural Cellular Automata
Neural Information Processing Systems (NeurIPS), 2022
Mattie Tesfaldet
Derek Nowrouzezahrai
C. Pal
ViT
223
26
0
02 Nov 2022
RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representation from X-Ray Images
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
209
1
0
01 Nov 2022
Changes from Classical Statistics to Modern Statistics and Data Science
Kai Zhang
Shan-Yu Liu
M. Xiong
302
1
0
30 Oct 2022
Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future
Guo-Jun Qi
M. Shah
SSL
150
8
0
23 Oct 2022
i-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable?
Kevin Zhang
Zhiqiang Shen
112
10
0
20 Oct 2022
Towards Sustainable Self-supervised Learning
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
CLL
328
11
0
20 Oct 2022
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
Neural Information Processing Systems (NeurIPS), 2022
Philippe Weinzaepfel
Vincent Leroy
Thomas Lucas
Romain Brégier
Yohann Cabon
Vaibhav Arora
L. Antsfeld
Boris Chidlovskii
G. Csurka
Jérôme Revaud
SSL
371
123
0
19 Oct 2022
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
234
42
0
19 Oct 2022
Token Merging: Your ViT But Faster
International Conference on Learning Representations (ICLR), 2022
Daniel Bolya
Cheng-Yang Fu
Xiaoliang Dai
Peizhao Zhang
Christoph Feichtenhofer
Judy Hoffman
MoMe
414
716
0
17 Oct 2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning
International Conference on Learning Representations (ICLR), 2022
Mahmoud Assran
Randall Balestriero
Quentin Duval
Florian Bordes
Ishan Misra
Piotr Bojanowski
Pascal Vincent
Michael G. Rabbat
Nicolas Ballas
SSL
208
62
0
13 Oct 2022
Exploring Long-Sequence Masked Autoencoders
Ronghang Hu
Shoubhik Debnath
Saining Xie
Xinlei Chen
181
23
0
13 Oct 2022
Masked Motion Encoding for Self-Supervised Video Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Xinyu Sun
Peihao Chen
Liang-Chieh Chen
Chan Li
Thomas H. Li
Zhuliang Yu
Chuang Gan
285
43
0
12 Oct 2022
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Chenjie Cao
Qiaole Dong
Yanwei Fu
335
47
0
12 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
258
12
0
11 Oct 2022
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Zijia Zhao
Longteng Guo
Xingjian He
Shuai Shao
Zehuan Yuan
Jing Liu
300
13
0
09 Oct 2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
Huanjin Yao
Yi Jiang
Xiatian Zhu
Zehuan Yuan
137
29
0
09 Oct 2022
Image Masking for Robust Self-Supervised Monocular Depth Estimation
IEEE International Conference on Robotics and Automation (ICRA), 2022
Hemang Chawla
Kishaan Jeeveswaran
Elahe Arani
Bahram Zonooz
MDE
221
8
0
05 Oct 2022
Backdoor Attacks in the Supply Chain of Masked Image Modeling
Xinyue Shen
Xinlei He
Zheng Li
Yun Shen
Michael Backes
Yang Zhang
179
8
0
04 Oct 2022
Contrastive Audio-Visual Masked Autoencoder
International Conference on Learning Representations (ICLR), 2022
Yuan Gong
Andrew Rouditchenko
Alexander H. Liu
David Harwath
Leonid Karlinsky
Hilde Kuehne
James R. Glass
395
166
0
02 Oct 2022
Federated Training of Dual Encoding Models on Small Non-IID Client Datasets
Raviteja Vemulapalli
Warren Morningstar
Philip Mansfield
Hubert Eichner
K. Singhal
Arash Afkanpour
Bradley Green
FedML
289
2
0
30 Sep 2022
Previous
1
2
3
...
10
7
8
9
Next