2112.09133
Masked Feature Prediction for Self-Supervised Visual Pre-Training
16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chao-Yuan Wu
Alan Yuille
Christoph Feichtenhofer
ViT
Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"
50 / 497 papers shown
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
224
18
0
27 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
259
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Neural Information Processing Systems (NeurIPS), 2023
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
192
6
0
25 Oct 2023
Generative and Contrastive Paradigms Are Complementary for Graph Self-Supervised Learning
IEEE International Conference on Data Engineering (ICDE), 2023
Yuxiang Wang
Xiao Yan
Chuang Hu
Fangcheng Fu
Wentao Zhang
Hao Wang
Shuo Shang
Jiawei Jiang
SSL
132
14
0
24 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim
Sanghyuk Chun
Byeongho Heo
Dongyoon Han
SSL
270
3
0
20 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
427
511
0
09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Tianpeng Liu
Ming-Ming Cheng
SSL
330
3
0
08 Oct 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yinda Chen
Wei-Ping Huang
Shenglong Zhou
Qi Chen
Zhiwei Xiong
169
30
0
06 Oct 2023
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis
Jue Jiang
Aneesh Rangnekar
Chloe Choi
Harini Veeraraghavan
MedIm
176
1
0
02 Oct 2023
Towards Free Data Selection with General-Purpose Models
Neural Information Processing Systems (NeurIPS), 2023
Alessandro Mutti
Mingyu Ding
Patrizia Semeraro
Wei Zhan
216
12
0
29 Sep 2023
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) (TOMM), 2023
Mingming Zhang
Qingjie Liu
Yunhong Wang
340
9
0
28 Sep 2023
M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
208
2
0
26 Sep 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Neural Information Processing Systems (NeurIPS), 2023
Guoxi Huang
Hongtao Fu
A. Bors
244
8
0
25 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
208
31
0
14 Sep 2023
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
International Conference on Learning Representations (ICLR), 2023
Yang Jin
Kun Xu
Kun Xu
Liwei Chen
Chao Liao
...
Xiaoqiang Lei
Chen Zhang
Wenwu Ou
Kun Gai
Yadong Mu
MLLM
VLM
221
75
0
09 Sep 2023
AMLP: Adjustable Masking Lesion Patches for Self-Supervised Medical Image Segmentation
IEEE Transactions on Medical Imaging (TMI), 2023
Xiang-Fei Wang
Ruizhi Wang
Jie Zhou
Thomas Lukasiewicz
162
0
0
08 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Neural Information Processing Systems (NeurIPS), 2023
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
234
23
0
07 Sep 2023
Toward High Quality Facial Representation Learning
ACM Multimedia (ACM MM), 2023
Yue Wang
Jinlong Peng
Jiangning Zhang
Ran Yi
Lu Liu
Yabiao Wang
Chengjie Wang
CVBM
SSL
192
12
0
07 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
229
41
0
04 Sep 2023
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
J. Denize
Mykola Liashuha
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
ViT
283
19
0
03 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Neural Information Processing Systems (NeurIPS), 2023
Qi Han
Yuxuan Cai
Xiangyu Zhang
291
13
0
02 Sep 2023
Self-Supervised Video Transformers for Isolated Sign Language Recognition
Marcelo Sandoval-Castaneda
Yanhong Li
D. Brentari
Karen Livescu
Gregory Shakhnarovich
SLR
244
9
0
02 Sep 2023
Contrastive Feature Masking Open-Vocabulary Vision Transformer
IEEE International Conference on Computer Vision (ICCV), 2023
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
VLM
298
37
0
02 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
404
25
0
31 Aug 2023
Motion-Guided Masking for Spatiotemporal Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2023
D. Fan
Jue Wang
Shuai Liao
Yi Zhu
Vimal Bhat
H. Santos-Villalobos
M. Rohith
Xinyu Li
VGen
187
27
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
128
6
0
24 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
173
0
0
24 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
275
4
0
23 Aug 2023
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI Conference on Artificial Intelligence (AAAI), 2023
Junyi Chen
Longteng Guo
Jianxiang Sun
Shuai Shao
Zehuan Yuan
Liang Lin
Dongyu Zhang
MLLM
VLM
MoE
171
19
0
23 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
188
4
0
22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
IEEE International Conference on Computer Vision (ICCV), 2023
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
143
49
0
21 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
IEEE International Conference on Computer Vision (ICCV), 2023
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
282
11
0
20 Aug 2023
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction
International Conference on Information and Knowledge Management (CIKM), 2023
Yucheng Shi
Yushun Dong
Qiaoyu Tan
Jundong Li
Ninghao Liu
323
37
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Yike Guo
Qiong Liu
Hao Wen
Xiaoping Zhou
3DPC
121
20
0
18 Aug 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
IEEE International Conference on Computer Vision (ICCV), 2023
Chenxin Xu
R. Tan
Yuhong Tan
Siheng Chen
Xinchao Wang
Yanfeng Wang
3DH
237
31
0
17 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Zhiming Wang
Lin Gu
Feng Lu
226
1
0
17 Aug 2023
Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations
IEEE International Conference on Computer Vision (ICCV), 2023
Yuewei Yang
Hai Helen Li
Yiran Chen
CML
OOD
241
1
0
16 Aug 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
IEEE International Conference on Computer Vision (ICCV), 2023
Yunyao Mao
Jiajun Deng
Wen-gang Zhou
Yao Fang
Wanli Ouyang
Houqiang Li
3DPC
239
62
0
14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang
Mutian Xu
Chuhui Xue
Wenqing Zhang
Xiaoguang Han
Song Bai
Mike Zheng Shou
DiffM
229
7
0
13 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
185
16
0
10 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Yi Zhang
Chengyi Wu
196
7
0
08 Aug 2023
Learning Concise and Descriptive Attributes for Visual Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Andy Yan
Yu Wang
Yiwu Zhong
Chengyu Dong
Zexue He
Yujie Lu
William Wang
Jingbo Shang
Julian McAuley
VLM
230
84
0
07 Aug 2023
Feature-Suppressed Contrast for Self-Supervised Food Pre-training
ACM Multimedia (ACM MM), 2023
Xinda Liu
Yaohui Zhu
Linhu Liu
Jiang Tian
Lili Wang
SSL
189
6
0
07 Aug 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Liu
Songyang Zhang
Jiacheng Chen
Zhaohui Yu
Kai-xiang Chen
Dahua Lin
188
41
0
01 Aug 2023
EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning
International Conference on Multimodal Interaction (ICMI), 2023
Dustin Pulver
Prithila Angkan
Paul Hungler
Ali Etemad
245
15
0
01 Aug 2023
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training
International Conference on Medical Imaging with Deep Learning (MIDL), 2023
Jeya Maria Jose Valanarasu
Yucheng Tang
Dong Yang
Ziyue Xu
Can Zhao
...
Vishal M. Patel
Bennett Landman
Daguang Xu
Yufan He
V. Nath
MedIm
174
20
0
31 Jul 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
145
1
0
29 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL
ViT
253
9
0
18 Jul 2023
Does Visual Pretraining Help End-to-End Reasoning?
Neural Information Processing Systems (NeurIPS), 2023
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
290
4
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
IEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
237
34
0
14 Jul 2023