Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2112.09133
Cited By
v1
v2 (latest)
Masked Feature Prediction for Self-Supervised Visual Pre-Training
16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked Feature Prediction for Self-Supervised Visual Pre-Training"
50 / 497 papers shown
Title
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
228
18
0
27 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
269
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Neural Information Processing Systems (NeurIPS), 2023
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
192
6
0
25 Oct 2023
Generative and Contrastive Paradigms Are Complementary for Graph Self-Supervised Learning
IEEE International Conference on Data Engineering (ICDE), 2023
Yuxiang Wang
Xiao Yan
Chuang Hu
Fangcheng Fu
Wentao Zhang
Hao Wang
Shuo Shang
Jiawei Jiang
SSL
144
14
0
24 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim
Sanghyuk Chun
Byeongho Heo
Dongyoon Han
SSL
274
3
0
20 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
427
515
0
09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Tianpeng Liu
Ming-Ming Cheng
SSL
358
3
0
08 Oct 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yinda Chen
Wei-Ping Huang
Shenglong Zhou
Qi Chen
Zhiwei Xiong
177
31
0
06 Oct 2023
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis
Jue Jiang
Aneesh Rangnekar
Chloe Choi
Harini Veeraraghavan
MedIm
188
1
0
02 Oct 2023
Towards Free Data Selection with General-Purpose Models
Neural Information Processing Systems (NeurIPS), 2023
Alessandro Mutti
Mingyu Ding
Patrizia Semeraro
Wei Zhan
224
12
0
29 Sep 2023
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) (TOMM), 2023
Mingming Zhang
Qingjie Liu
Yunhong Wang
352
9
0
28 Sep 2023
M
3
^{3}
3
3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
232
2
0
26 Sep 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Neural Information Processing Systems (NeurIPS), 2023
Guoxi Huang
Hongtao Fu
A. Bors
260
8
0
25 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
208
31
0
14 Sep 2023
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
International Conference on Learning Representations (ICLR), 2023
Yang Jin
Kun Xu
Kun Xu
Liwei Chen
Chao Liao
...
Xiaoqiang Lei
Chen Zhang
Wenwu Ou
Kun Gai
Yadong Mu
MLLM
VLM
225
76
0
09 Sep 2023
AMLP: Adjustable Masking Lesion Patches for Self-Supervised Medical Image Segmentation
IEEE Transactions on Medical Imaging (TMI), 2023
Xiang-Fei Wang
Ruizhi Wang
Jie Zhou
Thomas Lukasiewicz
170
0
0
08 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Neural Information Processing Systems (NeurIPS), 2023
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
242
23
0
07 Sep 2023
Toward High Quality Facial Representation Learning
ACM Multimedia (ACM MM), 2023
Yue Wang
Jinlong Peng
Jiangning Zhang
Ran Yi
Lu Liu
Yabiao Wang
Chengjie Wang
CVBM
SSL
200
12
0
07 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
245
41
0
04 Sep 2023
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
J. Denize
Mykola Liashuha
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
ViT
287
19
0
03 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Neural Information Processing Systems (NeurIPS), 2023
Qi Han
Yuxuan Cai
Xiangyu Zhang
303
13
0
02 Sep 2023
Self-Supervised Video Transformers for Isolated Sign Language Recognition
Marcelo Sandoval-Castaneda
Yanhong Li
D. Brentari
Karen Livescu
Gregory Shakhnarovich
SLR
256
9
0
02 Sep 2023
Contrastive Feature Masking Open-Vocabulary Vision Transformer
IEEE International Conference on Computer Vision (ICCV), 2023
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
VLM
318
37
0
02 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
448
25
0
31 Aug 2023
Motion-Guided Masking for Spatiotemporal Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2023
D. Fan
Jue Wang
Shuai Liao
Yi Zhu
Vimal Bhat
H. Santos-Villalobos
M. Rohith
Xinyu Li
VGen
199
27
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
140
6
0
24 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
185
0
0
24 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
295
4
0
23 Aug 2023
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI Conference on Artificial Intelligence (AAAI), 2023
Junyi Chen
Longteng Guo
Jianxiang Sun
Shuai Shao
Zehuan Yuan
Liang Lin
Dongyu Zhang
MLLM
VLM
MoE
171
20
0
23 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
204
4
0
22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
IEEE International Conference on Computer Vision (ICCV), 2023
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
143
50
0
21 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
IEEE International Conference on Computer Vision (ICCV), 2023
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
294
11
0
20 Aug 2023
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction
International Conference on Information and Knowledge Management (CIKM), 2023
Yucheng Shi
Yushun Dong
Qiaoyu Tan
Jundong Li
Ninghao Liu
339
37
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Yike Guo
Qiong Liu
Hao Wen
Xiaoping Zhou
3DPC
133
20
0
18 Aug 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
IEEE International Conference on Computer Vision (ICCV), 2023
Chenxin Xu
R. Tan
Yuhong Tan
Siheng Chen
Xinchao Wang
Yanfeng Wang
3DH
259
31
0
17 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Zhiming Wang
Lin Gu
Feng Lu
230
1
0
17 Aug 2023
Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations
IEEE International Conference on Computer Vision (ICCV), 2023
Yuewei Yang
Hai Helen Li
Yiran Chen
CML
OOD
249
2
0
16 Aug 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
IEEE International Conference on Computer Vision (ICCV), 2023
Yunyao Mao
Jiajun Deng
Wen-gang Zhou
Yao Fang
Wanli Ouyang
Houqiang Li
3DPC
247
62
0
14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang
Mutian Xu
Chuhui Xue
Wenqing Zhang
Xiaoguang Han
Song Bai
Mike Zheng Shou
DiffM
229
7
0
13 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
193
16
0
10 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Yi Zhang
Chengyi Wu
212
7
0
08 Aug 2023
Learning Concise and Descriptive Attributes for Visual Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Andy Yan
Yu Wang
Yiwu Zhong
Chengyu Dong
Zexue He
Yujie Lu
William Wang
Jingbo Shang
Julian McAuley
VLM
242
84
0
07 Aug 2023
Feature-Suppressed Contrast for Self-Supervised Food Pre-training
ACM Multimedia (ACM MM), 2023
Xinda Liu
Yaohui Zhu
Linhu Liu
Jiang Tian
Lili Wang
SSL
217
6
0
07 Aug 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Liu
Songyang Zhang
Jiacheng Chen
Zhaohui Yu
Kai-xiang Chen
Dahua Lin
200
41
0
01 Aug 2023
EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning
International Conference on Multimodal Interaction (ICMI), 2023
Dustin Pulver
Prithila Angkan
Paul Hungler
Ali Etemad
261
15
0
01 Aug 2023
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training
International Conference on Medical Imaging with Deep Learning (MIDL), 2023
Jeya Maria Jose Valanarasu
Yucheng Tang
Dong Yang
Ziyue Xu
Can Zhao
...
Vishal M. Patel
Bennett Landman
Daguang Xu
Yufan He
V. Nath
MedIm
182
20
0
31 Jul 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
149
1
0
29 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL
ViT
257
9
0
18 Jul 2023
Does Visual Pretraining Help End-to-End Reasoning?
Neural Information Processing Systems (NeurIPS), 2023
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
318
4
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
IEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
249
34
0
14 Jul 2023
Previous
1
2
3
4
5
6
...
8
9
10
Next