ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09133
  4. Cited By
Masked Feature Prediction for Self-Supervised Visual Pre-Training
v1v2 (latest)

Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
    ViT
ArXiv (abs)PDFHTML

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 497 papers shown
Title
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
SmooSeg: Smoothness Prior for Unsupervised Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
224
18
0
27 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from
  Images
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
259
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked
  Auto-Encoder
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-EncoderNeural Information Processing Systems (NeurIPS), 2023
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
192
6
0
25 Oct 2023
Generative and Contrastive Paradigms Are Complementary for Graph
  Self-Supervised Learning
Generative and Contrastive Paradigms Are Complementary for Graph Self-Supervised LearningIEEE International Conference on Data Engineering (ICDE), 2023
Yuxiang Wang
Xiao Yan
Chuang Hu
Fangcheng Fu
Wentao Zhang
Hao Wang
Shuo Shang
Jiawei Jiang
SSL
132
14
0
24 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim
Sanghyuk Chun
Byeongho Heo
Dongyoon Han
SSL
270
3
0
20 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
427
511
0
09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
Enhancing Representations through Heterogeneous Self-Supervised LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Tianpeng Liu
Ming-Ming Cheng
SSL
330
3
0
08 Oct 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
  Learning
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Yinda Chen
Wei-Ping Huang
Shenglong Zhou
Qi Chen
Zhiwei Xiong
169
30
0
06 Oct 2023
Self-distilled Masked Attention guided masked image modeling with noise
  Regularized Teacher (SMART) for medical image analysis
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis
Jue Jiang
Aneesh Rangnekar
Chloe Choi
Harini Veeraraghavan
MedIm
176
1
0
02 Oct 2023
Towards Free Data Selection with General-Purpose Models
Towards Free Data Selection with General-Purpose ModelsNeural Information Processing Systems (NeurIPS), 2023
Alessandro Mutti
Mingyu Ding
Patrizia Semeraro
Wei Zhan
216
12
0
29 Sep 2023
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image
  Understanding
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image UnderstandingACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) (TOMM), 2023
Mingming Zhang
Qingjie Liu
Yunhong Wang
340
9
0
28 Sep 2023
M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for
  2D image and video understanding
M3^{3}33D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understandingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
208
2
0
26 Sep 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Masked Image Residual Learning for Scaling Deeper Vision TransformersNeural Information Processing Systems (NeurIPS), 2023
Guoxi Huang
Hongtao Fu
A. Bors
244
8
0
25 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video
  Transfer Learning
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer LearningIEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
208
31
0
14 Sep 2023
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual
  Tokenization
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual TokenizationInternational Conference on Learning Representations (ICLR), 2023
Yang Jin
Kun Xu
Kun Xu
Liwei Chen
Chao Liao
...
Xiaoqiang Lei
Chen Zhang
Wenwu Ou
Kun Gai
Yadong Mu
MLLMVLM
221
75
0
09 Sep 2023
AMLP: Adjustable Masking Lesion Patches for Self-Supervised Medical Image Segmentation
AMLP: Adjustable Masking Lesion Patches for Self-Supervised Medical Image SegmentationIEEE Transactions on Medical Imaging (TMI), 2023
Xiang-Fei Wang
Ruizhi Wang
Jie Zhou
Thomas Lukasiewicz
162
0
0
08 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped
  Positions
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped PositionsNeural Information Processing Systems (NeurIPS), 2023
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
234
23
0
07 Sep 2023
Toward High Quality Facial Representation Learning
Toward High Quality Facial Representation LearningACM Multimedia (ACM MM), 2023
Yue Wang
Jinlong Peng
Jiangning Zhang
Ran Yi
Lu Liu
Yabiao Wang
Chengjie Wang
CVBMSSL
192
12
0
07 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
229
41
0
04 Sep 2023
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action
  Spotting using Transformers
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
J. Denize
Mykola Liashuha
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
ViT
283
19
0
03 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image
  Modeling
RevColV2: Exploring Disentangled Representations in Masked Image ModelingNeural Information Processing Systems (NeurIPS), 2023
Qi Han
Yuxuan Cai
Xiangyu Zhang
291
13
0
02 Sep 2023
Self-Supervised Video Transformers for Isolated Sign Language
  Recognition
Self-Supervised Video Transformers for Isolated Sign Language Recognition
Marcelo Sandoval-Castaneda
Yanhong Li
D. Brentari
Karen Livescu
Gregory Shakhnarovich
SLR
244
9
0
02 Sep 2023
Contrastive Feature Masking Open-Vocabulary Vision Transformer
Contrastive Feature Masking Open-Vocabulary Vision TransformerIEEE International Conference on Computer Vision (ICCV), 2023
Dahun Kim
A. Angelova
Weicheng Kuo
ObjDVLM
298
37
0
02 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
CL-MAE: Curriculum-Learned Masked AutoencodersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
404
25
0
31 Aug 2023
Motion-Guided Masking for Spatiotemporal Representation Learning
Motion-Guided Masking for Spatiotemporal Representation LearningIEEE International Conference on Computer Vision (ICCV), 2023
D. Fan
Jue Wang
Shuai Liao
Yi Zhu
Vimal Bhat
H. Santos-Villalobos
M. Rohith
Xinyu Li
VGen
187
27
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
128
6
0
24 Aug 2023
Masked Feature Modelling: Feature Masking for the Unsupervised
  Pre-training of a Graph Attention Network Block for Bottom-up Video Event
  Recognition
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
173
0
0
24 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
275
4
0
23 Aug 2023
EVE: Efficient Vision-Language Pre-training with Masked Prediction and
  Modality-Aware MoE
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoEAAAI Conference on Artificial Intelligence (AAAI), 2023
Junyi Chen
Longteng Guo
Jianxiang Sun
Shuai Shao
Zehuan Yuan
Liang Lin
Dongyu Zhang
MLLMVLMMoE
171
19
0
23 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic
  Understanding
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
188
4
0
22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
MGMAE: Motion Guided Masking for Video Masked AutoencodingIEEE International Conference on Computer Vision (ICCV), 2023
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
143
49
0
21 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time
  Frequency-domain Prompting
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain PromptingIEEE International Conference on Computer Vision (ICCV), 2023
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
282
11
0
20 Aug 2023
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent
  Space Reconstruction
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space ReconstructionInternational Conference on Information and Knowledge Management (CIKM), 2023
Yucheng Shi
Yushun Dong
Qiaoyu Tan
Jundong Li
Ninghao Liu
323
37
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning
  on Point Cloud Videos
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud VideosIEEE International Conference on Computer Vision (ICCV), 2023
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Yike Guo
Qiong Liu
Hao Wen
Xiaoping Zhou
3DPC
121
20
0
18 Aug 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion PredictionIEEE International Conference on Computer Vision (ICCV), 2023
Chenxin Xu
R. Tan
Yuhong Tan
Siheng Chen
Xinchao Wang
Yanfeng Wang
3DH
237
31
0
17 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
SRMAE: Masked Image Modeling for Scale-Invariant Deep RepresentationsChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Zhiming Wang
Lin Gu
Feng Lu
226
1
0
17 Aug 2023
Stable and Causal Inference for Discriminative Self-supervised Deep
  Visual Representations
Stable and Causal Inference for Discriminative Self-supervised Deep Visual RepresentationsIEEE International Conference on Computer Vision (ICCV), 2023
Yuewei Yang
Hai Helen Li
Yiran Chen
CMLOOD
241
1
0
16 Aug 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
Masked Motion Predictors are Strong 3D Action Representation LearnersIEEE International Conference on Computer Vision (ICCV), 2023
Yunyao Mao
Jiajun Deng
Wen-gang Zhou
Yao Fang
Wanli Ouyang
Houqiang Li
3DPC
239
62
0
14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images
  with Free Attention Masks
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang
Mutian Xu
Chuhui Xue
Wenqing Zhang
Xiaoguang Han
Song Bai
Mike Zheng Shou
DiffM
229
7
0
13 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
185
16
0
10 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Yi Zhang
Chengyi Wu
196
7
0
08 Aug 2023
Learning Concise and Descriptive Attributes for Visual Recognition
Learning Concise and Descriptive Attributes for Visual RecognitionIEEE International Conference on Computer Vision (ICCV), 2023
Andy Yan
Yu Wang
Yiwu Zhong
Chengyu Dong
Zexue He
Yujie Lu
William Wang
Jingbo Shang
Julian McAuley
VLM
230
84
0
07 Aug 2023
Feature-Suppressed Contrast for Self-Supervised Food Pre-training
Feature-Suppressed Contrast for Self-Supervised Food Pre-trainingACM Multimedia (ACM MM), 2023
Xinda Liu
Yaohui Zhu
Linhu Liu
Jiang Tian
Lili Wang
SSL
189
6
0
07 Aug 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability
Improving Pixel-based MIM by Reducing Wasted Modeling CapabilityIEEE International Conference on Computer Vision (ICCV), 2023
Yuan Liu
Songyang Zhang
Jiacheng Chen
Zhaohui Yu
Kai-xiang Chen
Dahua Lin
188
41
0
01 Aug 2023
EEG-based Cognitive Load Classification using Feature Masked
  Autoencoding and Emotion Transfer Learning
EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer LearningInternational Conference on Multimodal Interaction (ICMI), 2023
Dustin Pulver
Prithila Angkan
Paul Hungler
Ali Etemad
245
15
0
01 Aug 2023
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical
  Image Pre-training
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-trainingInternational Conference on Medical Imaging with Deep Learning (MIDL), 2023
Jeya Maria Jose Valanarasu
Yucheng Tang
Dong Yang
Ziyue Xu
Can Zhao
...
Vishal M. Patel
Bennett Landman
Daguang Xu
Yufan He
V. Nath
MedIm
174
20
0
31 Jul 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
145
1
0
29 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked
  Online Codebook Assignments
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSLViT
253
9
0
18 Jul 2023
Does Visual Pretraining Help End-to-End Reasoning?
Does Visual Pretraining Help End-to-End Reasoning?Neural Information Processing Systems (NeurIPS), 2023
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCLLRMViT
290
4
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
DreamTeacher: Pretraining Image Backbones with Deep Generative ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLMDiffM
237
34
0
14 Jul 2023
Previous
123456...8910
Next