v1v2 (latest)

Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021

Christoph Feichtenhofer

ViT

ArXiv (abs)PDF HTML

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 498 papers shown

Rethinking the Learning Paradigm for Facial Expression Recognition

Weijie Wang

Andrii Zadaianchuk

Bruno Lepri

239

30 Sep 2022

Improving Molecular Pretraining with Complementary Featurizations

Dingshuo Chen

Qiang Liu

165

29 Sep 2022

Spiking Neural Networks for event-based action recognition: A new task to understand their advantageNeurocomputing (Neurocomputing), 2022

210

29 Sep 2022

Information-Theoretic Hashing for Zero-Shot Cross-Modal Retrieval

126

26 Sep 2022

Self-Supervised Masked Convolutional Transformer Block for Anomaly DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Neelu Madan

Nicolae-Cătălin Ristea

548

100

25 Sep 2022

Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos

132

21 Sep 2022

Attentive Symmetric Autoencoder for Brain MRI SegmentationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022

Xiang Wan

132

19 Sep 2022

S$^3$R: Self-supervised Spectral Regression for Hyperspectral
Histopathology Image Classification

^3

R: Self-supervised Spectral Regression for Hyperspectral Histopathology Image ClassificationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022

Xingran Xie

Yan Wang

Qingli Li

161

19 Sep 2022

Vision Transformers for Action Recognition: A Survey

225

13 Sep 2022

Exploring Target Representations for Masked AutoencodersInternational Conference on Learning Representations (ICLR), 2022

666

08 Sep 2022

MimCo: Masked Image Modeling Pre-training with Contrastive TeacherACM Multimedia (ACM MM), 2022

Hao Li

301

07 Sep 2022

An Empirical Study of End-to-End Video-Language Transformers with Masked Visual ModelingComputer Vision and Pattern Recognition (CVPR), 2022

Zicheng Liu

624

04 Sep 2022

MORI-RAN: Multi-view Robust Representation Learning via Hybrid Contrastive Fusion

Guanzhou Ke

Yong-Nan Zhu

Yang Yu

117

26 Aug 2022

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image PretrainingComputer Vision and Pattern Recognition (CVPR), 2022

Jianmin Bao

...

Lu Yuan

281

221

25 Aug 2022

Masked Autoencoders Enable Efficient Knowledge DistillersComputer Vision and Pattern Recognition (CVPR), 2022

Cihang Xie

284

25 Aug 2022

VLMAE: Vision-Language Masked Autoencoder

195

19 Aug 2022

BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers

405

389

12 Aug 2022

MILAN: Masked Image Pretraining on Language Assisted Representation

301

11 Aug 2022

Frozen CLIP Models are Efficient Video LearnersEuropean Conference on Computer Vision (ECCV), 2022

Yu Qiao

256

253

06 Aug 2022

Learning Prior Feature and Attention Enhanced Image InpaintingEuropean Conference on Computer Vision (ECCV), 2022

163

03 Aug 2022

SdAE: Self-distillated Masked AutoencoderEuropean Conference on Computer Vision (ECCV), 2022

210

31 Jul 2022

A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond

Kang Zhang

In So Kweon

SSL

225

30 Jul 2022

Contrastive Masked Autoencoders are Stronger Vision LearnersIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

503

206

27 Jul 2022

P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos

Jiang Bian

Haoyi Xiong

190

26 Jul 2022

MAR: Masked Autoencoders for Efficient Action RecognitionIEEE transactions on multimedia (IEEE TMM), 2022

236

24 Jul 2022

MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of BehaviorInternational Conference on Machine Learning (ICML), 2022

...

225

21 Jul 2022

Self-Supervised-RCNN for Medical Image Segmentation with Limited Data Annotation

186

17 Jul 2022

Bootstrapped Masked Autoencoders for Vision BERT PretrainingEuropean Conference on Computer Vision (ECCV), 2022

Jianmin Bao

Lu Yuan

207

14 Jul 2022

Masked Autoencoders that ListenNeural Information Processing Systems (NeurIPS), 2022

Po-Yao (Bernie) Huang

Christoph Feichtenhofer

534

385

13 Jul 2022

387

08 Jul 2022

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing DomainRemote Sensing (RS), 2022

Tong Zhang

171

08 Jul 2022

Masked Surfel Prediction for Self-Supervised Point Cloud Learning

Lei Zhang

197

07 Jul 2022

Dissecting Self-Supervised Learning Methods for Surgical Computer Vision

...

Alexandros Karargyris

N. Padoy

514

01 Jul 2022

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text RecognitionACM Multimedia (ACM MM), 2022

349

01 Jul 2022

Masked World Models for Visual ControlConference on Robot Learning (CoRL), 2022

Pieter Abbeel

402

181

28 Jun 2022

ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningNeural Information Processing Systems (NeurIPS), 2022

382

264

27 Jun 2022

Bi-Calibration Networks for Weakly-Supervised Video Representation LearningInternational Journal of Computer Vision (IJCV), 2022

Tao Mei

252

21 Jun 2022

SemMAE: Semantic-Guided Masking for Learning Masked AutoencodersNeural Information Processing Systems (NeurIPS), 2022

464

152

21 Jun 2022

EATFormer: Improving Vision Transformer Inspired by Evolutionary AlgorithmInternational Journal of Computer Vision (IJCV), 2022

Jiangning Zhang

Xiangtai Li

Yabiao Wang

Chengjie Wang

304

19 Jun 2022

Self-Supervised Learning for Videos: A SurveyACM Computing Surveys (ACM CSUR), 2022

Madeline Chantry Schiappa

Yogesh S Rawat

M. Shah

SSL

474

166

18 Jun 2022

OmniMAE: Single Model Masked Pretraining on Images and VideosComputer Vision and Pattern Recognition (CVPR), 2022

Rohit Girdhar

Alaaeldin El-Nouby

Mannat Singh

Kalyan Vasudev Alwala

Armand Joulin

Ishan Misra

ViT

265

118

16 Jun 2022

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

F. Saleh

Fuwen Tan

Adrian Bulat

Georgios Tzimiropoulos

Brais Martínez

SSL

275

16 Jun 2022

Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking ConsistencyNeural Information Processing Systems (NeurIPS), 2022

140

16 Jun 2022

Masked Frequency Modeling for Self-Supervised Visual Pre-TrainingInternational Conference on Learning Representations (ICLR), 2022

Xiaohang Zhan

232

15 Jun 2022

LAVENDER: Unifying Video-Language Understanding as Masked Language ModelingComputer Vision and Pattern Recognition (CVPR), 2022

Zicheng Liu

191

14 Jun 2022

SERE: Exploring Feature Self-relation for Self-supervised TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Zhong-Yu Li

Shanghua Gao

Ming-Ming Cheng

ViT MDE

253

10 Jun 2022

Extreme Masking for Learning Instance and Distributed Visual Representations

291

09 Jun 2022

Spatial Entropy as an Inductive Bias for Vision TransformersMachine-mediated learning (ML), 2022

Wei Bi

277

09 Jun 2022

Towards Understanding Why Mask-Reconstruction Pretraining Helps in Downstream TasksInternational Conference on Learning Representations (ICLR), 2022

345

08 Jun 2022

Siamese Image Modeling for Self-Supervised Vision Representation LearningComputer Vision and Pattern Recognition (CVPR), 2022

Gao Huang

Yu Qiao

286

107

02 Jun 2022