v1v2 (latest)

Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021

Christoph Feichtenhofer

ViT

ArXiv (abs)PDF HTML

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 498 papers shown

Audiovisual Masked AutoencodersIEEE International Conference on Computer Vision (ICCV), 2022

Mariana-Iuliana Georgescu

317

09 Dec 2022

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation LearningComputer Vision and Pattern Recognition (CVPR), 2022

Zuxuan Wu

Lu Yuan

326

120

08 Dec 2022

Group Generalized Mean Pooling for Vision Transformer

295

08 Dec 2022

SimVTP: Simple Video Text Pre-training with Masked Autoencoders

Yue Ma

Tianyu Yang

Yin Shan

Xiu Li

169

07 Dec 2022

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video LearningComputer Vision and Pattern Recognition (CVPR), 2022

239

06 Dec 2022

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Yi Wang

...

Yu Qiao

454

446

06 Dec 2022

Location-Aware Self-Supervised Transformers for Semantic SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

329

05 Dec 2022

Exploring Stochastic Autoregressive Image Modeling for Visual RepresentationAAAI Conference on Artificial Intelligence (AAAI), 2022

Fan Yang

104

03 Dec 2022

MIC: Masked Image Consistency for Context-Enhanced Domain AdaptationComputer Vision and Pattern Recognition (CVPR), 2022

Lukas Hoyer

Dengxin Dai

Haoran Wang

Luc Van Gool

396

323

02 Dec 2022

Multi-scale Transformer Network with Edge-aware Pre-training for Cross-Modality MR Image SynthesisIEEE Transactions on Medical Imaging (IEEE TMI), 2022

343

02 Dec 2022

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval

185

02 Dec 2022

Scaling Language-Image Pre-training via MaskingComputer Vision and Pattern Recognition (CVPR), 2022

Yanghao Li

Haoqi Fan

Ronghang Hu

Christoph Feichtenhofer

Kaiming He

CLIP VLM

375

393

01 Dec 2022

Spatio-Temporal Crop Aggregation for Video Representation LearningIEEE International Conference on Computer Vision (ICCV), 2022

Sepehr Sameni

Simon Jenni

Paolo Favaro

312

30 Nov 2022

Self-Supervised Learning based on Heat Equation

Lu Yuan

Zicheng Liu

Youzuo Lin

146

23 Nov 2022

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token MigrationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yaowei Wang

196

23 Nov 2022

LoopDA: Constructing Self-loops to Adapt Nighttime Semantic SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

144

21 Nov 2022

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-trainingIEEE International Conference on Computer Vision (ICCV), 2022

Cihang Xie

305

21 Nov 2022

CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical FlowIEEE International Conference on Computer Vision (ICCV), 2022

436

156

18 Nov 2022

CAE v2: Context Autoencoder with CLIP Target

...

Errui Ding

Jingdong Wang

VLM CLIP

276

17 Nov 2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Yi Wang

Yu Qiao

224

155

17 Nov 2022

AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked AutoencodersComputer Vision and Pattern Recognition (CVPR), 2022

239

16 Nov 2022

MAGE: MAsked Generative Encoder to Unify Representation Learning and Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2022

326

229

16 Nov 2022

Stare at What You See: Masked Image Modeling without ReconstructionComputer Vision and Pattern Recognition (CVPR), 2022

Yu Qiao

183

16 Nov 2022

Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022

178

16 Nov 2022

Masked Reconstruction Contrastive Learning with Information Bottleneck Principle

Congying Han

Xuecheng Nie

149

15 Nov 2022

Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future WorksIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022

240

15 Nov 2022

EVA: Exploring the Limits of Masked Visual Representation Learning at ScaleComputer Vision and Pattern Recognition (CVPR), 2022

604

898

14 Nov 2022

Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision DecodingComputer Vision and Pattern Recognition (CVPR), 2022

336

200

13 Nov 2022

Demystify Self-Attention in Vision Transformers from a Semantic Perspective: Analysis and Application

150

13 Nov 2022

MARLIN: Masked Autoencoder for facial video Representation LearnINgComputer Vision and Pattern Recognition (CVPR), 2022

Zhixi Cai

Shreya Ghosh

Kalin Stefanov

Abhinav Dhall

Jianfei Cai

244

12 Nov 2022

Attention-based Neural Cellular AutomataNeural Information Processing Systems (NeurIPS), 2022

223

02 Nov 2022

RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representation from X-Ray Images

Guang Li

Ren Togo

Takahiro Ogawa

Miki Haseyama

209

01 Nov 2022

Changes from Classical Statistics to Modern Statistics and Data Science

Kai Zhang

Shan-Yu Liu

M. Xiong

302

30 Oct 2022

Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future

Guo-Jun Qi

M. Shah

SSL

150

23 Oct 2022

i-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable?

Kevin Zhang

Zhiqiang Shen

112

20 Oct 2022

Towards Sustainable Self-supervised Learning

328

20 Oct 2022

CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View CompletionNeural Information Processing Systems (NeurIPS), 2022

371

123

19 Oct 2022

A Unified View of Masked Image Modeling

234

19 Oct 2022

Token Merging: Your ViT But FasterInternational Conference on Learning Representations (ICLR), 2022

Christoph Feichtenhofer

Judy Hoffman

MoMe

414

716

17 Oct 2022

The Hidden Uniform Cluster Prior in Self-Supervised LearningInternational Conference on Learning Representations (ICLR), 2022

Pascal Vincent

208

13 Oct 2022

Exploring Long-Sequence Masked Autoencoders

181

13 Oct 2022

Masked Motion Encoding for Self-Supervised Video Representation LearningComputer Vision and Pattern Recognition (CVPR), 2022

Chuang Gan

285

12 Oct 2022

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural PriorsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Chenjie Cao

Qiaole Dong

Yanwei Fu

335

12 Oct 2022

It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training

Jingdong Wang

258

11 Oct 2022

MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation LearningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022

Jing Liu

300

09 Oct 2022

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

137

09 Oct 2022

Image Masking for Robust Self-Supervised Monocular Depth EstimationIEEE International Conference on Robotics and Automation (ICRA), 2022

221

05 Oct 2022

Backdoor Attacks in the Supply Chain of Masked Image Modeling

Zheng Li

Michael Backes

179

04 Oct 2022

Contrastive Audio-Visual Masked AutoencoderInternational Conference on Learning Representations (ICLR), 2022

395

166

02 Oct 2022

Federated Training of Dual Encoding Models on Small Non-IID Client Datasets

289

30 Sep 2022