Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021

Christoph Feichtenhofer

ViT

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

DreamTeacher: Pretraining Image Backbones with Deep Generative Models

IEEE International Conference on Computer Vision (ICCV), 2023

Antonio Torralba

Sanja Fidler

VLM DiffM

263

14 Jul 2023

HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding

Neural Information Processing Systems (NeurIPS), 2023

105

09 Jul 2023

AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images

171

05 Jul 2023

EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023

Cristhian Forigua

María Escobar

Jordi Pont-Tuset

Kevis-Kokitsi Maninis

Pablo Arbelaez

EgoV

238

29 Jun 2023

Learning with Difference Attention for Visually Grounded Self-supervised Representations

Aishwarya Agarwal

Srikrishna Karanam

Balaji Vasan Srinivasan

179

26 Jun 2023

Task-Robust Pre-Training for Worst-Case Downstream Adaptation

Neural Information Processing Systems (NeurIPS), 2023

Cong Fang

242

21 Jun 2023

MOFI: Learning Image Representations from Noisy Entity Annotated Images

International Conference on Learning Representations (ICLR), 2023

Chen Chen

...

Xianzhi Du

241

13 Jun 2023

Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

Computer Vision and Image Understanding (CVIU), 2023

Lorenzo Baraldi

Lorenzo Baraldi

379

12 Jun 2023

Global and Local Semantic Completion Learning for Vision-Language Pre-training

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Wenzhe Zhao

Hongfa Wang

Yujiu Yang

Wei Liu

VLM

252

12 Jun 2023

Exploring Effective Mask Sampling Modeling for Neural Image Compression

226

09 Jun 2023

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

International Conference on Learning Representations (ICLR), 2023

Gao Huang

192

08 Jun 2023

R-MAE: Regions Meet Masked Autoencoders

International Conference on Learning Representations (ICLR), 2023

293

08 Jun 2023

Asymmetric Patch Sampling for Contrastive Learning

Pattern Recognition (Pattern Recogn.), 2023

243

05 Jun 2023

rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological Measurement

IEEE Transactions on Multimedia (IEEE TMM), 2023

218

04 Jun 2023

Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work

Qiangchang Wang

Yilong Yin

300

02 Jun 2023

Masked Autoencoder for Unsupervised Video Summarization

169

02 Jun 2023

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

International Conference on Machine Learning (ICML), 2023

...

Christoph Feichtenhofer

3DH

305

301

01 Jun 2023

Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

International Conference on Learning Representations (ICLR), 2023

Sarthak Yadav

Sergios Theodoridis

Lars Kai Hansen

Zheng-Hua Tan

251

01 Jun 2023

A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling

IEEE Internet of Things Journal (IEEE IoT J.), 2023

381

01 Jun 2023

MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models

Automatic Speech Recognition & Understanding (ASRU), 2023

450

30 May 2023

Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance

Lu Yuan

Zicheng Liu

Youzuo Lin

227

25 May 2023

RoMa: Robust Dense Feature Matching

Computer Vision and Pattern Recognition (CVPR), 2023

Georg Bökman

350

228

24 May 2023

Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training

259

23 May 2023

SurgMAE: Masked Autoencoders for Long Surgical Video Analysis

Muhammad Abdullah Jamal

Omid Mohareri

180

19 May 2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Peng Wang

Shijie Wang

Junyang Lin

Shuai Bai

Xiaohuan Zhou

Jingren Zhou

Xinggang Wang

Chang Zhou

VLM MLLM ObjD

582

154

18 May 2023

A Survey on Time-Series Pre-Trained Models

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

282

18 May 2023

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

Computer Vision and Pattern Recognition (CVPR), 2023

Xiaoyu Tian

Haoxi Ran

Yue Wang

Hang Zhao

3DPC ViT

133

15 May 2023

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

184

10 May 2023

Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding

Computer Vision and Pattern Recognition (CVPR), 2023

Bernt Schiele

250

08 May 2023

Annotation-efficient learning for OCT segmentation

Biomedical Optics Express (BOE), 2023

542

06 May 2023

What Do Self-Supervised Vision Transformers Learn?

International Conference on Learning Representations (ICLR), 2023

300

103

01 May 2023

Improve Video Representation with Temporal Adversarial Augmentation

International Joint Conference on Artificial Intelligence (IJCAI), 2023

238

28 Apr 2023

Self-Supervised Multi-Object Tracking For Autonomous Driving From Consistency Across Timescales

IEEE Robotics and Automation Letters (RA-L), 2023

Christopher Lang

Alexander Braun

Lars Schillingmann

Abhinav Valada

181

25 Apr 2023

Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders

Hongfa Wang

142

25 Apr 2023

Self-supervised Learning by View Synthesis

Xiangyu Zhang

165

22 Apr 2023

FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

453

21 Apr 2023

Transformer-Based Visual Segmentation: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Xiangtai Li

370

247

19 Apr 2023

CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding

IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023

296

19 Apr 2023

Efficient Video Action Detection with Token Dropout and Context Refinement

IEEE International Conference on Computer Vision (ICCV), 2023

Lei Chen

Zhan Tong

Yibing Song

Gangshan Wu

Limin Wang

305

17 Apr 2023

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

International Conference on Learning Representations (ICLR), 2023

Siming Yan

Yu-Qi Yang

Yu-Xiao Guo

Hao Pan

Peng-shuai Wang

Xin Tong

Yang Liu

Qi-Xing Huang

3DPC

255

14 Apr 2023

Hard Patches Mining for Masked Image Modeling

Computer Vision and Pattern Recognition (CVPR), 2023

209

12 Apr 2023

GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner

The Web Conference (WWW), 2023

Zhenyu Hou

Yufei He

Yukuo Cen

Xiao Liu

Yuxiao Dong

Evgeny Kharlamov

Jie Tang

SSL

176

153

10 Apr 2023

Diffusion Models as Masked Autoencoders

IEEE International Conference on Computer Vision (ICCV), 2023

Chen Wei

K. Mangalam

Po-Yao (Bernie) Huang

Cihang Xie

Christoph Feichtenhofer

DiffM SyDa

196

06 Apr 2023

On the Benefits of 3D Pose and Tracking for Human Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2023

Jathushan Rajasegaran

Georgios Pavlakos

Angjoo Kanazawa

Christoph Feichtenhofer

Jitendra Malik

400

03 Apr 2023

PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation

Computer Vision and Pattern Recognition (CVPR), 2023

176

164

30 Mar 2023

Complementary Random Masking for RGB-Thermal Semantic Segmentation

IEEE International Conference on Robotics and Automation (ICRA), 2023

Ukcheol Shin

Kyunghyun Lee

In So Kweon

Jean Oh

144

30 Mar 2023

ISSTAD: Incremental Self-Supervised Learning Based on Transformer for Anomaly Detection and Localization

Engineering Applications of Artificial Intelligence (Eng. Appl. Artif. Intell.), 2023

336

30 Mar 2023

Mixed Autoencoder for Self-supervised Visual Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2023

Kai Chen

Zhili Liu

Lanqing Hong

Hang Xu

Zhenguo Li

Dit-Yan Yeung

SSL

357

30 Mar 2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Computer Vision and Pattern Recognition (CVPR), 2023

Yi Wang

Yu Qiao

379

533

29 Mar 2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

IEEE International Conference on Computer Vision (ICCV), 2023

Yi Wang

Yu Qiao

528

237

28 Mar 2023