ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09133
  4. Cited By
Masked Feature Prediction for Self-Supervised Visual Pre-Training
v1v2 (latest)

Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
    ViT
ArXiv (abs)PDFHTML

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 498 papers shown
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
DreamTeacher: Pretraining Image Backbones with Deep Generative ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLMDiffM
263
34
0
14 Jul 2023
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly
  Knowledge Understanding
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge UnderstandingNeural Information Processing Systems (NeurIPS), 2023
Hao Zheng
R. Lee
Yuqian Lu
VGen
105
23
0
09 Jul 2023
AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus
  Callosum cross section from EM Images
AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images
Ao Cheng
Guoqiang Zhao
Lirong Wang
Ruobing Zhang
171
5
0
05 Jul 2023
EgoCOL: Egocentric Camera pose estimation for Open-world 3D object
  Localization @Ego4D challenge 2023
EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023
Cristhian Forigua
María Escobar
Jordi Pont-Tuset
Kevis-Kokitsi Maninis
Pablo Arbelaez
EgoV
238
2
0
29 Jun 2023
Learning with Difference Attention for Visually Grounded Self-supervised
  Representations
Learning with Difference Attention for Visually Grounded Self-supervised Representations
Aishwarya Agarwal
Srikrishna Karanam
Balaji Vasan Srinivasan
179
1
0
26 Jun 2023
Task-Robust Pre-Training for Worst-Case Downstream Adaptation
Task-Robust Pre-Training for Worst-Case Downstream AdaptationNeural Information Processing Systems (NeurIPS), 2023
Jianghui Wang
Cheng Yang
Xingyu Xie
Cong Fang
Zhouchen Lin
OOD
242
1
0
21 Jun 2023
MOFI: Learning Image Representations from Noisy Entity Annotated Images
MOFI: Learning Image Representations from Noisy Entity Annotated ImagesInternational Conference on Learning Representations (ICLR), 2023
Wentao Wu
Aleksei Timofeev
Chen Chen
Bowen Zhang
Kun Duan
...
Yantao Zheng
Jonathon Shlens
Xianzhi Du
Zhe Gan
Yinfei Yang
VLM
241
9
0
13 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-TrainingComputer Vision and Image Understanding (CVIU), 2023
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
379
3
0
12 Jun 2023
Global and Local Semantic Completion Learning for Vision-Language
  Pre-training
Global and Local Semantic Completion Learning for Vision-Language Pre-trainingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Rong-Cheng Tu
Yatai Ji
Jie Jiang
Weijie Kong
Chengfei Cai
Wenzhe Zhao
Hongfa Wang
Yujiu Yang
Wei Liu
VLM
252
8
0
12 Jun 2023
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Lin Liu
Mingming Zhao
Shanxin Yuan
Wenlong Lyu
Wen-gang Zhou
Houqiang Li
Yanfeng Wang
Qi Tian
226
4
0
09 Jun 2023
ADDP: Learning General Representations for Image Recognition and
  Generation with Alternating Denoising Diffusion Process
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion ProcessInternational Conference on Learning Representations (ICLR), 2023
Changyao Tian
Chenxin Tao
Jifeng Dai
Hao Li
Ziheng Li
Lewei Lu
Xiaogang Wang
Jiaming Song
Gao Huang
Xizhou Zhu
DiffM
192
16
0
08 Jun 2023
R-MAE: Regions Meet Masked Autoencoders
R-MAE: Regions Meet Masked AutoencodersInternational Conference on Learning Representations (ICLR), 2023
Duy-Kien Nguyen
Vaibhav Aggarwal
Yanghao Li
Martin R. Oswald
Alexander Kirillov
Cees G. M. Snoek
Xinlei Chen
TPM
293
16
0
08 Jun 2023
Asymmetric Patch Sampling for Contrastive Learning
Asymmetric Patch Sampling for Contrastive LearningPattern Recognition (Pattern Recogn.), 2023
Chen Shen
Jianzhong Chen
Shu Wang
Hulin Kuang
Jin Liu
Jianxin Wang
SSL
243
7
0
05 Jun 2023
rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for
  Remote Physiological Measurement
rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological MeasurementIEEE transactions on multimedia (IEEE TMM), 2023
Xin Liu
Yuting Zhang
Zitong Yu
Hao Lu
Huanjing Yue
Jingyu Yang
218
53
0
04 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and
  Outlook of Recent Work
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
300
1
0
02 Jun 2023
Masked Autoencoder for Unsupervised Video Summarization
Masked Autoencoder for Unsupervised Video Summarization
Minho Shim
Taeoh Kim
Jinhyung Kim
Dongyoon Wee
169
3
0
02 Jun 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesInternational Conference on Machine Learning (ICML), 2023
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
305
301
0
01 Jun 2023
Masked Autoencoders with Multi-Window Local-Global Attention Are Better
  Audio Learners
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio LearnersInternational Conference on Learning Representations (ICLR), 2023
Sarthak Yadav
Sergios Theodoridis
Lars Kai Hansen
Zheng-Hua Tan
251
14
0
01 Jun 2023
A Novel Driver Distraction Behavior Detection Method Based on
  Self-supervised Learning with Masked Image Modeling
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image ModelingIEEE Internet of Things Journal (IEEE IoT J.), 2023
Yingzhi Zhang
Taiguo Li
Chong Li
Xinghong Zhou
381
15
0
01 Jun 2023
MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
MiniSUPERB: Lightweight Benchmark for Self-supervised Speech ModelsAutomatic Speech Recognition & Understanding (ASRU), 2023
Yu-Hsiang Wang
Huan Chen
Kai-Wei Chang
Winston H. Hsu
Hung-yi Lee
450
7
0
30 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical
  Invariance
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Xiyang Dai
Lu Yuan
Zicheng Liu
Youzuo Lin
227
2
0
25 May 2023
RoMa: Robust Dense Feature Matching
RoMa: Robust Dense Feature MatchingComputer Vision and Pattern Recognition (CVPR), 2023
Johan Edstedt
Qiyu Sun
Georg Bökman
Maarten Wadenback
Michael Felsberg
3DV
350
228
0
24 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative
  and Discriminative Training
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
259
50
0
23 May 2023
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis
Muhammad Abdullah Jamal
Omid Mohareri
180
8
0
19 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited
  Modalities
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLMMLLMObjD
582
154
0
18 May 2023
A Survey on Time-Series Pre-Trained Models
A Survey on Time-Series Pre-Trained ModelsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Qianli Ma
Ziqiang Liu
Zhenjing Zheng
Ziyang Huang
Siying Zhu
Zhongzhong Yu
James T. Kwok
AI4TS
282
88
0
18 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point
  Cloud Pre-Training
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2023
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPCViT
133
49
0
15 May 2023
Medical supervised masked autoencoders: Crafting a better masking
  strategy and efficient fine-tuning schedule for medical image classification
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Jia-ju Mao
Shu-Hua Guo
Yuan Chang
Xuesong Yin
Binling Nie
184
3
0
10 May 2023
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene
  Understanding
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Li Jiang
Zetong Yang
Shaoshuai Shi
Vladislav Golyanik
Dengxin Dai
Bernt Schiele
3DPC
250
13
0
08 May 2023
Annotation-efficient learning for OCT segmentation
Annotation-efficient learning for OCT segmentationBiomedical Optics Express (BOE), 2023
Haoran Zhang
Jianlong Yang
Ce Zheng
Shiqing Zhao
Aili Zhang
MedIm
542
9
0
06 May 2023
What Do Self-Supervised Vision Transformers Learn?
What Do Self-Supervised Vision Transformers Learn?International Conference on Learning Representations (ICLR), 2023
Namuk Park
Wonjae Kim
Byeongho Heo
Taekyung Kim
Sangdoo Yun
SSL
300
103
1
01 May 2023
Improve Video Representation with Temporal Adversarial Augmentation
Improve Video Representation with Temporal Adversarial AugmentationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Jinhao Duan
Quanfu Fan
Hao-Ran Cheng
Xiaoshuang Shi
Kaidi Xu
AAMLAI4TSViT
238
3
0
28 Apr 2023
Self-Supervised Multi-Object Tracking For Autonomous Driving From
  Consistency Across Timescales
Self-Supervised Multi-Object Tracking For Autonomous Driving From Consistency Across TimescalesIEEE Robotics and Automation Letters (RA-L), 2023
Christopher Lang
Alexander Braun
Lars Schillingmann
Abhinav Valada
181
9
0
25 Apr 2023
Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Heng Pan
Chenyang Liu
Wenxiao Wang
Liejie Yuan
Hongfa Wang
Zhifeng Li
Wen Liu
VLM
142
3
0
25 Apr 2023
Self-supervised Learning by View Synthesis
Self-supervised Learning by View Synthesis
Shaoteng Liu
Xiangyu Zhang
T. Hu
Jiaya Jia
3DVViT
165
1
0
22 Apr 2023
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image
  Segmentation
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Wenxuan Wang
Jing Wang
Chen Chen
Jianbo Jiao
Yuanxiu Cai
Shanshan Song
Jiangyun Li
MedIm
453
29
0
21 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Transformer-Based Visual Segmentation: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViTMedIm
370
247
0
19 Apr 2023
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing
  Image Understanding
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image UnderstandingIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Dilxat Muhtar
Xue-liang Zhang
Pengfeng Xiao
Zhenshi Li
Feng-Xue Gu
SSL
296
94
0
19 Apr 2023
Efficient Video Action Detection with Token Dropout and Context
  Refinement
Efficient Video Action Detection with Token Dropout and Context RefinementIEEE International Conference on Computer Vision (ICCV), 2023
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
305
25
0
17 Apr 2023
3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud
  Pretraining
3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud PretrainingInternational Conference on Learning Representations (ICLR), 2023
Siming Yan
Yu-Qi Yang
Yu-Xiao Guo
Hao Pan
Peng-shuai Wang
Xin Tong
Yang Liu
Qi-Xing Huang
3DPC
255
21
0
14 Apr 2023
Hard Patches Mining for Masked Image Modeling
Hard Patches Mining for Masked Image ModelingComputer Vision and Pattern Recognition (CVPR), 2023
Haochen Wang
Kaiyou Song
Junsong Fan
Yuxi Wang
Jin Xie
Zhaoxiang Zhang
209
81
0
12 Apr 2023
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph LearnerThe Web Conference (WWW), 2023
Zhenyu Hou
Yufei He
Yukuo Cen
Xiao Liu
Yuxiao Dong
Evgeny Kharlamov
Jie Tang
SSL
176
153
0
10 Apr 2023
Diffusion Models as Masked Autoencoders
Diffusion Models as Masked AutoencodersIEEE International Conference on Computer Vision (ICCV), 2023
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffMSyDa
196
75
0
06 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
On the Benefits of 3D Pose and Tracking for Human Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2023
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
400
45
0
03 Apr 2023
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D
  Human Pose Estimation
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose EstimationComputer Vision and Pattern Recognition (CVPR), 2023
Qi-jun Zhao
Ce Zheng
Mengyuan Liu
Pichao Wang
Chong Chen
ViT
176
164
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Complementary Random Masking for RGB-Thermal Semantic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2023
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
144
41
0
30 Mar 2023
ISSTAD: Incremental Self-Supervised Learning Based on Transformer for
  Anomaly Detection and Localization
ISSTAD: Incremental Self-Supervised Learning Based on Transformer for Anomaly Detection and LocalizationEngineering applications of artificial intelligence (Eng. Appl. Artif. Intell.), 2023
WenPing Jin
Fei-Yu Guo
Li Zhu
ViTMedIm
336
5
0
30 Mar 2023
Mixed Autoencoder for Self-supervised Visual Representation Learning
Mixed Autoencoder for Self-supervised Visual Representation LearningComputer Vision and Pattern Recognition (CVPR), 2023
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
SSL
357
52
0
30 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
VideoMAE V2: Scaling Video Masked Autoencoders with Dual MaskingComputer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
379
533
0
29 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Unmasked Teacher: Towards Training-Efficient Video Foundation ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
528
237
0
28 Mar 2023
Previous
123...1056789
Next