Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chao-Yuan Wu
Alan Yuille
Christoph Feichtenhofer
    ViT

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 494 papers shown
The Common Stability Mechanism behind most Self-Supervised Learning Approaches
Abhishek Jha
Matthew B. Blaschko
Yuki M. Asano
Tinne Tuytelaars
SSL
105
4
0
22 Feb 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao
N. B. Gundavarapu
Liangzhe Yuan
Hao Zhou
Shen Yan
...
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Ting Liu
Boqing Gong
VGen
345
64
0
20 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
IEEE Transactions on Intelligent Vehicles (TIV), 2024
Sheng Luo
Wei Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
327
27
0
05 Feb 2024
MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning
Zhe Li
Laurence T. Yang
Bocheng Ren
Xin Nie
Zhangyang Gao
Cheng Tan
Stan Z. Li
VLM
201
29
0
03 Feb 2024
MV2MAE: Multi-View Video Masked Autoencoders
Ketul Shah
Robert Crandall
Jie Xu
Peng Zhou
Marian George
Mayank Bansal
Rama Chellappa
223
6
0
29 Jan 2024
Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
Debesh Jha
Rekha Singhal
Ulas Bagci
186
3
0
18 Jan 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
327
2
0
15 Jan 2024
Motion Guided Token Compression for Efficient Masked Video Modeling
Yukun Feng
Yangming Shi
Fengze Liu
Tan Yan
236
0
0
10 Jan 2024
Generic Knowledge Boosted Pre-training For Remote Sensing Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Ziyue Huang
Mingming Zhang
Yuan Gong
Qingjie Liu
Yunhong Wang
VLM
157
21
0
09 Jan 2024
Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence
Ruizhuo Xu
Linzhi Huang
Mei Wang
Jiani Hu
Weihong Deng
ViT, MedIm
263
5
0
01 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
235
27
0
31 Dec 2023
Morphing Tokens Draw Strong Masked Image Models
International Conference on Learning Representations (ICLR), 2023
Taekyung Kim
Byeongho Heo
Dongyoon Han
583
3
0
30 Dec 2023
Visual Point Cloud Forecasting enables Scalable Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2023
Zetong Yang
Li Chen
Yanan Sun
Guoying Gu
3DPC
300
87
0
29 Dec 2023
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Ping Luo
Jiebo Luo
Chenliang Xu
VLM
639
161
0
29 Dec 2023
Learning Vision from Models Rivals Learning Vision from Data
Computer Vision and Pattern Recognition (CVPR), 2023
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
237
72
0
28 Dec 2023
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
183
6
0
21 Dec 2023
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
205
26
0
19 Dec 2023
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
254
18
0
19 Dec 2023
DMT: Comprehensive Distillation with Multiple Self-supervised Teachers
Yuang Liu
Jing Wang
Qiang-feng Zhou
Fan Wang
Jun Wang
Wei Zhang
116
1
0
19 Dec 2023
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Kaiyou Song
Shan Zhang
Tong Wang
VLM
138
2
0
16 Dec 2023
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning
European Conference on Computer Vision (ECCV), 2023
Weijie Wei
Fatemeh Karimi Nejadasl
Theo Gevers
Martin R. Oswald
3DPC
232
9
0
15 Dec 2023
PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images
Tao Zhang
Kun Ding
Jinyong Wen
Yu Xiong
Zeyu Zhang
Shiming Xiang
Chunhong Pan
145
4
0
13 Dec 2023
LMD: Faster Image Reconstruction with Latent Masking Diffusion
AAAI Conference on Artificial Intelligence (AAAI), 2023
Zhiyuan Ma
Zhihuan Yu
Jianjun Li
Bowen Zhou
DiffM
167
13
0
13 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
242
106
0
11 Dec 2023
Cross-BERT for Point Cloud Pretraining
Xin Li
Peng Li
Zeyong Wei
Zhe Zhu
Mingqiang Wei
Junhui Hou
Liangliang Nan
J. Qin
H. Xie
F. Wang
SSL, 3DPC
139
2
0
08 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
490
6
0
08 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
International Conference on Machine Learning (ICML), 2023
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Yaoyao Liu
Cihang Xie
VLM
246
11
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
European Conference on Computer Vision (ECCV), 2023
Feng Wang
Jieru Mei
Yaoyao Liu
VLM
329
116
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
Computer Vision and Pattern Recognition (CVPR), 2023
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
243
22
0
03 Dec 2023
Local Masking Meets Progressive Freezing: Crafting Efficient Vision Transformers for Self-Supervised Learning
International Conference on Machine Vision (ICMV), 2023
Utku Mert Topcuoglu
Erdem Akagündüz
218
2
0
02 Dec 2023
Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling
Computer Vision and Pattern Recognition (CVPR), 2023
Shentong Mo
Pedro Morgado
234
29
0
02 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSL, ViT
236
3
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Computer Vision and Pattern Recognition (CVPR), 2023
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
280
230
0
01 Dec 2023
A-JEPA: Joint-Embedding Predictive Architecture Can Listen
Zhengcong Fei
Mingyuan Fan
Junshi Huang
315
31
0
27 Nov 2023
Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture
ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS J. Photogramm. Remote Sens.), 2023
Wei-Jang Li
Yang Wei
Tianpeng Liu
Yuenan Hou
Yuxuan Li
Zhen Liu
Yongxiang Liu
Li Liu
421
55
0
26 Nov 2023
Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation
Paul Engstler
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
SSL
156
7
0
24 Nov 2023
Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder
Xiaohao Xu
316
0
0
23 Nov 2023
Pair-wise Layer Attention with Spatial Masking for Video Prediction
Ping Li
Chenhan Zhang
Zheng Yang
Xianghua Xu
Mingli Song
192
0
0
19 Nov 2023
From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning
Jiansong Zhang
Linlin Shen
Peizhong Liu
SSL
204
0
0
16 Nov 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings
International Conference on Learning Representations (ICLR), 2023
Daniel Bolya
Chaitanya K. Ryali
Judy Hoffman
Christoph Feichtenhofer
184
16
0
09 Nov 2023
Learning Discriminative Features for Crowd Counting
Yuehai Chen
Qingzhong Wang
Jing Yang
Badong Chen
Haoyi Xiong
Shaoyi Du
187
14
0
08 Nov 2023
OmniVec: Learning robust representations with cross modal sharing
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Siddharth Srivastava
Gaurav Sharma
SSL
228
81
0
07 Nov 2023
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhiyu Zhao
Bingkun Huang
Sen Xing
Gangshan Wu
Yu Qiao
Limin Wang
164
11
0
06 Nov 2023
ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Xing Di
Yiyu Zheng
Xiaoming Liu
Yu Cheng
232
6
0
03 Nov 2023
Concatenated Masked Autoencoders as Spatial-Temporal Learner
Zhouqiang Jiang
Bowen Wang
Tong Xiang
Zhaofeng Niu
Hong Tang
Guangshun Li
Liangzhi Li
143
4
0
02 Nov 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Neural Information Processing Systems (NeurIPS), 2023
Junkun Yuan
Xinyu Zhang
Hao Zhou
Jian Wang
Zhongwei Qiu
...
Junyu Han
Errui Ding
Lanfen Lin
Leilei Gan
Jingdong Wang
186
28
0
31 Oct 2023
Pre-training with Random Orthogonal Projection Image Modeling
International Conference on Learning Representations (ICLR), 2023
Maryam Haghighat
Peyman Moghadam
Shaheer Mohamed
Piotr Koniusz
VLM
281
13
0
28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS), 2023
Yi Wang
Hugo Hernández Hernández
C. Albrecht
Xiao Xiang Zhu
216
52
0
28 Oct 2023
SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Mengcheng Lan
Xinjiang Wang
Yiping Ke
Jiaxing Xu
Xue Jiang
Wayne Zhang
188
18
0
27 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
219
0
0
25 Oct 2023