ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09133
  4. Cited By
Masked Feature Prediction for Self-Supervised Visual Pre-Training

Masked Feature Prediction for Self-Supervised Visual Pre-Training

16 December 2021
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
    ViT
ArXivPDFHTML

Papers citing "Masked Feature Prediction for Self-Supervised Visual Pre-Training"

50 / 462 papers shown
Title
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal
  Visual Object Tracking
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Xiaojun Hou
Jiazheng Xing
Yijie Qian
Yaowei Guo
Shuo Xin
...
Kai Tang
Mengmeng Wang
Zhengkai Jiang
Liang Liu
Yong-Jin Liu
28
22
0
24 Mar 2024
Edit3K: Universal Representation Learning for Video Editing Components
Edit3K: Universal Representation Learning for Video Editing Components
Xin Gu
Libo Zhang
Fan Chen
Longyin Wen
Yufei Wang
Tiejian Luo
Sijie Zhu
30
4
0
24 Mar 2024
Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive
  Segmentation
Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation
Wenlve Zhou
Zhiheng Zhou
Tianlei Wang
Delu Zeng
28
0
0
22 Mar 2024
Rethinking Multi-view Representation Learning via Distilled
  Disentangling
Rethinking Multi-view Representation Learning via Distilled Disentangling
Guanzhou Ke
Bo Wang
Xiaoli Wang
Shengfeng He
27
3
0
16 Mar 2024
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with
  Focused Masked Autoencoders
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
Soumen Basu
Mayuna Gupta
Chetan Madan
Pankaj Gupta
Chetan Arora
28
4
0
13 Mar 2024
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Han Qiu
Jiaxing Huang
Peng Gao
Lewei Lu
Xiaoqin Zhang
Shijian Lu
32
3
0
12 Mar 2024
AACP: Aesthetics assessment of children's paintings based on
  self-supervised learning
AACP: Aesthetics assessment of children's paintings based on self-supervised learning
Shiqi Jiang
Ning Li
Chen Shi
Liping Guo
Changbo Wang
Chenhui Li
20
0
0
12 Mar 2024
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Jungwon Choi
Hyungi Lee
Byung-Hoon Kim
Juho Lee
67
0
0
11 Mar 2024
Spatiotemporal Predictive Pre-training for Robotic Motor Control
Spatiotemporal Predictive Pre-training for Robotic Motor Control
Jiange Yang
Bei Liu
Jianlong Fu
Bocheng Pan
Gangshan Wu
Limin Wang
26
10
0
08 Mar 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Xiangxiang Chu
Jianlin Su
Bo-Wen Zhang
Chunhua Shen
MLLM
27
10
0
01 Mar 2024
Data-efficient Event Camera Pre-training via Disentangled Masked
  Modeling
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling
Zhenpeng Huang
Chao Li
Hao Chen
Yongjian Deng
Yifeng Geng
Limin Wang
32
2
0
01 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text
  Detection and Spotting
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
41
5
0
01 Mar 2024
A Simple yet Effective Network based on Vision Transformer for
  Camouflaged Object and Salient Object Detection
A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection
Chao Hao
Zitong Yu
Xin Liu
Jun Xu
Huanjing Yue
Jingyu Yang
ViT
26
6
0
29 Feb 2024
LSPT: Long-term Spatial Prompt Tuning for Visual Representation Learning
LSPT: Long-term Spatial Prompt Tuning for Visual Representation Learning
Shentong Mo
Yansen Wang
Xufang Luo
Dongsheng Li
VLM
25
1
0
27 Feb 2024
The Common Stability Mechanism behind most Self-Supervised Learning
  Approaches
The Common Stability Mechanism behind most Self-Supervised Learning Approaches
Abhishek Jha
Matthew B. Blaschko
Yuki M. Asano
Tinne Tuytelaars
SSL
22
1
0
22 Feb 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao
N. B. Gundavarapu
Liangzhe Yuan
Hao Zhou
Shen Yan
...
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Ting Liu
Boqing Gong
VGen
27
29
0
20 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene
  Understanding: From Learning Paradigm Perspectives
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei-Neng Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
47
13
0
05 Feb 2024
MLIP: Enhancing Medical Visual Representation with Divergence Encoder
  and Knowledge-guided Contrastive Learning
MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning
Zhe Li
Laurence T. Yang
Bocheng Ren
Xin Nie
Zhangyang Gao
Cheng Tan
Stan Z. Li
VLM
10
11
0
03 Feb 2024
MV2MAE: Multi-View Video Masked Autoencoders
MV2MAE: Multi-View Video Masked Autoencoders
Ketul Shah
Robert Crandall
Jie Xu
Peng Zhou
Marian George
Mayank Bansal
Rama Chellappa
15
4
0
29 Jan 2024
Harmonized Spatial and Spectral Learning for Robust and Generalized
  Medical Image Segmentation
Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
Debesh Jha
Rekha Singhal
Ulas Bagci
20
3
0
18 Jan 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie M. Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
52
0
0
15 Jan 2024
Motion Guided Token Compression for Efficient Masked Video Modeling
Motion Guided Token Compression for Efficient Masked Video Modeling
Yukun Feng
Yangming Shi
Fengze Liu
Tan Yan
17
0
0
10 Jan 2024
Generic Knowledge Boosted Pre-training For Remote Sensing Images
Generic Knowledge Boosted Pre-training For Remote Sensing Images
Ziyue Huang
Mingming Zhang
Yuan Gong
Qingjie Liu
Yunhong Wang
VLM
20
14
0
09 Jan 2024
Skeleton2vec: A Self-supervised Learning Framework with Contextualized
  Target Representations for Skeleton Sequence
Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence
Ruizhuo Xu
Linzhi Huang
Mei Wang
Jiani Hu
Weihong Deng
ViT
MedIm
27
1
0
01 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision
  and Beyond
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun-Xiong Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
29
13
0
31 Dec 2023
Morphing Tokens Draw Strong Masked Image Models
Morphing Tokens Draw Strong Masked Image Models
Taekyung Kim
Byeongho Heo
Dongyoon Han
38
3
0
30 Dec 2023
Visual Point Cloud Forecasting enables Scalable Autonomous Driving
Visual Point Cloud Forecasting enables Scalable Autonomous Driving
Zetong Yang
Li Chen
Yanan Sun
Hongyang Li
3DPC
20
40
0
29 Dec 2023
Video Understanding with Large Language Models: A Survey
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Ping Luo
Jiebo Luo
Chenliang Xu
VLM
47
81
0
29 Dec 2023
Learning Vision from Models Rivals Learning Vision from Data
Learning Vision from Models Rivals Learning Vision from Data
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
6
43
0
28 Dec 2023
Bootstrap Masked Visual Modeling via Hard Patches Mining
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
34
5
0
21 Dec 2023
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual
  Test-Time Adaptation
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
22
10
0
19 Dec 2023
M-BEV: Masked BEV Perception for Robust Autonomous Driving
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
19
8
0
19 Dec 2023
DMT: Comprehensive Distillation with Multiple Self-supervised Teachers
DMT: Comprehensive Distillation with Multiple Self-supervised Teachers
Yuang Liu
Jing Wang
Qiang-feng Zhou
Fan Wang
Jun Wang
Wei Zhang
11
0
0
19 Dec 2023
Semantic-Aware Autoregressive Image Modeling for Visual Representation
  Learning
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning
Kaiyou Song
Shan Zhang
Tong Wang
VLM
6
2
0
16 Dec 2023
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation
  Learning
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning
Weijie Wei
F. Karimi Nejadasl
Theo Gevers
Martin R. Oswald
3DPC
20
3
0
15 Dec 2023
PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for
  Infrared Images
PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images
Tao Zhang
Kun Ding
Jinyong Wen
Yu Xiong
Zeyu Zhang
Shiming Xiang
Chunhong Pan
14
3
0
13 Dec 2023
LMD: Faster Image Reconstruction with Latent Masking Diffusion
LMD: Faster Image Reconstruction with Latent Masking Diffusion
Zhiyuan Ma
Zhihuan Yu
Jianjun Li
Bowen Zhou
DiffM
11
8
0
13 Dec 2023
4M: Massively Multimodal Masked Modeling
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
28
62
0
11 Dec 2023
Cross-BERT for Point Cloud Pretraining
Cross-BERT for Point Cloud Pretraining
Xin Li
Peng Li
Zeyong Wei
Zhe Zhu
Mingqiang Wei
Junhui Hou
Liangliang Nan
J. Qin
H. Xie
F. Wang
SSL
3DPC
15
0
0
08 Dec 2023
Towards Context-Stable and Visual-Consistent Image Inpainting
Towards Context-Stable and Visual-Consistent Image Inpainting
Yikai Wang
Chenjie Cao
Yanwei Fu
DiffM
27
2
0
08 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
31
0
0
08 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
Rejuvenating image-GPT as Strong Visual Representation Learners
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Alan L. Yuille
Cihang Xie
VLM
34
7
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan L. Yuille
VLM
19
54
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
13
11
0
03 Dec 2023
Local Masking Meets Progressive Freezing: Crafting Efficient Vision
  Transformers for Self-Supervised Learning
Local Masking Meets Progressive Freezing: Crafting Efficient Vision Transformers for Self-Supervised Learning
Utku Mert Topcuoglu
Erdem Akagündüz
27
1
0
02 Dec 2023
Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense
  Interactions through Masked Modeling
Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling
Shentong Mo
Pedro Morgado
19
13
0
02 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSL
ViT
14
2
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment
  Anything
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
21
136
0
01 Dec 2023
A-JEPA: Joint-Embedding Predictive Architecture Can Listen
A-JEPA: Joint-Embedding Predictive Architecture Can Listen
Zhengcong Fei
Mingyuan Fan
Junshi Huang
21
17
0
27 Nov 2023
Predicting Gradient is Better: Exploring Self-Supervised Learning for
  SAR ATR with a Joint-Embedding Predictive Architecture
Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture
Wei-Jang Li
Yang Wei
Tianpeng Liu
Yuenan Hou
Yuxuan Li
Zhen Liu
Yongxiang Liu
Li Liu
13
17
0
26 Nov 2023
Previous
123456...8910
Next