ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.14313
  4. Cited By
Beyond Masking: Demystifying Token-Based Pre-Training for Vision
  Transformers

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

27 March 2022
Yunjie Tian
Lingxi Xie
Jiemin Fang
Mengnan Shi
Junran Peng
Xiaopeng Zhang
Jianbin Jiao
Qi Tian
QiXiang Ye
ArXiv (abs)PDFHTMLGithub (26★)

Papers citing "Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers"

16 / 16 papers shown
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Mingze Dong
Leda Wang
Yuval Kluger
SSL
187
1
0
25 Sep 2025
Prompt-based Dynamic Token Pruning for Efficient Segmentation of Medical Images
Prompt-based Dynamic Token Pruning for Efficient Segmentation of Medical Images
Pallabi Dutta
Anubhab Maity
S. Mitra
MedIm
283
0
0
19 Jun 2025
Masked Angle-Aware Autoencoder for Remote Sensing Images
Masked Angle-Aware Autoencoder for Remote Sensing ImagesEuropean Conference on Computer Vision (ECCV), 2024
Zhihao Li
B. Hou
Siteng Ma
Zitong Wu
Xianpeng Guo
Bo Ren
Licheng Jiao
372
34
0
04 Aug 2024
Pre-training with Random Orthogonal Projection Image Modeling
Pre-training with Random Orthogonal Projection Image ModelingInternational Conference on Learning Representations (ICLR), 2023
Maryam Haghighat
Peyman Moghadam
Shaheer Mohamed
Piotr Koniusz
VLM
403
15
0
28 Oct 2023
Deblurring Masked Autoencoder is Better Recipe for Ultrasound Image
  Recognition
Deblurring Masked Autoencoder is Better Recipe for Ultrasound Image RecognitionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Qingbo Kang
Jun Gao
Kang Li
Qicheng Lao
290
15
0
14 Jun 2023
Correlational Image Modeling for Self-Supervised Visual Pre-Training
Correlational Image Modeling for Self-Supervised Visual Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2023
Wei Li
Jiahao Xie
Chen Change Loy
SSL
402
19
0
22 Mar 2023
Remote Sensing Scene Classification with Masked Image Modeling (MIM)
Remote Sensing Scene Classification with Masked Image Modeling (MIM)Remote Sensing (RS), 2023
Liya Wang
A. Tien
294
5
0
28 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial
  Defense
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial DefenseNeural Information Processing Systems (NeurIPS), 2023
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAMLVLM
551
8
0
02 Feb 2023
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2023
Liya Wang
A. Tien
487
22
0
28 Jan 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?International Conference on Learning Representations (ICLR), 2022
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
410
150
0
16 Dec 2022
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token
  Migration
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token MigrationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yunjie Tian
Lingxi Xie
Jihao Qiu
Jianbin Jiao
Yaowei Wang
Qi Tian
Qixiang Ye
ViT
251
27
0
23 Nov 2022
A Survey on Masked Autoencoder for Self-supervised Learning in Vision
  and Beyond
A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond
Chaoning Zhang
Chenshuang Zhang
Junha Song
John Seon Keun Yi
Kang Zhang
In So Kweon
SSL
315
100
0
30 Jul 2022
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Xiaosong Zhang
Yunjie Tian
Wei Huang
QiXiang Ye
Jingdong Sun
Lingxi Xie
Qi Tian
317
42
0
30 May 2022
Corrupted Image Modeling for Self-Supervised Visual Pre-Training
Corrupted Image Modeling for Self-Supervised Visual Pre-TrainingInternational Conference on Learning Representations (ICLR), 2022
Yuxin Fang
Li Dong
Hangbo Bao
Xinggang Wang
Furu Wei
396
93
0
07 Feb 2022
Context Autoencoder for Self-Supervised Representation Learning
Context Autoencoder for Self-Supervised Representation LearningInternational Journal of Computer Vision (IJCV), 2022
Xiaokang Chen
Mingyu Ding
Xiaodi Wang
Ying Xin
Shentong Mo
Yunhao Wang
Shumin Han
Ping Luo
Gang Zeng
Jingdong Wang
SSL
635
477
0
07 Feb 2022
Exploring Complicated Search Spaces with Interleaving-Free Sampling
Exploring Complicated Search Spaces with Interleaving-Free SamplingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Yunjie Tian
Lingxi Xie
Jiemin Fang
Jianbin Jiao
QiXiang Ye
Qi Tian
260
0
0
05 Dec 2021
1
Page 1 of 1