Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.19009
Cited By
Enhancing Vision-Language Model with Unmasked Token Alignment
29 May 2024
Jihao Liu
Jinliang Zheng
Boxiao Liu
Yu Liu
Hongsheng Li
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing Vision-Language Model with Unmasked Token Alignment"
3 / 3 papers shown
Title
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
80
123
0
27 May 2022
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie
Zigang Geng
Jingcheng Hu
Zheng-Wei Zhang
Han Hu
Yue Cao
VLM
186
105
0
26 May 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
1