2310.03937
Cited By
Diffusion Models as Masked Audio-Video Learners
5 October 2023
Elvis Nunez, Yanzi Jin, Mohammad Rastegari, Sachin Mehta, Maxwell Horton
Papers citing "Diffusion Models as Masked Audio-Video Learners"
2 / 2 papers shown
Title
Masked Autoencoders Are Scalable Vision Learners
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta, Mohammad Rastegari
ViT
189
1,148
0
05 Oct 2021