Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.16174
Cited By
Multimodal Transformer for Parallel Concatenated Variational Autoencoders
28 October 2022
Stephen D. Liang
J. Mendel
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multimodal Transformer for Parallel Concatenated Variational Autoencoders"
3 / 3 papers shown
Title
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
234
573
0
22 Apr 2021
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers
Lisa Anne Hendricks
John F. J. Mellor
R. Schneider
Jean-Baptiste Alayrac
Aida Nematzadeh
75
110
0
31 Jan 2021
1