Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.08024
Cited By
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
16 July 2022
Sumanth Gurram
An Fang
David M. Chan
John F. Canny
VLM
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training"
4 / 4 papers shown
Title
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
231
573
0
22 Apr 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,939
0
09 Feb 2021
Self-supervised Co-training for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
198
304
0
19 Oct 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
192
204
0
23 Jan 2020
1