2305.03204
Cited By
VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation
4 May 2023
Xilun Chen
L. Yu
Wenhan Xiong
Barlas Oğuz
Yashar Mehdad
Anuj Kumar
VGen
Papers citing "VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation"
2 / 2 papers shown
SyncFlow: Toward Temporally Aligned Joint Audio-Video Generation from Text
Haohe Liu
Gaël Le Lan
Xinhao Mei
Zhaoheng Ni
Anurag Kumar
Varun K. Nagaraja
Wenwu Wang
Mark D. Plumbley
Yangyang Shi
Vikas Chandra
VGen
347
12
0
03 Dec 2024
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities
Computer Vision and Pattern Recognition (CVPR), 2023
A. Piergiovanni
Isaac Noble
Dahun Kim
Michael S. Ryoo
Victor Gomes
A. Angelova
375
25
0
09 Nov 2023