Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.16305
Cited By
Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model
25 April 2024
Gehui Chen
Guan’an Wang
Xiaowen Huang
Jitao Sang
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model"
3 / 3 papers shown
Title
SyncFlow: Toward Temporally Aligned Joint Audio-Video Generation from Text
Haohe Liu
Gaël Le Lan
Xinhao Mei
Zhaoheng Ni
Anurag Kumar
Varun K. Nagaraja
Wenwu Wang
Mark D. Plumbley
Yangyang Shi
Vikas Chandra
VGen
64
1
0
03 Dec 2024
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Zhiqi Huang
Dan Luo
Jun Wang
Huan Liao
Zhiheng Li
Zhiyong Wu
VGen
45
4
0
13 Sep 2024
KeyVideoLLM: Towards Large-scale Video Keyframe Selection
Hao Liang
Jiapeng Li
Tianyi Bai
Xijie Huang
Linzhuang Sun
Zhengren Wang
Conghui He
Bin Cui
Chong Chen
Wentao Zhang
VGen
29
7
0
03 Jul 2024
1