Human Motion Video Generation: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning. Computer Vision and Pattern Recognition (CVPR), 2023
Learning to Dub Movies via Hierarchical Prosody Models. Computer Vision and Pattern Recognition (CVPR), 2022
Towards Realistic Visual Dubbing with Heterogeneous Sources. ACM Multimedia (MM), 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech. Computer Vision and Pattern Recognition (CVPR), 2021