Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech

18 October 2024

Papers citing "Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech"

2 / 2 papers shown

Title
ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability Wataru Nakata Yuma Koizumi Shigeki Karita Robin Scheibler Haruko Ishikawa Adriana Guevara-Rukoz Heiga Zen M. Bacchiani 41 0 0 08 May 2025
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech Rui Liu Shuwei He Yifan Hu H. Li VLM 87 1 0 16 Dec 2024