OverFlow: Putting flows on top of neural transducers for better TTS

OverFlow: Putting flows on top of neural transducers for better TTS

13 November 2022

Ambika Kirkland

Papers citing "OverFlow: Putting flows on top of neural transducers for better TTS"

11 / 11 papers shown

Title
A Language Modeling Approach to Diacritic-Free Hebrew TTS Amit Roth A. Turetzky Yossi Adi 27 2 0 16 Jul 2024
Source Tracing of Audio Deepfake Systems Nicholas Klein Tianxiang Chen Hemlata Tak Ricardo Casal Elie Khoury 19 4 0 10 Jul 2024
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech Shivam Mehta Harm Lameris Rajiv Punmiya Jonas Beskow Éva Székely G. Henter 23 1 0 08 Jun 2024
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis Shivam Mehta Anna Deichler Jim O'Regan Birger Moëll Jonas Beskow G. Henter Simon Alexanderson 34 4 0 30 Apr 2024
MLAAD: The Multi-Language Audio Anti-Spoofing Dataset Nicolas M. Muller Piotr Kawa Wei Herng Choong Edresson Casanova Eren Golge Thorsten Muller P. Syga Philip Sperl Konstantin Böttinger 12 35 0 17 Jan 2024
Unified speech and gesture synthesis using flow matching Shivam Mehta Ruibo Tu Simon Alexanderson Jonas Beskow Éva Székely G. Henter 22 3 0 08 Oct 2023
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset Ze Liu 17 0 0 08 Oct 2023
Matcha-TTS: A fast TTS architecture with conditional flow matching Shivam Mehta Ruibo Tu Jonas Beskow Éva Székely G. Henter 14 68 0 06 Sep 2023
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis Shivam Mehta Siyang Wang Simon Alexanderson Jonas Beskow Éva Székely G. Henter DiffM 24 14 0 15 Jun 2023
Robust Classification using Hidden Markov Models and Mixtures of Normalizing Flows Anubhab Ghosh Antoine Honoré Dong Liu G. Henter S. Chatterjee BDL VLM 16 7 0 15 Feb 2021
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Yu Zhang James Qin Daniel S. Park Wei Han Chung-Cheng Chiu Ruoming Pang Quoc V. Le Yonghui Wu VLM SSL 136 307 0 20 Oct 2020