
Effect of data reduction on sequence-to-sequence neural TTS

15 November 2018

Papers citing "Effect of data reduction on sequence-to-sequence neural TTS"

24 / 24 papers shown

Title
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS) Ariadna Sánchez Alessio Falai Ziyao Zhang Orazio Angelini K. Yanagisawa 90 7 0 04 Jul 2022
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS) Ziyao Zhang Alessio Falai Ariadna Sánchez Orazio Angelini K. Yanagisawa 56 4 0 04 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need Daniel Korzekwa Jaime Lorenzo-Trueba Thomas Drugman B. Kostek 62 28 0 02 Jul 2022
Parallel Synthesis for Autoregressive Speech Generation Po-Chun Hsu Da-Rong Liu Andy T. Liu Hung-yi Lee 80 5 0 25 Apr 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Adam Gabryś Goeric Huybrechts M. Ribeiro C. Chien Julian Roth Giulia Comini Roberto Barra-Chicote Bartek Perz Jaime Lorenzo-Trueba 80 21 0 16 Feb 2022
Distribution augmentation for low-resource expressive text-to-speech Mateusz Lajszczak Animesh Prasad Arent van Korlaar Bajibabu Bollepalli Antonio Bonafonte ... M. Nicolis Alexis Moinet Thomas Drugman Trevor Wood Elena Sokolova 61 7 0 13 Feb 2022
Machine Translation Verbosity Control for Automatic Dubbing Surafel Melaku Lakew Marcello Federico Yue Wang Cuong Hoang Yogesh Virkar Roberto Barra-Chicote Robert Enyedi 63 24 0 08 Oct 2021
Combining speakers of multiple languages to improve quality of neural voices Javier Latorre Charlotte Bailleul Tuuli H. Morrill Alistair Conkie Y. Stylianou 64 8 0 17 Aug 2021
Enhancing audio quality for expressive Neural Text-to-Speech Abdelhamid Ezzerg Adam Gabryś Bartosz Putrycz Daniel Korzekwa Daniel Sáez-Trigueros David McHardy Kamil Pokora Jakub Lachowicz Jaime Lorenzo-Trueba V. Klimkov 132 6 0 13 Aug 2021
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech Raahil Shah Kamil Pokora Abdelhamid Ezzerg V. Klimkov Goeric Huybrechts Bartosz Putrycz Daniel Korzekwa Thomas Merritt 64 26 0 24 Jun 2021
Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache René Peinl 48 0 0 11 Jun 2021
EmoCat: Language-agnostic Emotional Voice Conversion Bastian Schnell Goeric Huybrechts Bartek Perz Thomas Drugman Jaime Lorenzo-Trueba 89 11 0 14 Jan 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention Daniel Korzekwa Roberto Barra-Chicote Szymon Zaporowski Grzegorz Beringer Jaime Lorenzo-Trueba Alicja Serafinowicz J. Droppo Thomas Drugman B. Kostek 51 9 0 29 Dec 2020
Low-resource expressive text-to-speech using data augmentation Goeric Huybrechts Thomas Merritt Giulia Comini Bartek Perz Raahil Shah Jaime Lorenzo-Trueba 68 53 0 11 Nov 2020
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech Yeunju Choi Youngmoon Jung Youngjoo Suh Hoirin Kim 129 6 0 02 Nov 2020
Efficient neural speech synthesis for low-resource languages through multilingual modeling M. D. Korte Jaebok Kim E. Klabbers 59 19 0 20 Aug 2020
From Speech-to-Speech Translation to Automatic Dubbing Marcello Federico Robert Enyedi Roberto Barra-Chicote Ritwik Giri Umut Isik A. Krishnaswamy Hassan Sawaf 106 43 0 19 Jan 2020
Singing Synthesis: with a little help from my attention Orazio Angelini Alexis Moinet K. Yanagisawa Thomas Drugman 61 17 0 12 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection Shubhi Tyagi M. Nicolis Jonas Rohnke Thomas Drugman Jaime Lorenzo-Trueba 77 32 0 02 Dec 2019
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech Vatsal Aggarwal Marius Cotescu N. Prateek Jaime Lorenzo-Trueba Roberto Barra-Chicote 90 31 0 28 Nov 2019
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech Daniel Korzekwa Roberto Barra-Chicote B. Kostek Thomas Drugman Mateusz Lajszczak 27 20 0 10 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural text-to-speech V. Klimkov S. Ronanki Jonas Rohnke Thomas Drugman 89 82 0 04 Jul 2019
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data N. Prateek Mateusz Lajszczak Roberto Barra-Chicote Thomas Drugman Jaime Lorenzo-Trueba Thomas Merritt S. Ronanki Trevor Wood 84 30 0 04 Apr 2019
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora Hieu-Thi Luong Xin Wang Junichi Yamagishi Nobuyuki Nishizawa 86 23 0 01 Apr 2019