LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech

5 April 2019

ArXiv (abs)PDF HTML

Papers citing "LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech"

17 / 617 papers shown

Title
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis Guangzhi Sun Yu Zhang Ron J. Weiss Yuanbin Cao Heiga Zen Yonghui Wu 56 130 0 06 Feb 2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior Guangzhi Sun Yu Zhang Ron J. Weiss Yuan Cao Heiga Zen Andrew Rosenberg Bhuvana Ramabhadran Yonghui Wu DiffM 95 93 0 06 Feb 2020
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization Henry B. Moss Vatsal Aggarwal N. Prateek Javier I. González Roberto Barra-Chicote BDL 51 57 0 04 Feb 2020
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems Nick Rossenbach Albert Zeyer Ralf Schluter Hermann Ney 91 84 0 19 Dec 2019
Developing a Multi-Platform Speech Recording System Toward Open Service of Building Large-Scale Speech Corpora Keita Ishizuka Takashi Nose 13 0 0 19 Dec 2019
Towards Robust Neural Vocoding for Speech Generation: A Survey Po-Chun Hsu Chun-hsuan Wang Andy T. Liu Hung-yi Lee OOD 78 25 0 05 Dec 2019
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement Zhong-Qiu Wang Hakan Erdogan Scott Wisdom K. Wilson Desh Raj Shinji Watanabe Zhuo Chen J. Hershey 51 1 0 18 Nov 2019
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens Rafael Valle Jason Chun Lok Li R. Prenger Bryan Catanzaro 74 149 0 26 Oct 2019
Unsupervised Feature Enhancement for speaker verification P. S. Nidadavolu Saurabh Kataria Jesús Villalba Leibny Paola García-Perera Najim Dehak 69 18 0 25 Oct 2019
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs P. S. Nidadavolu Saurabh Kataria Jesús Villalba Najim Dehak 60 24 0 25 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit Tomoki Hayashi Ryuichi Yamamoto Katsuki Inoue Takenori Yoshimura Shinji Watanabe Tomoki Toda K. Takeda Yu Zhang Xu Tan VLM 93 205 0 24 Oct 2019
Semi-Supervised Generative Modeling for Controllable Speech Synthesis Raza Habib Soroosh Mariooryad Matt Shannon Eric Battenberg RJ Skerry-Ryan Daisy Stanton David Kao Tom Bagby BDL 65 48 0 03 Oct 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations Aarne Talman Antti Suni H. Çelikkanat Sofoklis Kakouros Jörg Tiedemann M. Vainio 69 31 0 06 Aug 2019
Multi-Speaker End-to-End Speech Synthesis Jihyun Park Kexin Zhao Kainan Peng Ming-Yu Liu SyDa 66 19 0 09 Jul 2019
Speech bandwidth extension with WaveNet Archit Gupta Brendan Shillingford Yannis Assael Thomas C. Walters 60 29 0 05 Jul 2019
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach Noé Tits 37 10 0 05 Jul 2019
TTS Skins: Speaker Conversion via ASR Adam Polyak Lior Wolf Yaniv Taigman 67 28 0 18 Apr 2019