Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.02882
Cited By
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
5 April 2019
Heiga Zen
Viet Dang
R. Clark
Yu Zhang
Ron J. Weiss
Ye Jia
Zhiwen Chen
Yonghui Wu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech"
17 / 617 papers shown
Title
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuanbin Cao
Heiga Zen
Yonghui Wu
56
130
0
06 Feb 2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
DiffM
95
93
0
06 Feb 2020
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization
Henry B. Moss
Vatsal Aggarwal
N. Prateek
Javier I. González
Roberto Barra-Chicote
BDL
51
57
0
04 Feb 2020
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Nick Rossenbach
Albert Zeyer
Ralf Schluter
Hermann Ney
91
84
0
19 Dec 2019
Developing a Multi-Platform Speech Recording System Toward Open Service of Building Large-Scale Speech Corpora
Keita Ishizuka
Takashi Nose
13
0
0
19 Dec 2019
Towards Robust Neural Vocoding for Speech Generation: A Survey
Po-Chun Hsu
Chun-hsuan Wang
Andy T. Liu
Hung-yi Lee
OOD
78
25
0
05 Dec 2019
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement
Zhong-Qiu Wang
Hakan Erdogan
Scott Wisdom
K. Wilson
Desh Raj
Shinji Watanabe
Zhuo Chen
J. Hershey
51
1
0
18 Nov 2019
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens
Rafael Valle
Jason Chun Lok Li
R. Prenger
Bryan Catanzaro
74
149
0
26 Oct 2019
Unsupervised Feature Enhancement for speaker verification
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Leibny Paola García-Perera
Najim Dehak
69
18
0
25 Oct 2019
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs
P. S. Nidadavolu
Saurabh Kataria
Jesús Villalba
Najim Dehak
60
24
0
25 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Tomoki Hayashi
Ryuichi Yamamoto
Katsuki Inoue
Takenori Yoshimura
Shinji Watanabe
Tomoki Toda
K. Takeda
Yu Zhang
Xu Tan
VLM
93
205
0
24 Oct 2019
Semi-Supervised Generative Modeling for Controllable Speech Synthesis
Raza Habib
Soroosh Mariooryad
Matt Shannon
Eric Battenberg
RJ Skerry-Ryan
Daisy Stanton
David Kao
Tom Bagby
BDL
65
48
0
03 Oct 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations
Aarne Talman
Antti Suni
H. Çelikkanat
Sofoklis Kakouros
Jörg Tiedemann
M. Vainio
69
31
0
06 Aug 2019
Multi-Speaker End-to-End Speech Synthesis
Jihyun Park
Kexin Zhao
Kainan Peng
Ming-Yu Liu
SyDa
66
19
0
09 Jul 2019
Speech bandwidth extension with WaveNet
Archit Gupta
Brendan Shillingford
Yannis Assael
Thomas C. Walters
60
29
0
05 Jul 2019
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach
Noé Tits
37
10
0
05 Jul 2019
TTS Skins: Speaker Conversion via ASR
Adam Polyak
Lior Wolf
Yaniv Taigman
67
28
0
18 Apr 2019
Previous
1
2
3
...
11
12
13