arXiv: 1811.06315
Effect of data reduction on sequence-to-sequence neural TTS
15 November 2018
Javier Latorre
Jakub Lachowicz
Jaime Lorenzo-Trueba
Thomas Merritt
Thomas Drugman
S. Ronanki
Viacheslav Klimkov

Papers citing "Effect of data reduction on sequence-to-sequence neural TTS"

24 / 24 papers shown
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
Ariadna Sánchez
Alessio Falai
Ziyao Zhang
Orazio Angelini
K. Yanagisawa
90
7
0
04 Jul 2022
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS)
Ziyao Zhang
Alessio Falai
Ariadna Sánchez
Orazio Angelini
K. Yanagisawa
56
4
0
04 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
B. Kostek
62
28
0
02 Jul 2022
Parallel Synthesis for Autoregressive Speech Generation
Po-Chun Hsu
Da-Rong Liu
Andy T. Liu
Hung-yi Lee
80
5
0
25 Apr 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module
Adam Gabryś
Goeric Huybrechts
M. Ribeiro
C. Chien
Julian Roth
Giulia Comini
Roberto Barra-Chicote
Bartek Perz
Jaime Lorenzo-Trueba
80
21
0
16 Feb 2022
Distribution augmentation for low-resource expressive text-to-speech
Mateusz Lajszczak
Animesh Prasad
Arent van Korlaar
Bajibabu Bollepalli
Antonio Bonafonte
...
M. Nicolis
Alexis Moinet
Thomas Drugman
Trevor Wood
Elena Sokolova
61
7
0
13 Feb 2022
Machine Translation Verbosity Control for Automatic Dubbing
Surafel Melaku Lakew
Marcello Federico
Yue Wang
Cuong Hoang
Yogesh Virkar
Roberto Barra-Chicote
Robert Enyedi
63
24
0
08 Oct 2021
Combining speakers of multiple languages to improve quality of neural voices
Javier Latorre
Charlotte Bailleul
Tuuli H. Morrill
Alistair Conkie
Y. Stylianou
64
8
0
17 Aug 2021
Enhancing audio quality for expressive Neural Text-to-Speech
Abdelhamid Ezzerg
Adam Gabryś
Bartosz Putrycz
Daniel Korzekwa
Daniel Sáez-Trigueros
David McHardy
Kamil Pokora
Jakub Lachowicz
Jaime Lorenzo-Trueba
V. Klimkov
132
6
0
13 Aug 2021
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Raahil Shah
Kamil Pokora
Abdelhamid Ezzerg
V. Klimkov
Goeric Huybrechts
Bartosz Putrycz
Daniel Korzekwa
Thomas Merritt
64
26
0
24 Jun 2021
Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache (Speech Synthesis: State of the Art in English and German)
René Peinl
48
0
0
11 Jun 2021
EmoCat: Language-agnostic Emotional Voice Conversion
Bastian Schnell
Goeric Huybrechts
Bartek Perz
Thomas Drugman
Jaime Lorenzo-Trueba
89
11
0
14 Jan 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Daniel Korzekwa
Roberto Barra-Chicote
Szymon Zaporowski
Grzegorz Beringer
Jaime Lorenzo-Trueba
Alicja Serafinowicz
J. Droppo
Thomas Drugman
B. Kostek
51
9
0
29 Dec 2020
Low-resource expressive text-to-speech using data augmentation
Goeric Huybrechts
Thomas Merritt
Giulia Comini
Bartek Perz
Raahil Shah
Jaime Lorenzo-Trueba
68
53
0
11 Nov 2020
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech
Yeunju Choi
Youngmoon Jung
Youngjoo Suh
Hoirin Kim
129
6
0
02 Nov 2020
Efficient neural speech synthesis for low-resource languages through multilingual modeling
M. D. Korte
Jaebok Kim
E. Klabbers
59
19
0
20 Aug 2020
From Speech-to-Speech Translation to Automatic Dubbing
Marcello Federico
Robert Enyedi
Roberto Barra-Chicote
Ritwik Giri
Umut Isik
A. Krishnaswamy
Hassan Sawaf
106
43
0
19 Jan 2020
Singing Synthesis: with a little help from my attention
Orazio Angelini
Alexis Moinet
K. Yanagisawa
Thomas Drugman
61
17
0
12 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi
M. Nicolis
Jonas Rohnke
Thomas Drugman
Jaime Lorenzo-Trueba
77
32
0
02 Dec 2019
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Vatsal Aggarwal
Marius Cotescu
N. Prateek
Jaime Lorenzo-Trueba
Roberto Barra-Chicote
90
31
0
28 Nov 2019
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
Daniel Korzekwa
Roberto Barra-Chicote
B. Kostek
Thomas Drugman
Mateusz Lajszczak
27
20
0
10 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural text-to-speech
V. Klimkov
S. Ronanki
Jonas Rohnke
Thomas Drugman
AI4TS
89
82
0
04 Jul 2019
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
N. Prateek
Mateusz Lajszczak
Roberto Barra-Chicote
Thomas Drugman
Jaime Lorenzo-Trueba
Thomas Merritt
S. Ronanki
Trevor Wood
84
30
0
04 Apr 2019
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Hieu-Thi Luong
Xin Wang
Junichi Yamagishi
Nobuyuki Nishizawa
86
23
0
01 Apr 2019