Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.05707
Cited By
v1
v2 (latest)
Low-resource expressive text-to-speech using data augmentation
11 November 2020
Goeric Huybrechts
Thomas Merritt
Giulia Comini
Bartek Perz
Raahil Shah
Jaime Lorenzo-Trueba
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Low-resource expressive text-to-speech using data augmentation"
28 / 28 papers shown
Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis
Zhu Li
Yuqing Zhang
Xiyuan Gao
Devraj Raghuvanshi
Nagendra Kumar
Shekhar Nayak
Matt Coler
147
1
0
18 Aug 2025
Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Lucas Ueda
Leonardo B. de M. M. Marques
Flávio O. Simões
Mário Uliani Neto
Fernando Runstein
Bianca Dal Bó
Paula D. P. Costa
258
2
0
25 Sep 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
478
116
0
12 Feb 2024
Creating New Voices using Normalizing Flows
Piotr Bilinski
Thomas Merritt
Abdelhamid Ezzerg
Kamil Pokora
Sebastian Cygert
K. Yanagisawa
Roberto Barra-Chicote
Daniel Korzekwa
272
18
0
22 Dec 2023
Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion
Anand Kamble
Aniket Tathe
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
477
8
0
24 Nov 2023
Low-Resource Text-to-Speech Using Specific Data and Noise Augmentation
European Signal Processing Conference (EUSIPCO), 2023
K. Lakshminarayana
C. Dittmar
N. Pia
Emanuel Habets
240
2
0
16 Jun 2023
Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Interspeech (Interspeech), 2023
Shijun Wang
Jón Guðnason
Damian Borth
281
5
0
09 Jun 2023
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource Languages
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Seong-Hyun Park
Myungseo Song
Bohyung Kim
Tae-Hyun Oh
198
2
0
28 Mar 2023
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data
International Conference on Mobile Ad-hoc and Sensor Networks (MSN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
179
1
0
25 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
Proceedings of the IEEE (Proc. IEEE), 2022
Andreas Triantafyllopoulos
Björn W. Schuller
Gokcce .Iymen
M. Sezgin
Xiangheng He
...
Shuo Liu
Silvan Mertes
Elisabeth André
Ruibo Fu
Jianhua Tao
308
93
0
06 Oct 2022
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation
Interspeech (Interspeech), 2022
Giulia Comini
Goeric Huybrechts
M. Ribeiro
Adam Gabry's
Jaime Lorenzo-Trueba
197
7
0
29 Jul 2022
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis
Interspeech (Interspeech), 2022
Raul Fernandez
David Haws
Guy Lorberbom
Slava Shechtman
A. Sorin
140
12
0
25 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need
Speech Communication (Speech Commun.), 2022
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
B. Kostek
208
35
0
02 Jul 2022
Automatic Evaluation of Speaker Similarity
Interspeech (Interspeech), 2022
Kamil Deja
Ariadna Sánchez
Julian Roth
Marius Cotescu
232
7
0
01 Jul 2022
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Interspeech (Interspeech), 2022
Eunwoo Song
Ryuichi Yamamoto
Ohsung Kwon
Chan Song
Min-Jae Hwang
Suhyeon Oh
Hyun-Wook Yoon
Jin-Seob Kim
Jae-Min Kim
231
9
0
30 Jun 2022
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
226
14
0
24 May 2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Interspeech (Interspeech), 2022
Ryo Terashima
Ryuichi Yamamoto
Eunwoo Song
Yuma Shirahata
Hyun-Wook Yoon
Jae-Min Kim
Kentaro Tachibana
259
20
0
21 Apr 2022
Data-augmented cross-lingual synthesis in a teacher-student framework
Interspeech (Interspeech), 2022
M. D. Korte
Jaebok Kim
A. Kunikoshi
Adaeze Adigwe
E. Klabbers
264
0
0
31 Mar 2022
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy
Interspeech (Interspeech), 2022
Shuai Guo
Jiatong Shi
Tao Qian
Shinji Watanabe
Qin Jin
322
16
0
31 Mar 2022
Text-free non-parallel many-to-many voice conversion using normalising flows
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Thomas Merritt
Abdelhamid Ezzerg
Piotr Bilinski
Magdalena Proszewska
Kamil Pokora
Roberto Barra-Chicote
Daniel Korzekwa
291
15
0
15 Mar 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Adam Gabry's
Goeric Huybrechts
M. Ribeiro
C. Chien
Julian Roth
Giulia Comini
Roberto Barra-Chicote
Bartek Perz
Jaime Lorenzo-Trueba
241
29
0
16 Feb 2022
Distribution augmentation for low-resource expressive text-to-speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Mateusz Lajszczak
Animesh Prasad
Arent van Korlaar
Bajibabu Bollepalli
Antonio Bonafonte
...
M. Nicolis
Alexis Moinet
Thomas Drugman
Trevor Wood
Elena Sokolova
217
10
0
13 Feb 2022
Cross-speaker style transfer for text-to-speech using data augmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
M. Ribeiro
Julian Roth
Giulia Comini
Goeric Huybrechts
Adam Gabry's
Jaime Lorenzo-Trueba
208
28
0
10 Feb 2022
Voice Conversion Can Improve ASR in Very Low-Resource Settings
Interspeech (Interspeech), 2021
Matthew Baas
Herman Kamper
321
21
0
04 Nov 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
468
446
0
29 Jun 2021
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Raahil Shah
Kamil Pokora
Abdelhamid Ezzerg
V. Klimkov
Goeric Huybrechts
Bartosz Putrycz
Daniel Korzekwa
Thomas Merritt
223
29
0
24 Jun 2021
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
European Signal Processing Conference (EUSIPCO), 2021
Beáta Lőrincz
Adriana Stan
M. Giurgiu
134
6
0
03 Jun 2021
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM
ALM
236
31
0
20 Apr 2021
1
Page 1 of 1