ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.05707
  4. Cited By
Low-resource expressive text-to-speech using data augmentation
v1v2 (latest)

Low-resource expressive text-to-speech using data augmentation

11 November 2020
Goeric Huybrechts
Thomas Merritt
Giulia Comini
Bartek Perz
Raahil Shah
Jaime Lorenzo-Trueba
ArXiv (abs)PDFHTML

Papers citing "Low-resource expressive text-to-speech using data augmentation"

28 / 28 papers shown
Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis
Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis
Zhu Li
Yuqing Zhang
Xiyuan Gao
Devraj Raghuvanshi
Nagendra Kumar
Shekhar Nayak
Matt Coler
147
1
0
18 Aug 2025
Exploring synthetic data for cross-speaker style transfer in style
  representation based TTS
Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Lucas Ueda
Leonardo B. de M. M. Marques
Flávio O. Simões
Mário Uliani Neto
Fernando Runstein
Bianca Dal Bó
Paula D. P. Costa
258
2
0
25 Sep 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model
  on 100K hours of data
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
478
116
0
12 Feb 2024
Creating New Voices using Normalizing Flows
Creating New Voices using Normalizing Flows
Piotr Bilinski
Thomas Merritt
Abdelhamid Ezzerg
Kamil Pokora
Sebastian Cygert
K. Yanagisawa
Roberto Barra-Chicote
Daniel Korzekwa
272
18
0
22 Dec 2023
Custom Data Augmentation for low resource ASR using Bark and
  Retrieval-Based Voice Conversion
Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion
Anand Kamble
Aniket Tathe
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
477
8
0
24 Nov 2023
Low-Resource Text-to-Speech Using Specific Data and Noise Augmentation
Low-Resource Text-to-Speech Using Specific Data and Noise AugmentationEuropean Signal Processing Conference (EUSIPCO), 2023
K. Lakshminarayana
C. Dittmar
N. Pia
Emanuel Habets
240
2
0
16 Jun 2023
Learning Emotional Representations from Imbalanced Speech Data for
  Speech Emotion Recognition and Emotional Text-to-Speech
Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-SpeechInterspeech (Interspeech), 2023
Shijun Wang
Jón Guðnason
Damian Borth
281
5
0
09 Jun 2023
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low
  Resource Languages
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource LanguagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Seong-Hyun Park
Myungseo Song
Bohyung Kim
Tae-Hyun Oh
198
2
0
28 Mar 2023
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch
  Disentangling with Untranscribed Data
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed DataInternational Conference on Mobile Ad-hoc and Sensor Networks (MSN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
179
1
0
25 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep
  Learning Era
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning EraProceedings of the IEEE (Proc. IEEE), 2022
Andreas Triantafyllopoulos
Björn W. Schuller
Gokcce .Iymen
M. Sezgin
Xiangheng He
...
Shuo Liu
Silvan Mertes
Elisabeth André
Ruibo Fu
Jianhua Tao
308
93
0
06 Oct 2022
Low-data? No problem: low-resource, language-agnostic conversational
  text-to-speech via F0-conditioned data augmentation
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentationInterspeech (Interspeech), 2022
Giulia Comini
Goeric Huybrechts
M. Ribeiro
Adam Gabry's
Jaime Lorenzo-Trueba
197
7
0
29 Jul 2022
Transplantation of Conversational Speaking Style with Interjections in
  Sequence-to-Sequence Speech Synthesis
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech SynthesisInterspeech (Interspeech), 2022
Raul Fernandez
David Haws
Guy Lorberbom
Slava Shechtman
A. Sorin
140
12
0
25 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost
  all you need
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you needSpeech Communication (Speech Commun.), 2022
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
B. Kostek
208
35
0
02 Jul 2022
Automatic Evaluation of Speaker Similarity
Automatic Evaluation of Speaker SimilarityInterspeech (Interspeech), 2022
Kamil Deja
Ariadna Sánchez
Julian Roth
Marius Cotescu
232
7
0
01 Jul 2022
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis
  using ranking support vector machine with variational autoencoder
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoderInterspeech (Interspeech), 2022
Eunwoo Song
Ryuichi Yamamoto
Ohsung Kwon
Chan Song
Min-Jae Hwang
Suhyeon Oh
Hyun-Wook Yoon
Jin-Seob Kim
Jae-Min Kim
231
9
0
30 Jun 2022
TDASS: Target Domain Adaptation Speech Synthesis Framework for
  Multi-speaker Low-Resource TTS
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTSIEEE International Joint Conference on Neural Network (IJCNN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
226
14
0
24 May 2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using
  Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data AugmentationInterspeech (Interspeech), 2022
Ryo Terashima
Ryuichi Yamamoto
Eunwoo Song
Yuma Shirahata
Hyun-Wook Yoon
Jae-Min Kim
Kentaro Tachibana
259
20
0
21 Apr 2022
Data-augmented cross-lingual synthesis in a teacher-student framework
Data-augmented cross-lingual synthesis in a teacher-student frameworkInterspeech (Interspeech), 2022
M. D. Korte
Jaebok Kim
A. Kunikoshi
Adaeze Adigwe
E. Klabbers
264
0
0
31 Mar 2022
SingAug: Data Augmentation for Singing Voice Synthesis with
  Cycle-consistent Training Strategy
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training StrategyInterspeech (Interspeech), 2022
Shuai Guo
Jiatong Shi
Tao Qian
Shinji Watanabe
Qin Jin
322
16
0
31 Mar 2022
Text-free non-parallel many-to-many voice conversion using normalising
  flows
Text-free non-parallel many-to-many voice conversion using normalising flowsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Thomas Merritt
Abdelhamid Ezzerg
Piotr Bilinski
Magdalena Proszewska
Kamil Pokora
Roberto Barra-Chicote
Daniel Korzekwa
291
15
0
15 Mar 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice
  conversion as a post-processing module
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing moduleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Adam Gabry's
Goeric Huybrechts
M. Ribeiro
C. Chien
Julian Roth
Giulia Comini
Roberto Barra-Chicote
Bartek Perz
Jaime Lorenzo-Trueba
241
29
0
16 Feb 2022
Distribution augmentation for low-resource expressive text-to-speech
Distribution augmentation for low-resource expressive text-to-speechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Mateusz Lajszczak
Animesh Prasad
Arent van Korlaar
Bajibabu Bollepalli
Antonio Bonafonte
...
M. Nicolis
Alexis Moinet
Thomas Drugman
Trevor Wood
Elena Sokolova
217
10
0
13 Feb 2022
Cross-speaker style transfer for text-to-speech using data augmentation
Cross-speaker style transfer for text-to-speech using data augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
M. Ribeiro
Julian Roth
Giulia Comini
Goeric Huybrechts
Adam Gabry's
Jaime Lorenzo-Trueba
208
28
0
10 Feb 2022
Voice Conversion Can Improve ASR in Very Low-Resource Settings
Voice Conversion Can Improve ASR in Very Low-Resource SettingsInterspeech (Interspeech), 2021
Matthew Baas
Herman Kamper
321
21
0
04 Nov 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
468
446
0
29 Jun 2021
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource
  Highly Expressive Speech
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Raahil Shah
Kamil Pokora
Abdelhamid Ezzerg
V. Klimkov
Goeric Huybrechts
Bartosz Putrycz
Daniel Korzekwa
Thomas Merritt
223
29
0
24 Jun 2021
Speaker verification-derived loss and data augmentation for DNN-based
  multispeaker speech synthesis
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesisEuropean Signal Processing Conference (EUSIPCO), 2021
Beáta Lőrincz
Adriana Stan
M. Giurgiu
134
6
0
03 Jun 2021
Review of end-to-end speech synthesis technology based on deep learning
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLMALM
236
31
0
20 Apr 2021
1
Page 1 of 1