v1v2 (latest)

Low-resource expressive text-to-speech using data augmentation

11 November 2020

Papers citing "Low-resource expressive text-to-speech using data augmentation"

28 / 28 papers shown

Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis

147

18 Aug 2025

Exploring synthetic data for cross-speaker style transfer in style representation based TTS

Lucas Ueda

Leonardo B. de M. M. Marques

258

25 Sep 2024

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

...

Soledad López Gambino

478

116

12 Feb 2024

Creating New Voices using Normalizing Flows

Roberto Barra-Chicote

Daniel Korzekwa

272

22 Dec 2023

Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion

477

24 Nov 2023

Low-Resource Text-to-Speech Using Specific Data and Noise AugmentationEuropean Signal Processing Conference (EUSIPCO), 2023

240

16 Jun 2023

Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-SpeechInterspeech (Interspeech), 2023

Shijun Wang

Jón Guðnason

Damian Borth

281

09 Jun 2023

Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource LanguagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

198

28 Mar 2023

Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed DataInternational Conference on Mobile Ad-hoc and Sensor Networks (MSN), 2022

179

25 Oct 2022

An Overview of Affective Speech Synthesis and Conversion in the Deep Learning EraProceedings of the IEEE (Proc. IEEE), 2022

Andreas Triantafyllopoulos

Björn W. Schuller

...

308

06 Oct 2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentationInterspeech (Interspeech), 2022

197

29 Jul 2022

Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech SynthesisInterspeech (Interspeech), 2022

140

25 Jul 2022

Computer-assisted Pronunciation Training -- Speech synthesis is almost all you needSpeech Communication (Speech Commun.), 2022

208

02 Jul 2022

Automatic Evaluation of Speaker SimilarityInterspeech (Interspeech), 2022

232

01 Jul 2022

TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoderInterspeech (Interspeech), 2022

231

30 Jun 2022

TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTSIEEE International Joint Conference on Neural Network (IJCNN), 2022

226

24 May 2022

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data AugmentationInterspeech (Interspeech), 2022

259

21 Apr 2022

Data-augmented cross-lingual synthesis in a teacher-student frameworkInterspeech (Interspeech), 2022

264

31 Mar 2022

SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training StrategyInterspeech (Interspeech), 2022

Shuai Guo

Jiatong Shi

Tao Qian

Shinji Watanabe

Qin Jin

322

31 Mar 2022

Text-free non-parallel many-to-many voice conversion using normalising flowsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Roberto Barra-Chicote

Daniel Korzekwa

291

15 Mar 2022

Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing moduleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Roberto Barra-Chicote

Bartek Perz

Jaime Lorenzo-Trueba

241

16 Feb 2022

Distribution augmentation for low-resource expressive text-to-speechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

...

Trevor Wood

217

13 Feb 2022

Cross-speaker style transfer for text-to-speech using data augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

208

10 Feb 2022

Voice Conversion Can Improve ASR in Very Low-Resource SettingsInterspeech (Interspeech), 2021

Matthew Baas

Herman Kamper

321

04 Nov 2021

A Survey on Neural Speech Synthesis

Xu Tan

468

446

29 Jun 2021

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

223

24 Jun 2021

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesisEuropean Signal Processing Conference (EUSIPCO), 2021

Beáta Lőrincz

Adriana Stan

M. Giurgiu

134

03 Jun 2021

Review of end-to-end speech synthesis technology based on deep learning

236

20 Apr 2021