v1v2v3 (latest)

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

8 June 2019

Papers citing "Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis"

26 / 26 papers shown

MLAAD: The Multi-Language Audio Anti-Spoofing DatasetIEEE International Joint Conference on Neural Network (IJCNN), 2024

449

124

17 Jan 2024

Controllable Speaking Styles Using a Large Language ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

A. Sigurgeirsson

Simon King

241

17 May 2023

Do Prosody Transfer Models Transfer Prosody?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

A. Sigurgeirsson

Simon King

DiffM

212

07 Mar 2023

Controllable speech synthesis by learning discrete phoneme-level prosodic representationsSpeech Communication (Speech Commun.), 2022

Aimilios Chalamandaris

Pirros Tsiakoulis

P. Mastorocostas

194

29 Nov 2022

Into-TTS : Intonation Template Based Prosody Control System

316

04 Apr 2022

Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention

Artem Gorodetskii

Ivan Ozhiganov

337

25 Jan 2022

Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Aimilios Chalamandaris

Pirros Tsiakoulis

208

19 Nov 2021

Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody ControlInternational Conference on Speech and Computer (SPECOM), 2021

Aimilios Chalamandaris

Pirros Tsiakoulis

204

19 Nov 2021

293

07 Nov 2021

Emotional Prosody Control for Speech Generation

S. Sivaprasad

Saiteja Kosgi

Vineet Gandhi

249

07 Nov 2021

GANtron: Emotional Speech Synthesis with Generative Adversarial Networks

E. Hortal

Rodrigo Brechard Alarcia

GAN

113

06 Oct 2021

Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech SynthesisInterspeech (Interspeech), 2021

156

04 Aug 2021

On Prosody Modeling for ASR+TTS based Voice ConversionAutomatic Speech Recognition & Understanding (ASRU), 2021

288

20 Jul 2021

Learning De-identified Representations of Prosody from Raw AudioInternational Conference on Machine Learning (ICML), 2021

281

17 Jul 2021

Fast DCTTS: Efficient Deep Convolutional Text-to-SpeechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

205

01 Apr 2021

GAN Vocoder: Multi-Resolution Discriminator Is All You NeedInterspeech (Interspeech), 2021

288

09 Mar 2021

FeatherTTS: Robust and Efficient attention based Neural TTSSpeech Synthesis Workshop (SSW), 2020

Heng Lu

182

02 Nov 2020

Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor NetworkInterspeech (Interspeech), 2020

253

25 Jul 2020

Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair DiscriminatorInterspeech (Interspeech), 2020

300

25 Jul 2020

Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding

402

18 May 2020

You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation

Ivan Medennikov

310

14 May 2020

Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Guangzhi Sun

223

130

06 Feb 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody priorIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Guangzhi Sun

Andrew Rosenberg

Bhuvana Ramabhadran

Yonghui Wu

DiffM

270

06 Feb 2020

A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Xiang Yin

Yang Zhang

Yuxuan Wang

177

11 Nov 2019

Location-Relative Attention Mechanisms For Robust Long-Form Speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

297

122

23 Oct 2019

Semi-Supervised Generative Modeling for Controllable Speech SynthesisInternational Conference on Learning Representations (ICLR), 2019

223

03 Oct 2019