WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss

The Speaker and Language Recognition Workshop (Odyssey), 2020
2 February 2020
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li

Papers citing "WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss"

11 / 11 papers shown
Audio representations for deep learning in sound synthesis: A review
ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), 2021
Anastasia Natsiou
Seán O'Leary
AI4TS
189
29
0
07 Jan 2022
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training
Interspeech, 2021
Kun Zhou
Berrak Sisman
Haizhou Li
403
35
0
31 Mar 2021
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech
Kun Zhou
Berrak Sisman
Haizhou Li
DRL
381
47
0
03 Nov 2020
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Rui Liu
Berrak Sisman
Haizhou Li
373
27
0
23 Oct 2020
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS
IEEE Signal Processing Letters (IEEE SPL), 2020
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li
162
20
0
11 Aug 2020
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Zongyang Du
Kun Zhou
Berrak Sisman
Haizhou Li
311
8
0
11 Aug 2020
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Junchen Lu
Kun Zhou
Berrak Sisman
Haizhou Li
DRL
205
21
0
10 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
634
413
0
09 Aug 2020
Expressive TTS Training with Frame and Style Reconstruction Loss
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
341
83
0
04 Aug 2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data
The Speaker and Language Recognition Workshop (Odyssey), 2020
Kun Zhou
Berrak Sisman
Haizhou Li
612
74
0
01 Feb 2020
Teacher-Student Training for Robust Tacotron-based TTS
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Rui Liu
Berrak Sisman
Jingdong Li
F. Bao
Guanglai Gao
Haizhou Li
249
40
0
07 Nov 2019