ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.12645
  4. Cited By
Comparison of Speech Representations for Automatic Quality Estimation in
  Multi-Speaker Text-to-Speech Synthesis
v1v2 (latest)

Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis

The Speaker and Language Recognition Workshop (Odyssey), 2020
28 February 2020
Jennifer Williams
Joanna Rownicka
P. Oplustil
Simon King
ArXiv (abs)PDFHTML

Papers citing "Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis"

12 / 12 papers shown
Partial Rank Similarity Minimization Method for Quality MOS Prediction
  of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised settingAutomatic Speech Recognition & Understanding (ASRU), 2023
Hemant Yadav
Erica Cooper
Junichi Yamagishi
Sunayana Sitaram
R. Shah
295
0
0
08 Oct 2023
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction
  in Text-to-Speech for Low-Resource Languages
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource LanguagesInterspeech (Interspeech), 2023
P. Do
Matt Coler
J. Dijkstra
E. Klabbers
260
5
0
30 May 2023
Automatic Evaluation of Turn-taking Cues in Conversational Speech
  Synthesis
Automatic Evaluation of Turn-taking Cues in Conversational Speech SynthesisInterspeech (Interspeech), 2023
Erik Ekstedt
Siyang Wang
Éva Székely
Joakim Gustafson
Gabriel Skantze
198
9
0
29 May 2023
SQuId: Measuring Speech Naturalness in Many Languages
SQuId: Measuring Speech Naturalness in Many LanguagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Thibault Sellam
Ankur Bapna
Joshua Camp
Diana Mackinnon
Ankur P. Parikh
Jason Riesa
375
27
0
12 Oct 2022
Predicting pairwise preferences between TTS audio stimuli using parallel
  ratings data and anti-symmetric twin neural networks
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networksInterspeech (Interspeech), 2022
Cassia Valentini-Botinhao
M. Ribeiro
O. Watts
Korin Richmond
G. Henter
159
4
0
22 Sep 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural
  Text-to-Speech Synthesis
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech SynthesisInterspeech (Interspeech), 2022
Georgia Maniati
Alexandra Vioni
Nikolaos Ellinas
Karolos Nikitaras
Konstantinos Klapsas
June Sig Sung
Gunu Jho
Aimilios Chalamandaris
Pirros Tsiakoulis
310
47
0
06 Apr 2022
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022Interspeech (Interspeech), 2022
Wen-Chin Huang
Erica Cooper
Yu Tsao
Hsin-Min Wang
Tomoki Toda
Junichi Yamagishi
475
156
0
21 Mar 2022
Human Perception of Audio Deepfakes
Human Perception of Audio Deepfakes
Nicolas Müller
Karla Markert
Konstantin Böttinger
477
72
0
20 Jul 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias NetworkIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
281
124
0
27 Feb 2021
Learning Disentangled Phone and Speaker Representations in a
  Semi-Supervised VQ-VAE Paradigm
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE ParadigmIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
275
26
0
21 Oct 2020
Predictions of Subjective Ratings and Spoofing Assessments of Voice
  Conversion Challenge 2020 Submissions
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions
Rohan Kumar Das
Tomi Kinnunen
Wen-Chin Huang
Zhenhua Ling
Junichi Yamagishi
Yi Zhao
Xiaohai Tian
Tomoki Toda
244
58
0
08 Sep 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
631
407
0
09 Aug 2020
1
Page 1 of 1