2002.12645
Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
The Speaker and Language Recognition Workshop (Odyssey), 2020
28 February 2020
Jennifer Williams
Joanna Rownicka
P. Oplustil
Simon King
Papers citing "Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis"
12 / 12 papers shown
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Automatic Speech Recognition & Understanding (ASRU), 2023
Hemant Yadav
Erica Cooper
Junichi Yamagishi
Sunayana Sitaram
R. Shah
295
0
0
08 Oct 2023
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages
Interspeech, 2023
P. Do
Matt Coler
J. Dijkstra
E. Klabbers
260
5
0
30 May 2023
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis
Interspeech, 2023
Erik Ekstedt
Siyang Wang
Éva Székely
Joakim Gustafson
Gabriel Skantze
198
9
0
29 May 2023
SQuId: Measuring Speech Naturalness in Many Languages
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Thibault Sellam
Ankur Bapna
Joshua Camp
Diana Mackinnon
Ankur P. Parikh
Jason Riesa
375
27
0
12 Oct 2022
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Interspeech, 2022
Cassia Valentini-Botinhao
M. Ribeiro
O. Watts
Korin Richmond
G. Henter
159
4
0
22 Sep 2022
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
Interspeech, 2022
Georgia Maniati
Alexandra Vioni
Nikolaos Ellinas
Karolos Nikitaras
Konstantinos Klapsas
June Sig Sung
Gunu Jho
Aimilios Chalamandaris
Pirros Tsiakoulis
310
47
0
06 Apr 2022
The VoiceMOS Challenge 2022
Interspeech, 2022
Wen-Chin Huang
Erica Cooper
Yu Tsao
Hsin-Min Wang
Tomoki Toda
Junichi Yamagishi
475
156
0
21 Mar 2022
Human Perception of Audio Deepfakes
Nicolas Müller
Karla Markert
Konstantin Böttinger
477
72
0
20 Jul 2021
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yichong Leng
Xu Tan
Sheng Zhao
Frank Soong
Xiang-Yang Li
Tao Qin
281
124
0
27 Feb 2021
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Jennifer Williams
Yi Zhao
Erica Cooper
Junichi Yamagishi
SSL
275
26
0
21 Oct 2020
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions
Rohan Kumar Das
Tomi Kinnunen
Wen-Chin Huang
Zhenhua Ling
Junichi Yamagishi
Yi Zhao
Xiaohai Tian
Tomoki Toda
244
58
0
08 Sep 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
631
407
0
09 Aug 2020