Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis

The Speaker and Language Recognition Workshop (Odyssey), 2020

28 February 2020

Papers citing "Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis"

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting

300

08 Oct 2023

Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages

Interspeech, 2023

261

30 May 2023

Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis

Interspeech, 2023

200

29 May 2023

SQuId: Measuring Speech Naturalness in Many Languages

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

386

12 Oct 2022

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Interspeech, 2022

Cassia Valentini-Botinhao

164

22 Sep 2022

SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis

Interspeech, 2022

Aimilios Chalamandaris

Pirros Tsiakoulis

322

06 Apr 2022

The VoiceMOS Challenge 2022

Interspeech, 2022

481

156

21 Mar 2022

Human Perception of Audio Deepfakes

Nicolas Müller

Karla Markert

Konstantin Böttinger

483

20 Jul 2021

MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Xu Tan

Xiang-Yang Li

284

126

27 Feb 2021

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

277

21 Oct 2020

Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions

Rohan Kumar Das

246

08 Sep 2020

An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning

IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Haizhou Li

642

413

09 Aug 2020