An Open source Implementation of ITU-T Recommendation P.808 with Validation

17 May 2020

Babak Naderi

Ross Cutler


Papers citing "An Open source Implementation of ITU-T Recommendation P.808 with Validation"


High-Fidelity Speech Enhancement via Discrete Audio Tokens

193

02 Oct 2025

Robust Residual Finite Scalar Quantization for Neural Compression

193

20 Aug 2025

Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing Methods

Laura Lechler

Chamran Moradi

Ivana Balic

197

01 Jun 2025

UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension

252

22 May 2025

A multidimensional measurement of photorealistic avatar quality of experience

Proceedings of the ACM on Human-Computer Interaction (PACMHCI), 2024

Ross Cutler

Babak Naderi

Vishak Gopal

Dharmendar Reddy Palle

488

13 Nov 2024

A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation

214

29 Oct 2024

On Improving Error Resilience of Neural End-to-End Speech Coders

281

13 Jun 2024

Pre-training Feature Guided Diffusion Model for Speech Enhancement

Yiyuan Yang

Niki Trigoni

Andrew Markham

458

11 Jun 2024

Crowdsourced Multilingual Speech Intelligibility Testing

Laura Lechler

Kamil Wojcicki

345

21 Mar 2024

Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge

360

02 Feb 2024

ICASSP 2023 Acoustic Echo Cancellation Challenge

IEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023

Nicolae-Cătălin Ristea

283

22 Sep 2023

Multi-dimensional Speech Quality Assessment in Crowdsourcing

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Babak Naderi

Ross Cutler

Nicolae-Cătălin Ristea

157

14 Sep 2023

Analysis of XLS-R for Speech Quality Assessment

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023

Bastiaan Tamm

Rik Vandenberghe

Hugo Van hamme

308

23 Aug 2023

PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

Interspeech (Interspeech), 2023

232

24 May 2023

ICASSP 2023 Deep Noise Suppression Challenge

IEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023

...

364

174

21 Mar 2023

Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study

237

22 Jan 2023

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2023

...

598

1,122

05 Jan 2023

Speech MOS multi-task learning and rater bias correction

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

H. Akrami

H. Gamper

249

04 Dec 2022

Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications

Interspeech (Interspeech), 2022

255

01 Oct 2022

Speech Enhancement and Dereverberation with Diffusion-based Generative Models

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

502

354

11 Aug 2022

A Comparative Study of Self-supervised Speech Representation Based Voice Conversion

IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

222

10 Jul 2022

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model

Interspeech (Interspeech), 2022

J. Valin

Ahmed Mustafa

Christopher Montgomery

Timothy B. Terriberry

Michael Klingbeil

Paris Smaragdis

A. Krishnaswamy

183

11 May 2022

Predicting score distribution to improve non-intrusive speech quality estimation

A. Faridee

H. Gamper

206

13 Apr 2022

INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

Interspeech (Interspeech), 2022

228

11 Apr 2022

Effective data screening technique for crowdsourced speech intelligibility experiments: Evaluation with IRM-based speech enhancement

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

222

31 Mar 2022

Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

295

30 Mar 2022

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Interspeech (Interspeech), 2022

Sebastian Möller

...

272

30 Mar 2022

A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis

IEEE Access (IEEE Access), 2022

279

22 Mar 2022

DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

437

01 Mar 2022

ICASSP 2022 Acoustic Echo Cancellation Challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

283

27 Feb 2022

ICASSP 2022 Deep Noise Suppression Challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

...

388

228

27 Feb 2022

AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion

Automatic Speech Recognition & Understanding (ASRU), 2021

Damien Ronssin

Milos Cernak

254

12 Nov 2021

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

206

12 Oct 2021

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Mirco Ravanelli

270

12 Oct 2021

Acoustic Echo Cancellation using Residual U-Nets

210

20 Sep 2021

On Prosody Modeling for ASR+TTS based Voice Conversion

Automatic Speech Recognition & Understanding (ASRU), 2021

282

20 Jul 2021

F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement

Interspeech (Interspeech), 2021

Shimin Zhang

Yuxiang Kong

Shubo Lv

Yanxin Hu

Lei Xie

222

14 Jun 2021

Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility

Interspeech (Interspeech), 2021

159

17 Apr 2021

Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

304

17 Feb 2021

Interspeech 2021 Deep Noise Suppression Challenge

Interspeech (Interspeech), 2021

516

197

06 Jan 2021

Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement

Hamed Hemati

Damian Borth

238

12 Nov 2020

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Chandan K. A. Reddy

Vishak Gopal

Ross Cutler

422

499

28 Oct 2020

Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Babak Naderi

Gabriel Mittag

Rafael Zequeira Jiménez

Sebastian Möller

193

26 Oct 2020

Subjective Evaluation of Noise Suppression Algorithms in Crowdsourcing

Babak Naderi

Ross Cutler

338

25 Oct 2020

Crowdsourcing approach for subjective evaluation of echo impairment

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

388

25 Oct 2020

Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

369

23 Oct 2020

ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results

348

10 Sep 2020

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

385

07 Aug 2020