An Open source Implementation of ITU-T Recommendation P.808 with Validation

17 May 2020

Papers citing "An Open source Implementation of ITU-T Recommendation P.808 with Validation"

46 / 46 papers shown

Title
Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing Methods Laura Lechler Chamran Moradi Ivana Balic 30 0 0 01 Jun 2025
UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension Kishan Gupta Srikanth Korse Andreas Brendel N. Pia Guillaume Fuchs 45 0 0 22 May 2025
A multidimensional measurement of photorealistic avatar quality of experience Ross Cutler Babak Naderi Vishak Gopal Dharmendar Reddy Palle 88 0 0 13 Nov 2024
A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation Alexander H. Liu Qirui Wang Yuan Gong James Glass 66 0 0 29 Oct 2024
On Improving Error Resilience of Neural End-to-End Speech Coders Kishan Gupta N. Pia Srikanth Korse Andreas Brendel Guillaume Fuchs M. Multrus 80 0 0 13 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement Yiyuan Yang Niki Trigoni Andrew Markham 156 3 0 11 Jun 2024
Crowdsourced Multilingual Speech Intelligibility Testing Laura Lechler Kamil Wojcicki 57 2 0 21 Mar 2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge Simon Leglaive Matthieu Fraticelli Hend ElGhazaly Léonie Borne Mostafa Sadeghi Scott Wisdom Manuel Pariente J. Hershey Daniel Pressnitzer Jon P. Barker 77 11 0 02 Feb 2024
ICASSP 2023 Acoustic Echo Cancellation Challenge Ross Cutler Ando Saabas Tanel Pärnamaa Marju Purin Evgenii Indenbom Nicolae-Cătălin Ristea Jegor Guzvin H. Gamper Sebastian Braun R. Aichner 81 22 0 22 Sep 2023
Multi-dimensional Speech Quality Assessment in Crowdsourcing Babak Naderi Ross Cutler Nicolae-Cătălin Ristea 67 15 0 14 Sep 2023
Analysis of XLS-R for Speech Quality Assessment Bastiaan Tamm Rik Vandenberghe Hugo Van hamme 51 3 0 23 Aug 2023
PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms Lorenz Diener Marju Purin Sten Sootla Ando Saabas R. Aichner Ross Cutler 69 20 0 24 May 2023
ICASSP 2023 Deep Noise Suppression Challenge Harishchandra Dubey A. Aazami Vishak Gopal Sergiy Matusevych Sebastian Braun ... Sefik Emre Eskimez Manthan Thakker H. Gamper Takuya Yoshioka R. Aichner 92 96 0 21 Mar 2023
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study Massa Baali Tomoki Hayashi Hamdy Mubarak Soumi Maiti Shinji Watanabe W. El-Hajj Ahmed M. Ali 47 11 0 22 Jan 2023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers Chengyi Wang Sanyuan Chen Yu-Huan Wu Zi-Hua Zhang Long Zhou ... Huaming Wang Jinyu Li Lei He Sheng Zhao Furu Wei 193 727 0 05 Jan 2023
Speech MOS multi-task learning and rater bias correction H. Akrami H. Gamper 76 0 0 04 Dec 2022
Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications Bastiaan Tamm Helena Balabin Rik Vandenberghe Hugo Van hamme 75 9 0 01 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models Julius Richter Simon Welker Jean-Marie Lemercier Bunlong Lay Timo Gerkmann DiffM 93 208 0 11 Aug 2022
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion Wen-Chin Huang Shu-Wen Yang Tomoki Hayashi Tomoki Toda 66 17 0 10 Jul 2022
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model J. Valin Ahmed Mustafa Christopher Montgomery Timothy B. Terriberry Michael Klingbeil Paris Smaragdis A. Krishnaswamy 61 18 0 11 May 2022
Predicting score distribution to improve non-intrusive speech quality estimation A. Faridee H. Gamper 51 1 0 13 Apr 2022
INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge Lorenz Diener Sten Sootla Solomiya Branets Ando Saabas R. Aichner Ross Cutler 68 43 0 11 Apr 2022
Effective data screening technique for crowdsourced speech intelligibility experiments: Evaluation with IRM-based speech enhancement Ayako Yamamoto Toshio Irino S. Araki Kenichi Arai A. Ogawa K. Kinoshita Tomohiro Nakatani 55 2 0 31 Mar 2022
Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement Guochen Yu Andong Li Wenzhe Liu C. Zheng Yutian Wang Haibo Wang 88 4 0 30 Mar 2022
ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications Gaoxiong Yi Wei Xiao Yiming Xiao Babak Naderi Sebastian Möller ... Z. Zhang Donald Williamson Fei Chen Fuzheng Yang Shidong Shang 90 49 0 30 Mar 2022
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis Rishabh Jain Mariam Yiwere Dan Bigioi Peter Corcoran H. Cucu 69 14 0 22 Mar 2022
DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement Guochen Yu Yuansheng Guan Weixin Meng C. Zheng Haibo Wang 96 2 0 01 Mar 2022
ICASSP 2022 Acoustic Echo Cancellation Challenge Ross Cutler Ando Saabas Tanel Pärnamaa Marju Purin H. Gamper Sebastian Braun K. Sørensen R. Aichner 80 74 0 27 Feb 2022
ICASSP 2022 Deep Noise Suppression Challenge Harishchandra Dubey Vishak Gopal Ross Cutler Chandan K. A. Reddy Sergiy Matusevych ... Sefik Emre Eskimez Manthan Thakker Sriram Srinivasan H. Gamper R. Aichner 104 194 0 27 Feb 2022
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Damien Ronssin Milos Cernak 69 11 0 12 Nov 2021
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations Wen-Chin Huang Shu-Wen Yang Tomoki Hayashi Hung-yi Lee Shinji Watanabe Tomoki Toda 71 40 0 12 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech Szu-Wei Fu Cheng Yu Kuo-Hsuan Hung Mirco Ravanelli Yu Tsao 93 46 0 12 Oct 2021
Acoustic Echo Cancellation using Residual U-Nets J. Silva-Rodríguez Manuel F. Dolz M. Ferrer Adrián Castelló V. Naranjo G. Piñero 89 2 0 20 Sep 2021
On Prosody Modeling for ASR+TTS based Voice Conversion Wen-Chin Huang Tomoki Hayashi Xinjian Li Shinji Watanabe Tomoki Toda 73 9 0 20 Jul 2021
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement Shimin Zhang Yuxiang Kong Shubo Lv Yanxin Hu Lei Xie 67 44 0 14 Jun 2021
Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility Ayako Yamamoto Toshio Irino Kenichi Arai S. Araki A. Ogawa K. Kinoshita Tomohiro Nakatani 37 10 0 17 Apr 2021
Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge Ziteng Wang Yueyue Na Zhang Liu Biao Tian Q. Fu 76 36 0 17 Feb 2021
Interspeech 2021 Deep Noise Suppression Challenge Chandan K. A. Reddy Harishchandra Dubey K. Koishida A. Nair Vishak Gopal Ross Cutler Sebastian Braun H. Gamper R. Aichner Sriram Srinivasan AI4CE 127 164 0 06 Jan 2021
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement Hamed Hemati Damian Borth 69 9 0 12 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors Chandan K. A. Reddy Vishak Gopal Ross Cutler 114 316 0 28 Oct 2020
Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms Babak Naderi Gabriel Mittag Rafael Zequeira Jimaénez Sebastian Möller 56 0 0 26 Oct 2020
Subjective Evaluation of Noise Suppression Algorithms in Crowdsourcing Babak Naderi Ross Cutler 65 10 0 25 Oct 2020
Crowdsourcing approach for subjective evaluation of echo impairment Ross Cutler Babak Nadari Markus Loide Sten Sootla Ando Saabas 89 18 0 25 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations Wen-Chin Huang Yi-Chiao Wu Tomoki Hayashi Tomoki Toda BDL 111 38 0 23 Oct 2020
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results K. Sridhar Ross Cutler Ando Saabas Tanel Pärnamaa Markus Loide H. Gamper Sebastian Braun R. Aichner Sriram Srinivasan 73 20 0 10 Sep 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion Wen-Chin Huang Tomoki Hayashi Yi-Chiao Wu Hirokazu Kameoka Tomoki Toda 118 40 0 07 Aug 2020