Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.08138
Cited By
An Open source Implementation of ITU-T Recommendation P.808 with Validation
17 May 2020
Babak Naderi
Ross Cutler
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An Open source Implementation of ITU-T Recommendation P.808 with Validation"
46 / 46 papers shown
Title
Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing Methods
Laura Lechler
Chamran Moradi
Ivana Balic
30
0
0
01 Jun 2025
UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension
Kishan Gupta
Srikanth Korse
Andreas Brendel
N. Pia
Guillaume Fuchs
45
0
0
22 May 2025
A multidimensional measurement of photorealistic avatar quality of experience
Ross Cutler
Babak Naderi
Vishak Gopal
Dharmendar Reddy Palle
88
0
0
13 Nov 2024
A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation
Alexander H. Liu
Qirui Wang
Yuan Gong
James Glass
66
0
0
29 Oct 2024
On Improving Error Resilience of Neural End-to-End Speech Coders
Kishan Gupta
N. Pia
Srikanth Korse
Andreas Brendel
Guillaume Fuchs
M. Multrus
80
0
0
13 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
156
3
0
11 Jun 2024
Crowdsourced Multilingual Speech Intelligibility Testing
Laura Lechler
Kamil Wojcicki
57
2
0
21 Mar 2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
77
11
0
02 Feb 2024
ICASSP 2023 Acoustic Echo Cancellation Challenge
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Marju Purin
Evgenii Indenbom
Nicolae-Cătălin Ristea
Jegor Guzvin
H. Gamper
Sebastian Braun
R. Aichner
81
22
0
22 Sep 2023
Multi-dimensional Speech Quality Assessment in Crowdsourcing
Babak Naderi
Ross Cutler
Nicolae-Cătălin Ristea
67
15
0
14 Sep 2023
Analysis of XLS-R for Speech Quality Assessment
Bastiaan Tamm
Rik Vandenberghe
Hugo Van hamme
51
3
0
23 Aug 2023
PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms
Lorenz Diener
Marju Purin
Sten Sootla
Ando Saabas
R. Aichner
Ross Cutler
69
20
0
24 May 2023
ICASSP 2023 Deep Noise Suppression Challenge
Harishchandra Dubey
A. Aazami
Vishak Gopal
Sergiy Matusevych
Sebastian Braun
...
Sefik Emre Eskimez
Manthan Thakker
H. Gamper
Takuya Yoshioka
R. Aichner
92
96
0
21 Mar 2023
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study
Massa Baali
Tomoki Hayashi
Hamdy Mubarak
Soumi Maiti
Shinji Watanabe
W. El-Hajj
Ahmed M. Ali
47
11
0
22 Jan 2023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
193
727
0
05 Jan 2023
Speech MOS multi-task learning and rater bias correction
H. Akrami
H. Gamper
76
0
0
04 Dec 2022
Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications
Bastiaan Tamm
Helena Balabin
Rik Vandenberghe
Hugo Van hamme
75
9
0
01 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
93
208
0
11 Aug 2022
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Tomoki Toda
66
17
0
10 Jul 2022
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
J. Valin
Ahmed Mustafa
Christopher Montgomery
Timothy B. Terriberry
Michael Klingbeil
Paris Smaragdis
A. Krishnaswamy
61
18
0
11 May 2022
Predicting score distribution to improve non-intrusive speech quality estimation
A. Faridee
H. Gamper
51
1
0
13 Apr 2022
INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge
Lorenz Diener
Sten Sootla
Solomiya Branets
Ando Saabas
R. Aichner
Ross Cutler
68
43
0
11 Apr 2022
Effective data screening technique for crowdsourced speech intelligibility experiments: Evaluation with IRM-based speech enhancement
Ayako Yamamoto
Toshio Irino
S. Araki
Kenichi Arai
A. Ogawa
K. Kinoshita
Tomohiro Nakatani
55
2
0
31 Mar 2022
Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Guochen Yu
Andong Li
Wenzhe Liu
C. Zheng
Yutian Wang
Haibo Wang
88
4
0
30 Mar 2022
ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Gaoxiong Yi
Wei Xiao
Yiming Xiao
Babak Naderi
Sebastian Möller
...
Z. Zhang
Donald Williamson
Fei Chen
Fuzheng Yang
Shidong Shang
90
49
0
30 Mar 2022
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Rishabh Jain
Mariam Yiwere
Dan Bigioi
Peter Corcoran
H. Cucu
69
14
0
22 Mar 2022
DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement
Guochen Yu
Yuansheng Guan
Weixin Meng
C. Zheng
Haibo Wang
96
2
0
01 Mar 2022
ICASSP 2022 Acoustic Echo Cancellation Challenge
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Marju Purin
H. Gamper
Sebastian Braun
K. Sørensen
R. Aichner
80
74
0
27 Feb 2022
ICASSP 2022 Deep Noise Suppression Challenge
Harishchandra Dubey
Vishak Gopal
Ross Cutler
Chandan K. A. Reddy
Sergiy Matusevych
...
Sefik Emre Eskimez
Manthan Thakker
Sriram Srinivasan
H. Gamper
R. Aichner
104
194
0
27 Feb 2022
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Damien Ronssin
Milos Cernak
69
11
0
12 Nov 2021
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Hung-yi Lee
Shinji Watanabe
Tomoki Toda
71
40
0
12 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
93
46
0
12 Oct 2021
Acoustic Echo Cancellation using Residual U-Nets
J. Silva-Rodríguez
Manuel F. Dolz
M. Ferrer
Adrián Castelló
V. Naranjo
G. Piñero
89
2
0
20 Sep 2021
On Prosody Modeling for ASR+TTS based Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Xinjian Li
Shinji Watanabe
Tomoki Toda
73
9
0
20 Jul 2021
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement
Shimin Zhang
Yuxiang Kong
Shubo Lv
Yanxin Hu
Lei Xie
67
44
0
14 Jun 2021
Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Ayako Yamamoto
Toshio Irino
Kenichi Arai
S. Araki
A. Ogawa
K. Kinoshita
Tomohiro Nakatani
37
10
0
17 Apr 2021
Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge
Ziteng Wang
Yueyue Na
Zhang Liu
Biao Tian
Q. Fu
76
36
0
17 Feb 2021
Interspeech 2021 Deep Noise Suppression Challenge
Chandan K. A. Reddy
Harishchandra Dubey
K. Koishida
A. Nair
Vishak Gopal
Ross Cutler
Sebastian Braun
H. Gamper
R. Aichner
Sriram Srinivasan
AI4CE
127
164
0
06 Jan 2021
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement
Hamed Hemati
Damian Borth
69
9
0
12 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
114
316
0
28 Oct 2020
Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms
Babak Naderi
Gabriel Mittag
Rafael Zequeira Jimaénez
Sebastian Möller
56
0
0
26 Oct 2020
Subjective Evaluation of Noise Suppression Algorithms in Crowdsourcing
Babak Naderi
Ross Cutler
65
10
0
25 Oct 2020
Crowdsourcing approach for subjective evaluation of echo impairment
Ross Cutler
Babak Nadari
Markus Loide
Sten Sootla
Ando Saabas
89
18
0
25 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
Tomoki Toda
BDL
111
38
0
23 Oct 2020
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results
K. Sridhar
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Markus Loide
H. Gamper
Sebastian Braun
R. Aichner
Sriram Srinivasan
73
20
0
10 Sep 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
118
40
0
07 Aug 2020
1