An Open source Implementation of ITU-T Recommendation P.808 with Validation

17 May 2020
Babak Naderi
Ross Cutler

Papers citing "An Open source Implementation of ITU-T Recommendation P.808 with Validation"

48 / 48 papers shown
High-Fidelity Speech Enhancement via Discrete Audio Tokens
Luca A. Lanzendörfer
Frédéric Berdoz
Antonis Asonitis
Roger Wattenhofer
186
1
0
02 Oct 2025
Robust Residual Finite Scalar Quantization for Neural Compression
Xiaoxu Zhu
Jiakui Li
Ken Zheng
Guiping Zhong
Huimeng Wang
MQ
181
0
0
20 Aug 2025
Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing Methods
Laura Lechler
Chamran Moradi
Ivana Balic
195
4
0
01 Jun 2025
UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension
Kishan Gupta
Srikanth Korse
Andreas Brendel
N. Pia
Guillaume Fuchs
251
0
0
22 May 2025
A multidimensional measurement of photorealistic avatar quality of experience
Proceedings of the ACM on Human-Computer Interaction (PACMHCI), 2024
Ross Cutler
Babak Naderi
Vishak Gopal
Dharmendar Reddy Palle
487
0
0
13 Nov 2024
A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation
Alexander H. Liu
Qirui Wang
Yuan Gong
James Glass
205
2
0
29 Oct 2024
On Improving Error Resilience of Neural End-to-End Speech Coders
Kishan Gupta
N. Pia
Srikanth Korse
Andreas Brendel
Guillaume Fuchs
M. Multrus
273
1
0
13 Jun 2024
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang
Niki Trigoni
Andrew Markham
458
9
0
11 Jun 2024
Crowdsourced Multilingual Speech Intelligibility Testing
Laura Lechler
Kamil Wojcicki
344
6
0
21 Mar 2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
351
18
0
02 Feb 2024
ICASSP 2023 Acoustic Echo Cancellation Challenge
IEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Marju Purin
Evgenii Indenbom
Nicolae-Cătălin Ristea
Jegor Guzvin
H. Gamper
Sebastian Braun
R. Aichner
279
36
0
22 Sep 2023
Multi-dimensional Speech Quality Assessment in Crowdsourcing
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Babak Naderi
Ross Cutler
Nicolae-Cătălin Ristea
154
21
0
14 Sep 2023
Analysis of XLS-R for Speech Quality Assessment
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
Bastiaan Tamm
Rik Vandenberghe
Hugo Van hamme
304
8
0
23 Aug 2023
PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms
Interspeech (Interspeech), 2023
Lorenz Diener
Marju Purin
Sten Sootla
Ando Saabas
R. Aichner
Ross Cutler
226
32
0
24 May 2023
ICASSP 2023 Deep Noise Suppression Challenge
IEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023
Harishchandra Dubey
A. Aazami
Vishak Gopal
Sergiy Matusevych
Sebastian Braun
...
Sefik Emre Eskimez
Manthan Thakker
H. Gamper
Takuya Yoshioka
R. Aichner
361
173
0
21 Mar 2023
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study
Massa Baali
Tomoki Hayashi
Hamdy Mubarak
Soumi Maiti
Shinji Watanabe
W. El-Hajj
Ahmed M. Ali
234
12
0
22 Jan 2023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2023
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
552
1,122
0
05 Jan 2023
Speech MOS multi-task learning and rater bias correction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
H. Akrami
H. Gamper
243
0
0
04 Dec 2022
Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications
Interspeech (Interspeech), 2022
Bastiaan Tamm
Helena Balabin
Rik Vandenberghe
Hugo Van hamme
255
10
0
01 Oct 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
502
350
0
11 Aug 2022
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Tomoki Toda
214
24
0
10 Jul 2022
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Interspeech (Interspeech), 2022
J. Valin
Ahmed Mustafa
Christopher Montgomery
Timothy B. Terriberry
Michael Klingbeil
Paris Smaragdis
A. Krishnaswamy
182
24
0
11 May 2022
Predicting score distribution to improve non-intrusive speech quality estimation
A. Faridee
H. Gamper
200
1
0
13 Apr 2022
INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge
Interspeech (Interspeech), 2022
Lorenz Diener
Sten Sootla
Solomiya Branets
Ando Saabas
R. Aichner
Ross Cutler
212
55
0
11 Apr 2022
Effective data screening technique for crowdsourced speech intelligibility experiments: Evaluation with IRM-based speech enhancement
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Ayako Yamamoto
Toshio Irino
S. Araki
Kenichi Arai
A. Ogawa
K. Kinoshita
Tomohiro Nakatani
215
3
0
31 Mar 2022
Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Guochen Yu
Andong Li
Wenzhe Liu
C. Zheng
Yutian Wang
Haibo Wang
295
4
0
30 Mar 2022
ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Interspeech (Interspeech), 2022
Gaoxiong Yi
Wei Xiao
Yiming Xiao
Babak Naderi
Sebastian Möller
...
Z. Zhang
Donald Williamson
Fei Chen
Fuzheng Yang
Shidong Shang
261
66
0
30 Mar 2022
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
IEEE Access (IEEE Access), 2022
Rishabh Jain
Mariam Yiwere
Dan Bigioi
Peter Corcoran
H. Cucu
276
21
0
22 Mar 2022
DMF-Net: A decoupling-style multi-band fusion model for full-band speech enhancement
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Guochen Yu
Yuansheng Guan
Weixin Meng
C. Zheng
Haibo Wang
436
3
0
01 Mar 2022
ICASSP 2022 Acoustic Echo Cancellation Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Marju Purin
H. Gamper
Sebastian Braun
K. Sørensen
R. Aichner
280
79
0
27 Feb 2022
ICASSP 2022 Deep Noise Suppression Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Harishchandra Dubey
Vishak Gopal
Ross Cutler
Chandan K. A. Reddy
Sergiy Matusevych
...
Sefik Emre Eskimez
Manthan Thakker
Sriram Srinivasan
H. Gamper
R. Aichner
378
228
0
27 Feb 2022
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Automatic Speech Recognition & Understanding (ASRU), 2021
Damien Ronssin
Milos Cernak
244
14
0
12 Nov 2021
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Hung-yi Lee
Shinji Watanabe
Tomoki Toda
206
45
0
12 Oct 2021
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Szu-Wei Fu
Cheng Yu
Kuo-Hsuan Hung
Mirco Ravanelli
Yu Tsao
266
61
0
12 Oct 2021
Acoustic Echo Cancellation using Residual U-Nets
J. Silva-Rodríguez
Manuel F. Dolz
M. Ferrer
Adrián Castelló
V. Naranjo
G. Piñero
204
3
0
20 Sep 2021
On Prosody Modeling for ASR+TTS based Voice Conversion
Automatic Speech Recognition & Understanding (ASRU), 2021
Wen-Chin Huang
Tomoki Hayashi
Xinjian Li
Shinji Watanabe
Tomoki Toda
272
11
0
20 Jul 2021
F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement
Interspeech (Interspeech), 2021
Shimin Zhang
Yuxiang Kong
Shubo Lv
Yanxin Hu
Lei Xie
222
50
0
14 Jun 2021
Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Interspeech (Interspeech), 2021
Ayako Yamamoto
Toshio Irino
Kenichi Arai
S. Araki
A. Ogawa
K. Kinoshita
Tomohiro Nakatani
153
13
0
17 Apr 2021
Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ziteng Wang
Yueyue Na
Zhang Liu
Biao Tian
Q. Fu
301
41
0
17 Feb 2021
Interspeech 2021 Deep Noise Suppression Challenge
Interspeech (Interspeech), 2021
Chandan K. A. Reddy
Harishchandra Dubey
K. Koishida
A. Nair
Vishak Gopal
Ross Cutler
Sebastian Braun
H. Gamper
R. Aichner
Sriram Srinivasan
AI4CE
510
197
0
06 Jan 2021
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement
Hamed Hemati
Damian Borth
232
11
0
12 Nov 2020
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
417
499
0
28 Oct 2020
Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Babak Naderi
Gabriel Mittag
Rafael Zequeira Jiménez
Sebastian Möller
192
0
0
26 Oct 2020
Subjective Evaluation of Noise Suppression Algorithms in Crowdsourcing
Babak Naderi
Ross Cutler
335
11
0
25 Oct 2020
Crowdsourcing approach for subjective evaluation of echo impairment
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ross Cutler
Babak Naderi
Markus Loide
Sten Sootla
Ando Saabas
366
19
0
25 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
Tomoki Toda
BDL
352
48
0
23 Oct 2020
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results
K. Sridhar
Ross Cutler
Ando Saabas
Tanel Pärnamaa
Markus Loide
H. Gamper
Sebastian Braun
R. Aichner
Sriram Srinivasan
348
20
0
10 Sep 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
373
48
0
07 Aug 2020