2002.05879
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
14 February 2020
M. Kawanaka
Yuma Koizumi
Ryoichi Miyazaki
Kohei Yatabe
AAML
Papers citing "Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function"
12 / 12 papers shown
Analysis of Noisy-target Training for DNN-based speech enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuya Fujimura
Tomoki Toda
265
10
0
02 Nov 2022
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
195
11
0
02 Nov 2021
Objective Measures of Perceptual Audio Quality Reviewed: An Evaluation of Their Application Domain Dependence
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Matteo Torcoli
T. Kastner
Jürgen Herre
254
83
0
21 Oct 2021
Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ryosuke Sawata
Yosuke Kashiwagi
Shusuke Takahashi
269
10
0
12 Oct 2021
Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Matteo Torcoli
Jouni Paulus
T. Kastner
C. Uhle
128
8
0
21 Jul 2021
Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haoyu Li
Junichi Yamagishi
291
15
0
17 Apr 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Interspeech, 2021
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
344
287
0
08 Apr 2021
Expressive TTS Training with Frame and Style Reconstruction Loss
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
341
83
0
04 Aug 2020
Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing
Szu-Wei Fu
Chien-Feng Liao
Tsun-An Hsieh
Kuo-Hsuan Hung
Syu-Siang Wang
...
Ryandhimas E. Zezario
You-Jin Li
Shang-Yi Chuang
Yen-Ju Lu
Yu Tsao
227
6
0
18 Jun 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
232
138
0
14 Feb 2020
Real-time speech enhancement using equilibriated RNN
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
Noboru Harada
153
44
0
14 Feb 2020
Invertible DNN-based nonlinear time-frequency transform for speech enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
Noboru Harada
382
10
0
25 Nov 2019