94

Reference Microphone Selection for Guided Source Separation based on the Normalized L-p Norm

Main:4 Pages
2 Figures
Bibliography:1 Pages
2 Tables
Abstract

Guided Source Separation (GSS) is a popular front-end for distant automatic speech recognition (ASR) systems using spatially distributed microphones. When considering spatially distributed microphones, the choice of reference microphone may have a large influence on the quality of the output signal and the downstream ASR performance. In GSS-based speech enhancement, reference microphone selection is typically performed using the signal-to-noise ratio (SNR), which is optimal for noise reduction but may neglect differences in early-to-late-reverberant ratio (ELR) across microphones. In this paper, we propose two reference microphone selection methods for GSS-based speech enhancement that are based on the normalized p\ell_p-norm, either using only the normalized p\ell_p-norm or combining the normalized p\ell_p-norm and the SNR to account for both differences in SNR and ELR across microphones. Experimental evaluation using a CHiME-8 distant ASR system shows that the proposed p\ell_p-norm-based methods outperform the baseline method, reducing the macro-average word error rate.

View on arXiv
Comments on this paper