In this paper, we present a deep-learning method to filter out effects such as ambient noise, reflections, or source directivity from microphone array data represented as cross-spectral matrices. Specifically, we focus on a generative adversarial network (GAN) architecture designed to transform fixed-size cross-spectral matrices. Theses models were trained using sound pressure simulations of varying complexity developed for this purpose. Based on the results from applying these methods in a hyperparameter optimization of an auto-encoding task, we trained the optimized model to perform five distinct transformation tasks derived from different complexities inherent in our sound pressure simulations.
View on arXiv@article{puhle2025_2502.21097, title={ Deep learning-based filtering of cross-spectral matrices using generative adversarial networks }, author={ Christof Puhle }, journal={arXiv preprint arXiv:2502.21097}, year={ 2025 } }