244

Meta-Learning in Audio and Speech Processing: An End to End Comprehensive Review

International Workshop on Multi-disciplinary Trends in Artificial Intelligence (MTAI), 2024
Main:12 Pages
1 Figures
Bibliography:3 Pages
5 Tables
Abstract

This survey overviews various meta-learning approaches used in audio and speech processing scenarios. Meta-learning is used where model performance needs to be maximized with minimum annotated samples, making it suitable for low-sample audio processing. Although the field has made some significant contributions, audio meta-learning still lacks the presence of comprehensive survey papers. We present a systematic review of meta-learning methodologies in audio processing. This includes audio-specific discussions on data augmentation, feature extraction, preprocessing techniques, meta-learners, task selection strategies and also presents important datasets in audio, together with crucial real-world use cases. Through this extensive review, we aim to provide valuable insights and identify future research directions in the intersection of meta-learning and audio processing.

View on arXiv
Comments on this paper