CAMEO: Collection of Multilingual Emotional Speech Corpora

16 May 2025

Abstract

This paper presents CAMEO -- a curated collection of multilingual emotional speech datasets designed to facilitate research in emotion recognition and other speech-related tasks. The main objectives were to ensure easy access to the data, to allow reproducibility of the results, and to provide a standardized benchmark for evaluating speech emotion recognition (SER) systems across different emotional states and languages. The paper describes the dataset selection criteria, the curation and normalization process, and provides performance results for several models. The collection, along with metadata, and a leaderboard, is publicly available via the Hugging Face platform.

View on arXiv

@article{christop2025_2505.11051,
  title={ CAMEO: Collection of Multilingual Emotional Speech Corpora },
  author={ Iwona Christop and Maciej Czajka },
  journal={arXiv preprint arXiv:2505.11051},
  year={ 2025 }
}

Comments on this paper