Evaluating Fake Music Detection Performance Under Audio Augmentations

7 July 2025

Tomasz Sroka

Tomasz Wężowicz

Dominik Sidorczuk

Mateusz Modrzejewski

ArXiv (abs)PDF HTML

Main:1 Pages

1 Figures

1 Tables

Appendix:2 Pages

Abstract

With the rapid advancement of generative audio models, distinguishing between human-composed and generated music is becoming increasingly challenging. As a response, models for detecting fake music have been proposed. In this work, we explore the robustness of such systems under audio augmentations. To evaluate model generalization, we constructed a dataset consisting of both real and synthetic music generated using several systems. We then apply a range of audio transformations and analyze how they affect classification accuracy. We test the performance of a recent state-of-the-art musical deepfake detection model in the presence of audio augmentations. The performance of the model decreases significantly even with the introduction of light augmentations.

View on arXiv

Comments on this paper