On the Effect of Ruleset Tuning and Data Imbalance on Explainable Network Security Alert Classifications: a Case-Study on DeepCASE

2 July 2025

Koen T. W. Teuwen

Sam Baggen

Emmanuele Zambon

Luca Allodi

ArXiv (abs)PDF HTML

Main:7 Pages

5 Figures

Bibliography:2 Pages

10 Tables

Appendix:5 Pages

Abstract

Automation in Security Operations Centers (SOCs) plays a prominent role in alert classification and incident escalation. However, automated methods must be robust in the presence of imbalanced input data, which can negatively affect performance. Additionally, automated methods should make explainable decisions. In this work, we evaluate the effect of label imbalance on the classification of network intrusion alerts. As our use-case we employ DeepCASE, the state-of-the-art method for automated alert classification. We show that label imbalance impacts both classification performance and correctness of the classification explanations offered by DeepCASE. We conclude tuning the detection rules used in SOCs can significantly reduce imbalance and may benefit the performance and explainability offered by alert post-processing methods such as DeepCASE. Therefore, our findings suggest that traditional methods to improve the quality of input data can benefit automation.

View on arXiv

@article{teuwen2025_2507.01571,
  title={ On the Effect of Ruleset Tuning and Data Imbalance on Explainable Network Security Alert Classifications: a Case-Study on DeepCASE },
  author={ Koen T. W. Teuwen and Sam Baggen and Emmanuele Zambon and Luca Allodi },
  journal={arXiv preprint arXiv:2507.01571},
  year={ 2025 }
}

Comments on this paper