SaRoHead: A Dataset for Satire Detection in Romanian Multi-Domain News Headlines

Abstract
The headline is an important part of a news article, influenced by expressiveness and connection to the exposed subject. Although most news outlets aim to present reality objectively, some publications prefer a humorous approach in which stylistic elements of satire, irony, and sarcasm blend to cover specific topics. Satire detection can be difficult because a headline aims to expose the main idea behind a news article. In this paper, we propose SaRoHead, the first corpus for satire detection in Romanian multi-domain news headlines. Our findings show that the clickbait used in some non-satirical headlines significantly influences the model.
View on arXiv@article{vîrlan2025_2504.07612, title={ SaRoHead: A Dataset for Satire Detection in Romanian Multi-Domain News Headlines }, author={ Mihnea-Alexandru Vîrlan and Răzvan-Alexandru Smădu and Dumitru-Clementin Cercel }, journal={arXiv preprint arXiv:2504.07612}, year={ 2025 } }
Comments on this paper