Decoupling Content and Expression: Two-Dimensional Detection of AI-Generated Text

1 March 2025

Abstract

The wide usage of LLMs raises critical requirements on detecting AI participation in texts. Existing studies investigate these detections in scattered contexts, leaving a systematic and unified approach unexplored. In this paper, we present HART, a hierarchical framework of AI risk levels, each corresponding to a detection task. To address these tasks, we propose a novel 2D Detection Method, decoupling a text into content and language expression. Our findings show that content is resistant to surface-level changes, which can serve as a key feature for detection. Experiments demonstrate that 2D method significantly outperforms existing detectors, achieving an AUROC improvement from 0.705 to 0.849 for level-2 detection and from 0.807 to 0.886 for RAID. We release our data and code atthis https URL.

View on arXiv

@article{bao2025_2503.00258,
  title={ Decoupling Content and Expression: Two-Dimensional Detection of AI-Generated Text },
  author={ Guangsheng Bao and Lihua Rong and Yanbin Zhao and Qiji Zhou and Yue Zhang },
  journal={arXiv preprint arXiv:2503.00258},
  year={ 2025 }
}

Comments on this paper