v1v2 (latest)

RADAR: Retrieval-Augmented Detector with Adversarial Refinement for Robust Fake News Detection

7 January 2026

Song-Duo Ma

Yi-Hung Liu

Hsin-Yu Lin

Pin-Yu Chen

Hong-Yan Huang

Shau-Yung Hsu

Yun-Nung Chen

AAML

ArXiv (abs)PDF HTML Github

Main:8 Pages

4 Figures

Bibliography:2 Pages

5 Tables

Appendix:2 Pages

Abstract

To efficiently combat the spread of LLM-generated misinformation, we present RADAR, a Retrieval-Augmented Detector with Adversarial Refinement for robust fake news detection. Our approach employs a generator that rewrites real articles with factual perturbations, paired with a lightweight detector that verifies claims using dense passage retrieval. To enable effective co-evolution, we introduce verbal adversarial feedback (VAF). Rather than relying on scalar rewards, VAF issues structured natural-language critiques; these guide the generator toward more sophisticated evasion attempts, compelling the detector to adapt and improve. On a fake news detection benchmark, RADAR consistently outperforms strong retrieval-augmented trainable baselines, as well as general-purpose LLMs with retrieval. Further analysis shows that detector-side retrieval yields the largest gains, while VAF and few-shot demonstrations provide complementary benefits. RADAR also transfers better to fake news generated by an unseen external attacker, indicating improved robustness beyond the co-evolved training setting.

View on arXiv

Comments on this paper