Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Aware Unlearning with Proxy Constraint

28 August 2025

Zhihao Liu

ArXiv (abs)PDF HTML Github

Main:11 Pages

11 Figures

Bibliography:2 Pages

11 Tables

Appendix:4 Pages

Abstract

Large language models (LLMs) are trained on massive datasets that may include private or copyrighted content. Due to growing privacy and ownership concerns, data owners may request the removal of their data from trained models. Machine unlearning provides a practical solution by removing the influence of specific data without full retraining. However, most existing methods lack a sound forgetting boundary, causing some samples to be under-forgotten, leaving residual leakage risks, while others remain over-forgotten at the expense of degraded utility.

View on arXiv

Comments on this paper