RAP-SM: Robust Adversarial Prompt via Shadow Models for Copyright Verification of Large Language Models

Recent advances in large language models (LLMs) have underscored the importance of safeguarding intellectual property rights through robust fingerprinting techniques. Traditional fingerprint verification approaches typically focus on a single model, seeking to improve the robustness of itsthis http URL, these single-model methods often struggle to capture intrinsic commonalities across multiple related models. In this paper, we propose RAP-SM (Robust Adversarial Prompt via Shadow Models), a novel framework that extracts a public fingerprint for an entire series of LLMs. Experimental results demonstrate that RAP-SM effectively captures the intrinsic commonalities among different models while exhibiting strong adversarial robustness. Our findings suggest that RAP-SM presents a valuable avenue for scalable fingerprint verification, offering enhanced protection against potential model breaches in the era of increasingly prevalent LLMs.
View on arXiv@article{xu2025_2505.06304, title={ RAP-SM: Robust Adversarial Prompt via Shadow Models for Copyright Verification of Large Language Models }, author={ Zhenhua Xu and Zhebo Wang and Maike Li and Wenpeng Xing and Chunqiang Hu and Chen Zhi and Meng Han }, journal={arXiv preprint arXiv:2505.06304}, year={ 2025 } }