Special-Character Adversarial Attacks on Open-Source Language Model
Main:6 Pages
8 Figures
Bibliography:2 Pages
1 Tables
Appendix:6 Pages
Abstract
Large language models (LLMs) have achieved remarkable performance across diverse natural language processing tasks, yet their vulnerability to character-level adversarial manipulations presents significant security challenges for real-world deployments.
View on arXivComments on this paper
