Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.15594
Cited By
SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention
24 February 2025
Jiaqi Wu
Chen Chen
Chunyan Hou
Xiaojie Yuan
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention"
Title
No papers