Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.01550
Cited By
Representation Bending for Large Language Model Safety
2 April 2025
Ashkan Yousefpour
Taeheon Kim
Ryan S. Kwon
Seungbeen Lee
Wonje Jeung
Seungju Han
Alvin Wan
Harrison Ngan
Youngjae Yu
Jonghyun Choi
AAML
ALM
KELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Representation Bending for Large Language Model Safety"
Title
No papers