Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.17420
Cited By
The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence
24 February 2025
Tom Wollschlager
Jannes Elstner
Simon Geisler
Vincent Cohen-Addad
Stephan Günnemann
Johannes Gasteiger
LLMSV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence"
1 / 1 papers shown
Title
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
Yan Scholten
Stephan Günnemann
Leo Schwinn
MU
38
6
0
04 Oct 2024
1