Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.19721
Cited By
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs
27 February 2025
Hannah Cyberey
Yangfeng Ji
David E. Evans
LLMSV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs"
1 / 1 papers shown
Title
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control
Hannah Cyberey
David E. Evans
LLMSV
74
0
0
23 Apr 2025
1