2509.17665
Cited By
Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models
22 September 2025
Katharina Simbeck
Mariam Mahran
MILM
LLMSV
Papers citing
"Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models"
GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models
Mariam Mahran
Katharina Simbeck
24 Sep 2025