Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2509.17665
Cited By

Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models

Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models

22 September 2025

Katharina Simbeck

ArXiv (abs)PDF HTML Github (1★)

Papers citing "Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models"

1 / 1 papers shown

GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models

GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models

Katharina Simbeck

273

0

0

24 Sep 2025