ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.17665
  4. Cited By
Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models

Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models

22 September 2025
Katharina Simbeck
Mariam Mahran
    MILMLLMSV
ArXiv (abs)PDFHTMLGithub (1★)

Papers citing "Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models"

1 / 1 papers shown
GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models
GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models
Mariam Mahran
Katharina Simbeck
273
0
0
24 Sep 2025
1