ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.04430
  4. Cited By
Towards Unifying Interpretability and Control: Evaluation via Intervention

Towards Unifying Interpretability and Control: Evaluation via Intervention

7 November 2024
Usha Bhalla
Suraj Srinivas
Asma Ghandeharioun
Himabindu Lakkaraju
ArXivPDFHTML

Papers citing "Towards Unifying Interpretability and Control: Evaluation via Intervention"

1 / 1 papers shown
Title
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment
Harrish Thasarathan
Julian Forsyth
Thomas Fel
M. Kowal
Konstantinos G. Derpanis
86
7
0
06 Feb 2025
1