arXiv: 2410.08201

Efficient Dictionary Learning with Switch Sparse Autoencoders

International Conference on Learning Representations (ICLR), 2024

10 October 2024

Eric J. Michaud

Christian Schroeder de Witt

Papers citing "Efficient Dictionary Learning with Switch Sparse Autoencoders"

18 / 18 papers shown

Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder

278

0

0

07 Nov 2025

Does higher interpretability imply better utility? A Pairwise Analysis on Sparse Autoencoders

208

1

0

04 Oct 2025

LLM Interpretability with Identifiable Temporal-Instantaneous Representation

128

0

0

27 Sep 2025

The Secret Agenda: LLMs Strategically Lie and Our Current Safety Tools Are Blind

105

1

0

23 Sep 2025

AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations

172

0

0

24 Aug 2025

Attention Layers Add Into Low-Dimensional Residual Subspaces

166

0

0

23 Aug 2025

Probing the Representational Power of Sparse Autoencoders in Vision Models

Matthew Lyle Olson

212

1

0

15 Aug 2025

Interpreting CFD Surrogates through Sparse Autoencoders

131

0

0

21 Jul 2025

Incorporating Hierarchical Semantics in Sparse Autoencoder Architectures

Sean Richardson

212

2

0

01 Jun 2025

Kronecker Factorization Improves Efficiency and Interpretability of Sparse Autoencoders

Yaroslav Aksenov

Daniil Gavrilov

Nikita Balagansky

248

0

0

28 May 2025

Sparsification and Reconstruction from the Perspective of Representation Geometry

237

0

0

28 May 2025

Evaluating Adversarial Robustness of Concept Representations in Sparse Autoencoders

Aaron Jiaxun Li

Himabindu Lakkaraju

370

4

0

21 May 2025

Are Sparse Autoencoders Useful for Java Function Bug Detection?

Henrique Lopes Cardoso

413

1

0

15 May 2025

Revisiting End-To-End Sparse Autoencoder Training: A Short Finetune Is All You Need

282

1

0

21 Mar 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability

Joseph Isaac Bloom

...

Matthew Wearden

593

51

0

12 Mar 2025

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

Subhash Kantamneni

Senthooran Rajamanoharan

356

46

0

23 Feb 2025

Steering Language Model Refusal with Sparse Autoencoders

Xavier Fernandes

Blake Bullwinkel

Forough Poursabzi-Sangdeh

389

40

0

18 Nov 2024

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Junxuan Wang

...

Qipeng Guo

Xuanjing Huang

Xipeng Qiu

337

77

0

27 Oct 2024