Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.02193
Cited By
Improving Steering Vectors by Targeting Sparse Autoencoder Features
4 November 2024
Sviatoslav Chalnev
Matthew Siu
Arthur Conmy
LLMSV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Steering Vectors by Targeting Sparse Autoencoder Features"
2 / 2 papers shown
Title
Patterns and Mechanisms of Contrastive Activation Engineering
Yixiong Hao
Ayush Panda
Stepan Shabalin
Sheikh Abdur Raheem Ali
LLMSV
58
0
0
06 May 2025
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study
Yang Xu
Y. Wang
Hao Wang
49
1
0
23 Dec 2024
1