Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2505.16505
Cited By

Sparse Activation Editing for Reliable Instruction Following in Narratives

Sparse Activation Editing for Reliable Instruction Following in Narratives

22 May 2025

ArXiv (abs)PDF HTML

Papers citing "Sparse Activation Editing for Reliable Instruction Following in Narratives"

6 / 6 papers shown

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement

183

0

0

28 Sep 2025

RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following

RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-FollowingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

107

12

0

17 Feb 2025

A Survey of Personalized Large Language Models: Progress and Future Directions

A Survey of Personalized Large Language Models: Progress and Future Directions

337

30

0

17 Feb 2025

SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models

SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models

305

16

0

17 Feb 2025

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation EngineeringNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Aryo Pradipta Gema

Hongru Wang

Pasquale Minervini

320

49

0

21 Oct 2024

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Eric J. Michaud

Yonatan Belinkov

568

251

0

28 Mar 2024