Sparse Activation Editing for Reliable Instruction Following in Narratives
arXiv: 2505.16505
22 May 2025
Runcong Zhao
Chengyu Cao
Qinglin Zhu
Xiucheng Lv
Shun Shao
Lin Gui
Ruifeng Xu
Yulan He

Papers citing "Sparse Activation Editing for Reliable Instruction Following in Narratives"

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Anyi Wang
Xuansheng Wu
Dong Shu
Yunpu Ma
Ninghao Liu
LLMSV
183
0
0
28 Sep 2025
RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Junru Lu
Jiazheng Li
Guodong Shen
Lin Gui
Siyu An
Yulan He
Di Yin
Xing Sun
107
12
0
17 Feb 2025
A Survey of Personalized Large Language Models: Progress and Future Directions
Jiahong Liu
Zexuan Qiu
Zhongyang Li
Quanyu Dai
Jieming Zhu
Minda Hu
Menglin Yang
Irwin King
Tat-Seng Chua
LM&MA
337
30
0
17 Feb 2025
SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models
Z. He
Haiyan Zhao
Yiran Qiao
Fan Yang
Ali Payani
Jing Ma
Jundong Li
LLMSV
305
16
0
17 Feb 2025
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yu Zhao
Alessio Devoto
Giwon Hong
Xiaotang Du
Aryo Pradipta Gema
Hongru Wang
Xuanli He
Kam-Fai Wong
Pasquale Minervini
KELM, LLMSV
320
49
0
21 Oct 2024
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Samuel Marks
Can Rager
Eric J. Michaud
Yonatan Belinkov
David Bau
Aaron Mueller
568
251
0
28 Mar 2024