RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering. North American Chapter of the Association for Computational Linguistics (NAACL), 2024 |