
Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object Identification. North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Optimal ablation for interpretability. Neural Information Processing Systems (NeurIPS), 2024