229

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

Main:9 Pages
29 Figures
Bibliography:3 Pages
9 Tables
Appendix:15 Pages
Abstract

Large language models (LLMs) exhibit strong multilingual abilities, yet the neural mechanisms behind language-specific processing remain unclear. We analyze language-specific neurons in Llama-3.1-8B, Mistral-Nemo-12B, and Aya-Expanse-8B & 32B across 21 typologically diverse languages, identifying neurons that control language behavior. Using the Language Activation Probability Entropy (LAPE) method, we show that these neurons cluster in deeper layers, with non-Latin scripts showing greater specialization. Related languages share overlapping neurons, reflecting internal representations of linguistic proximity.

View on arXiv
Comments on this paper