Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

30 July 2025

Daniil Gurgurov

Main: 9 pages | Bibliography: 3 pages | Appendix: 15 pages | 29 figures | 9 tables

Abstract

Large language models (LLMs) exhibit strong multilingual abilities, yet the neural mechanisms behind language-specific processing remain unclear. We analyze language-specific neurons in Llama-3.1-8B, Mistral-Nemo-12B, and Aya-Expanse (8B and 32B) across 21 typologically diverse languages, identifying neurons that control language behavior. Using the Language Activation Probability Entropy (LAPE) method, we show that these neurons cluster in deeper layers, with non-Latin scripts showing greater specialization. Related languages share overlapping neurons, reflecting internal representations of linguistic proximity.
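
The abstract names LAPE but does not define it. As a rough illustration only, the sketch below follows the common description of the method: for each neuron, estimate how often it fires on text in each language, normalize those rates into a distribution over languages, and score the neuron by the entropy of that distribution. A low entropy means the neuron fires mostly for one or a few languages. The function names, the input layout (one row of activation probabilities per language), and the selection fraction are assumptions made for this example, not details taken from the paper.

```python
import math


def lape_scores(act_prob):
    """Entropy of each neuron's activation distribution over languages.

    act_prob[i][j] is assumed to hold the fraction of tokens in language i
    on which neuron j is active (for example, activation > 0).
    """
    n_languages = len(act_prob)
    n_neurons = len(act_prob[0])
    scores = []
    for j in range(n_neurons):
        column = [act_prob[i][j] for i in range(n_languages)]
        total = sum(column)
        if total == 0:
            # Assumption: a neuron that never fires is treated as
            # maximally unspecific (uniform-distribution entropy).
            scores.append(math.log(n_languages))
            continue
        # Normalize across languages, then take the Shannon entropy.
        entropy = -sum((p / total) * math.log(p / total) for p in column if p > 0)
        scores.append(entropy)
    return scores


def select_language_neurons(act_prob, fraction=0.01):
    """Indices of the lowest-entropy neurons (hypothetical cutoff)."""
    scores = lape_scores(act_prob)
    k = max(1, round(fraction * len(scores)))
    return sorted(range(len(scores)), key=scores.__getitem__)[:k]


# Toy input: 3 languages, 2 neurons.
# Neuron 0 fires only for the first language; neuron 1 fires equally for all.
toy = [
    [0.9, 0.3],
    [0.0, 0.3],
    [0.0, 0.3],
]
scores = lape_scores(toy)
selected = select_language_neurons(toy, fraction=0.5)
```

In this toy input, neuron 0 receives an entropy of zero and is selected, while neuron 1 receives the maximum entropy for three languages, log 3. In practice the activation probabilities would be collected from the model's feed-forward layers over a corpus in each language, which this sketch leaves out.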
