v1v2 (latest)

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

26 February 2024

Papers citing "Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models"

36 / 36 papers shown

Understanding Post-Training Structural Changes in Large Language Models

Xinyu He

Xianghui Cao

167

30 Jan 2026

Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation

...

227

21 Nov 2025

LatentPrintFormer: A Hybrid CNN-Transformer with Spatial Attention for Latent Fingerprint identification

Arnab Maity

Manasa

Pavan Kumar C

Raghavendra Ramachandra

286

11 Nov 2025

Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models

202

15 Oct 2025

Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots

09 Oct 2025

Multilingual Routing in Mixture-of-Experts

162

06 Oct 2025

Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models

Tolúl\d{o}pé Ògúnrèmí

Christopher D. Manning

Dan Jurafsky

Karen Livescu

AuLLM

208

02 Oct 2025

$What if I ask in \textit{alia lingua}? Measuring Functional Similarity Across Languages$

What if I ask in \textit{alia lingua}? Measuring Functional Similarity Across Languages

Ponnurangam Kumaraguru

129

04 Sep 2025

Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages

145

23 Aug 2025

Isolating Culture Neurons in Multilingual Large Language Models

Danial Namazifard

Lukas Galke

160

04 Aug 2025

Unveiling the Influence of Amplifying Language-Specific Neurons

Inaya Rahmanisa

Lyzander Marciano Andrylie

Mahardika Krisna Ihsani

Alfan Farizki Wicaksono

Haryo Akbarianto Wibowo

Alham Fikri Aji

153

30 Jul 2025

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

249

30 Jul 2025

What Language(s) Does Aya-23 Think In? How Multilinguality Affects Internal Language Representations

206

27 Jul 2025

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

376

05 Jun 2025

Pruning General Large Language Models into Customized Expert ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

213

03 Jun 2025

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

306

27 May 2025

Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline

359

26 May 2025

ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior

473

26 May 2025

Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

433

23 May 2025

When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners

...

439

21 May 2025

Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis

601

20 May 2025

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning

290

09 Apr 2025

SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers

413

31 Mar 2025

Uncovering inequalities in new knowledge learning by large language models across different languages

...

296

06 Mar 2025

Deciphering Functions of Neurons in Vision-Language Models

877

10 Feb 2025

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and ModalitiesInternational Conference on Learning Representations (ICLR), 2024

506

07 Nov 2024

Neuron-based Personality Trait Induction in Large Language Models

243

16 Oct 2024

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family ExpertsInternational Conference on Learning Representations (ICLR), 2024

Xidong Wang

311

14 Oct 2024

Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models

410

10 Oct 2024

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Kun Wang

Xuming Hu

251

07 Oct 2024

CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text

Jun Hirako

Ryohei Sasano

Koichi Takeda

335

06 Oct 2024

Mitigating Copy Bias in In-Context Learning through Neuron Pruning

Ameen Ali

Lior Wolf

Ivan Titov

202

02 Oct 2024

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

456

02 Oct 2024

Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons

Yongqi Leng

Deyi Xiong

394

09 Jul 2024

Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs

393

13 Jun 2024

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models

Jack Merullo

Carsten Eickhoff

Ellie Pavlick

549

13 Jun 2024