Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

26 February 2024

Papers citing "Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models"

Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation

...

222

21 Nov 2025

LatentPrintFormer: A Hybrid CNN-Transformer with Spatial Attention for Latent Fingerprint identification

Arnab Maity

Manasa

Pavan Kumar C

Raghavendra Ramachandra

283

11 Nov 2025

Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models

198

15 Oct 2025

Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots

09 Oct 2025

Multilingual Routing in Mixture-of-Experts

152

06 Oct 2025

Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models

Tolúlọpé Ògúnrèmí

Christopher D. Manning

Dan Jurafsky

Karen Livescu

AuLLM

207

02 Oct 2025

Understanding Post-Training Structural Changes in Large Language Models

Xinyu He

Xianghui Cao

158

22 Sep 2025

What if I ask in alia lingua? Measuring Functional Similarity Across Languages

Ponnurangam Kumaraguru

122

04 Sep 2025

Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages

143

23 Aug 2025

Isolating Culture Neurons in Multilingual Large Language Models

Danial Namazifard

Lukas Galke

159

04 Aug 2025

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

246

30 Jul 2025

Unveiling the Influence of Amplifying Language-Specific Neurons

Inaya Rahmanisa

Lyzander Marciano Andrylie

Mahardika Krisna Ihsani

Alfan Farizki Wicaksono

Haryo Akbarianto Wibowo

Alham Fikri Aji

137

30 Jul 2025

What Language(s) Does Aya-23 Think In? How Multilinguality Affects Internal Language Representations

202

27 Jul 2025

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

373

05 Jun 2025

Pruning General Large Language Models into Customized Expert Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

206

03 Jun 2025

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

298

27 May 2025

ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior

466

26 May 2025

Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline

347

26 May 2025

Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

414

23 May 2025

When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners

...

431

21 May 2025

Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis

570

20 May 2025

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning

282

09 Apr 2025

SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers

397

31 Mar 2025

Uncovering inequalities in new knowledge learning by large language models across different languages

...

288

06 Mar 2025

Deciphering Functions of Neurons in Vision-Language Models

863

10 Feb 2025

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

International Conference on Learning Representations (ICLR), 2024

486

07 Nov 2024

Neuron-based Personality Trait Induction in Large Language Models

238

16 Oct 2024

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts

International Conference on Learning Representations (ICLR), 2024

Xidong Wang

310

14 Oct 2024

Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models

386

10 Oct 2024

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Kun Wang

Xuming Hu

244

07 Oct 2024

CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text

Jun Hirako

Ryohei Sasano

Koichi Takeda

328

06 Oct 2024

Mitigating Copy Bias in In-Context Learning through Neuron Pruning

Ameen Ali

Lior Wolf

Ivan Titov

193

02 Oct 2024

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models

International Conference on Learning Representations (ICLR), 2024

443

02 Oct 2024

Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons

Yongqi Leng

Deyi Xiong

381

09 Jul 2024

Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs

379

13 Jun 2024

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models

Jack Merullo

Carsten Eickhoff

Ellie Pavlick

538

13 Jun 2024