Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

26 February 2024
Tianyi Tang
Wenyang Luo
Haoyang Huang
Dongdong Zhang
Xiaolei Wang
Xin Zhao
Furu Wei
Ji-Rong Wen

Papers citing "Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models"

36 / 36 papers shown
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
Chuancheng Shi
Shangze Li
Shiming Guo
Simiao Xie
Wenhua Wu
...
Canran Xiao
Cong Wang
Zifeng Cheng
Fei Shen
Tat-Seng Chua
VLM
222
0
0
21 Nov 2025
LatentPrintFormer: A Hybrid CNN-Transformer with Spatial Attention for Latent Fingerprint identification
Arnab Maity
Manasa
Pavan Kumar C
Raghavendra Ramachandra
283
0
0
11 Nov 2025
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models
Daniil Gurgurov
Josef van Genabith
Simon Ostermann
MoE
198
0
0
15 Oct 2025
Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots
Damir Nurtdinov
Aliaksei Korshuk
Alexei Kornaev
Alexander Maloletov
73
0
0
09 Oct 2025
Multilingual Routing in Mixture-of-Experts
Lucas Bandarkar
Chenyuan Yang
Mohsen Fayyaz
Junlin Hu
Nanyun Peng
MoE
152
0
0
06 Oct 2025
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
Tolúlọpé Ògúnrèmí
Christopher D. Manning
Dan Jurafsky
Karen Livescu
AuLLM
207
0
0
02 Oct 2025
Understanding Post-Training Structural Changes in Large Language Models
Xinyu He
Xianghui Cao
158
0
0
22 Sep 2025
What if I ask in alia lingua? Measuring Functional Similarity Across Languages
Debangan Mishra
Arihant Rastogi
Agyeya Negi
Shashwat Goel
Ponnurangam Kumaraguru
122
0
0
04 Sep 2025
Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages
Yuemei Xu
Kexin Xu
Jian Zhou
Ling Hu
Lin Gui
143
1
0
23 Aug 2025
Isolating Culture Neurons in Multilingual Large Language Models
Danial Namazifard
Lukas Galke
159
1
0
04 Aug 2025
Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation
Daniil Gurgurov
Katharina Trinley
Yusser Al Ghussin
Tanja Baeumel
Josef van Genabith
Simon Ostermann
MILM
246
2
0
30 Jul 2025
Unveiling the Influence of Amplifying Language-Specific Neurons
Inaya Rahmanisa
Lyzander Marciano Andrylie
Mahardika Krisna Ihsani
Alfan Farizki Wicaksono
Haryo Akbarianto Wibowo
Alham Fikri Aji
137
0
0
30 Jul 2025
What Language(s) Does Aya-23 Think In? How Multilinguality Affects Internal Language Representations
Katharina Trinley
Toshiki Nakai
Tatiana Anikina
Tanja Baeumel
202
1
0
27 Jul 2025
AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models
Chih-Kai Yang
Neo Ho
Yi-Jyun Lee
Hung-yi Lee
AuLLM
373
4
0
05 Jun 2025
Pruning General Large Language Models into Customized Expert Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yirao Zhao
Guizhen Chen
Kenji Kawaguchi
Lidong Bing
Wenxuan Zhang
206
0
0
03 Jun 2025
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective
Shimao Zhang
Z. Lai
Xiang Liu
Shuaijie She
Xiao Liu
Yeyun Gong
Shujian Huang
Jiajun Chen
298
1
0
27 May 2025
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Florian Eichin
Yupei Du
Philipp Mondorf
Maria Matveev
Barbara Plank
Michael A. Hedderich
FAtt
466
0
0
26 May 2025
Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline
Meng Lu
Ruochen Zhang
Carsten Eickhoff
Ellie Pavlick
HILM
KELM
LRM
347
7
0
26 May 2025
Understanding How Value Neurons Shape the Generation of Specified Values in LLMs
Yi Su
Jiayi Zhang
Shu Yang
Xinhai Wang
Lijie Hu
Di Wang
OffRL
414
5
0
23 May 2025
When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners
Weixiang Zhao
Jiahe Guo
Yang Deng
Tongtong Wu
Wenxuan Zhang
...
Yanyan Zhao
Wanxiang Che
Bing Qin
Tat-Seng Chua
Ting Liu
LRM
431
1
0
21 May 2025
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
Haoming Huang
Yibo Yan
Jiahao Huo
Xin Zou
Xinfeng Li
Kun Wang
Xuming Hu
570
1
0
20 May 2025
Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning
Jiahua Lan
Sen Zhang
Haixia Pan
Ruijun Liu
Li Shen
Dacheng Tao
CLL
282
0
0
09 Apr 2025
SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers
Yanzheng Xiang
Hanqi Yan
Shuyin Ouyang
Lin Gui
Yulan He
397
13
0
31 Mar 2025
Uncovering inequalities in new knowledge learning by large language models across different languages
Chenglong Wang
Haoyu Tang
Xiyuan Yang
Yueqi Xie
Jina Suh
...
Junming Huang
Yu Xie
Zhaoya Gong
Xing Xie
Fangzhao Wu
288
2
0
06 Mar 2025
Deciphering Functions of Neurons in Vision-Language Models
Jiaqi Xu
Cuiling Lan
Xuejin Chen
VLM
863
0
0
10 Feb 2025
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
International Conference on Learning Representations (ICLR), 2024
Zhaofeng Wu
Xinyan Velocity Yu
Dani Yogatama
Jiasen Lu
Yoon Kim
AIFin
486
37
0
07 Nov 2024
Neuron-based Personality Trait Induction in Large Language Models
Jia Deng
Tianyi Tang
Yanbin Yin
Wenhao Yang
Wayne Xin Zhao
Ji-Rong Wen
238
3
0
16 Oct 2024
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
International Conference on Learning Representations (ICLR), 2024
Guorui Zheng
Xidong Wang
Juhao Liang
Nuo Chen
Yuping Zheng
Benyou Wang
MoE
310
10
0
14 Oct 2024
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
Zhipeng Chen
Liang Song
K. Zhou
Wayne Xin Zhao
Binghai Wang
Weipeng Chen
Ji-Rong Wen
386
0
0
10 Oct 2024
MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models
Kaichen Huang
Jiahao Huo
Yibo Yan
Kun Wang
Yutao Yue
Xuming Hu
244
2
0
07 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
328
4
0
06 Oct 2024
Mitigating Copy Bias in In-Context Learning through Neuron Pruning
Ameen Ali
Lior Wolf
Ivan Titov
193
6
0
02 Oct 2024
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Lucas Bandarkar
Benjamin Muller
Pritish Yuvraj
Rui Hou
Nayan Singhal
Hongjiang Lv
Bing-Quan Liu
KELM
LRM
MoMe
443
13
0
02 Oct 2024
Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
Yongqi Leng
Deyi Xiong
381
16
0
09 Jul 2024
Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs
Weixuan Wang
Barry Haddow
Minghao Wu
Alexandra Birch
MILM
379
28
0
13 Jun 2024
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
Jack Merullo
Carsten Eickhoff
Ellie Pavlick
538
31
0
13 Jun 2024