Knowledge Neurons in Pretrained Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

18 April 2021

Damai Dai

Li Dong

Y. Hao

Zhifang Sui

Baobao Chang

Furu Wei

KELM

Papers citing "Knowledge Neurons in Pretrained Transformers"

50 / 410 papers shown

Parameter Importance-Driven Continual Learning for Foundation Models

492

19 Nov 2025

Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty

102

17 Nov 2025

Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective

203

11 Nov 2025

On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception

Sanaz Saki Norouzi

Mohammad Masjedi

Pascal Hitzler

128

09 Nov 2025

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

152

03 Nov 2025

Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs

148

31 Oct 2025

Layer of Truth: Probing Belief Shifts under Continual Pre-Training Poisoning

369

29 Oct 2025

From Memorization to Reasoning in the Spectrum of Loss Curvature

219

28 Oct 2025

Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs

368

25 Oct 2025

Probing Neural Combinatorial Optimization Models

107

25 Oct 2025

Model-Aware Tokenizer Transfer

Mykola Haltiuk

Aleksander Smywiński-Pohl

122

24 Oct 2025

A Graph Signal Processing Framework for Hallucination Detection in Large Language Models

Valentin Noël

135

21 Oct 2025

From Memorization to Generalization: Fine-Tuning Large Language Models for Biomedical Term-to-Identifier Normalization

Suswitha Pericharla

D. B. Hier

Tayo Obafemi-Ajayi

148

21 Oct 2025

Neuronal Group Communication for Efficient Neural representation

Zhengqi Pei

Qingming Huang

Shuhui Wang

114

19 Oct 2025

Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization

Tina Behnia

Puneesh Deora

Christos Thrampoulidis

118

17 Oct 2025

Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain

152

15 Oct 2025

Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models

202

15 Oct 2025

Medical Interpretability and Knowledge Maps of Large Language Models

Razvan Marinescu

Victoria-Elisabeth Gruber

Diego Fajardo

FAtt AI4MH

240

13 Oct 2025

Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization

112

12 Oct 2025

The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities

129

11 Oct 2025

ADEPT: Continual Pretraining via Adaptive Expansion and Dynamic Decoupled Tuning

138

11 Oct 2025

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

09 Oct 2025

Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots

09 Oct 2025

POME: Post Optimization Model Edit via Muon-style Projection

103

08 Oct 2025

Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs

Fatmazohra Rezkellah

Ramzi Dakhmouche

AAML MU

225

03 Oct 2025

LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models

152

03 Oct 2025

What Drives Compositional Generalization in Visual Generative Models?

328

03 Oct 2025

Muon Outperforms Adam in Tail-End Associative Memory Learning

175

30 Sep 2025

Pretraining with hierarchical memories: separating long-tail and common knowledge

249

29 Sep 2025

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

375

28 Sep 2025

Knowledge Homophily in Large Language Models

125

28 Sep 2025

Timber: Training-free Instruct Model Refining with Base via Effective Rank

140

28 Sep 2025

Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms

28 Sep 2025

Hedonic Neurons: A Mechanistic Mapping of Latent Coalitions in Transformer MLPs

133

28 Sep 2025

Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models

Junjie Yao

Zhi-hai Xu

139

24 Sep 2025

Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens

153

03 Sep 2025

Unraveling LLM Jailbreaks Through Safety Knowledge Neurons

Chongwen Zhao

Kaizhu Huang

AAML KELM

169

01 Sep 2025

DFAMS: Dynamic-flow guided Federated Alignment based Multi-prototype Search

...

185

28 Aug 2025

Provable Benefits of In-Tool Learning for Large Language Models

155

28 Aug 2025

LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation

138

27 Aug 2025

Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models

25 Aug 2025

From Confidence to Collapse in LLM Factual Robustness

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

230

22 Aug 2025

Side Effects of Erasing Concepts from Diffusion Models

240

20 Aug 2025

WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification

Thang Duc Tran

Thai Hoang Le

129

06 Aug 2025

Understanding and Mitigating Political Stance Cross-topic Generalization in Large Language Models

214

04 Aug 2025

Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations

Dahee Kwon

Sehyun Lee

Jaesik Choi

171

03 Aug 2025

Prompting Large Language Models with Partial Knowledge for Answering Questions with Unseen Entities

136

02 Aug 2025

Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models

190

01 Aug 2025

Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes

205

25 Jul 2025

Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning

171

14 Jul 2025