v1v2 (latest)

Knowledge Neurons in Pretrained Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

18 April 2021

Damai Dai

Li Dong

Y. Hao

Zhifang Sui

Baobao Chang

Furu Wei

KELM

ArXiv (abs)PDF HTML Github (168★)

Papers citing "Knowledge Neurons in Pretrained Transformers"

50 / 410 papers shown

Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning

153

14 Jul 2025

Flexible Feature Distillation for Large Language Models

Khouloud Saadi

Di Wang

263

14 Jul 2025

Steering Information Utility in Key-Value Memory for Language Model Post-Training

364

07 Jul 2025

Sparse Feature Coactivation Reveals Causal Semantic Modules in Large Language Models

184

22 Jun 2025

From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers

Jingtong Su

Julia Kempe

Karen Ullrich

268

20 Jun 2025

Representation Consistency for Accurate and Coherent LLM Answer Aggregation

187

18 Jun 2025

Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs

Sayed Mohammad Vakilzadeh Hatefi

246

16 Jun 2025

Beyond Frequency: The Role of Redundancy in Large Language Model Memorization

128

14 Jun 2025

Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping

169

09 Jun 2025

Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness

...

514

06 Jun 2025

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

373

05 Jun 2025

MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs

203

05 Jun 2025

Establishing Trustworthy LLM Evaluation via Shortcut Neuron AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

223

04 Jun 2025

Beyond Memorization: A Rigorous Evaluation Framework for Medical Knowledge Editing

368

04 Jun 2025

Is Random Attention Sufficient for Sequence Modeling? Disentangling Trainable Components in the Transformer

441

01 Jun 2025

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

386

30 May 2025

InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

209

28 May 2025

Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization

239

28 May 2025

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

...

211

27 May 2025

Understanding the learned look-ahead behavior of chess neural networks

Diogo Cruz

314

26 May 2025

A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models

292

25 May 2025

Benchmarking and Rethinking Knowledge Editing for Large Language Models

218

24 May 2025

Disentangling Knowledge Representations for Large Language Model Editing

172

24 May 2025

TRACE for Tracking the Emergence of Semantic Representations in Transformers

Nura Aljaafari

Danilo S. Carvalho

André Freitas

240

23 May 2025

Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs

Zeping Yu

Sophia Ananiadou

MoMe KELM CLL

254

22 May 2025

The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation

345

21 May 2025

Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts

337

21 May 2025

447

18 May 2025

EAMET: Robust Massive Model Editing via Embedding Alignment Optimization

231

17 May 2025

On the Superimposed Noise Accumulation Problem in Sequential Knowledge Editing of Large Language Models

405

12 May 2025

Defending against Indirect Prompt Injection by Instruction Detection

315

08 May 2025

Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces

Michael Veillet-Guillem

MILM

287

30 Apr 2025

SetKE: Knowledge Editing for Knowledge Elements OverlapInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

271

29 Apr 2025

Exploring How LLMs Capture and Represent Domain-Specific Knowledge

Mirian Hipolito Garcia

Camille Couturier

Daniel Madrigal Diaz

Ankur Mallick

Anastasios Kyrillidis

Robert Sim

Victor Rühle

Saravan Rajmohan

380

23 Apr 2025

Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric

270

10 Apr 2025

Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression

399

10 Apr 2025

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning

283

09 Apr 2025

Les Dissonances: Cross-Tool Harvesting and Polluting in Pool-of-Tools Empowered LLM Agents

286

04 Apr 2025

Towards Understanding How Knowledge Evolves in Large Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2025

796

31 Mar 2025

Intra-neuronal attention within language models Relationships between activation and semantics

Corbet Alois Georgeon

Michael Veillet-Guillem

MILM

256

17 Mar 2025

Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms

192

15 Mar 2025

Discovering Influential Neuron Path in Vision TransformersInternational Conference on Learning Representations (ICLR), 2025

605

12 Mar 2025

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Eric Zhao

Pranjal Awasthi

Nika Haghtalab

172

07 Mar 2025

Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-TuningInternational Conference on Learning Representations (ICLR), 2025

...

265

01 Mar 2025

Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective

363

28 Feb 2025

Capability Localization: Capabilities Can be Localized rather than Individual KnowledgeInternational Conference on Learning Representations (ICLR), 2025

277

28 Feb 2025

Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Tianyi Lorena Yan

Robin Jia

KELM MU

316

27 Feb 2025

Synthetic Categorical Restructuring large Or How AIs Gradually Extract Efficient Regularities from Their Experience of the World

Michael Veillet-Guillem

244

25 Feb 2025

Model LakesInternational Conference on Extending Database Technology (EDBT), 2024

Koyena Pal

David Bau

Renée J. Miller

343

24 Feb 2025

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

318

23 Feb 2025