Knowledge Neurons in Pretrained Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
18 April 2021
Damai Dai
Li Dong
Y. Hao
Zhifang Sui
Baobao Chang
Furu Wei
    KELM, MU
ArXiv (abs) | PDF | HTML | GitHub (168★)

Papers citing "Knowledge Neurons in Pretrained Transformers"

50 / 410 papers shown
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Zijun Chen
Wenbo Hu
Richang Hong
LRM
153
0
0
14 Jul 2025
Flexible Feature Distillation for Large Language Models
Khouloud Saadi
Di Wang
263
0
0
14 Jul 2025
Steering Information Utility in Key-Value Memory for Language Model Post-Training
Chunyuan Deng
Ruidi Chang
Hanjie Chen
LLMSV
364
0
0
07 Jul 2025
Sparse Feature Coactivation Reveals Causal Semantic Modules in Large Language Models
Ruixuan Deng
Xiaoyang Hu
Miles Gilberti
Shane Storks
Aman Taxali
Mike Angstadt
Chandra S. Sripada
Joyce Chai
184
0
0
22 Jun 2025
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
Jingtong Su
Julia Kempe
Karen Ullrich
268
3
0
20 Jun 2025
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Junqi Jiang
Tom Bewley
Salim I. Amoukou
Francesco Leofante
Antonio Rago
Saumitra Mishra
Francesca Toni
187
1
0
18 Jun 2025
Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs
Sayed Mohammad Vakilzadeh Hatefi
Maximilian Dreyer
Reduan Achtibat
Patrick Kahardipraja
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
246
2
0
16 Jun 2025
Beyond Frequency: The Role of Redundancy in Large Language Model Memorization
Jie Zhang
Qinghua Zhao
Lei Li
Chi-ho Lin
Lei Li
128
0
0
14 Jun 2025
Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping
Nitin Sharma
Thomas Wolfers
Çağatay Yıldız
ALM
169
0
0
09 Jun 2025
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness
Rongzhe Wei
Peizhi Niu
Hans Hao-Hsun Hsu
Ruihan Wu
Haoteng Yin
...
Vamsi K. Potluru
Eli Chien
Kamalika Chaudhuri
S. Rasoul Etesami
P. Li
MU, KELM
514
6
0
06 Jun 2025
AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models
Chih-Kai Yang
Neo Ho
Yi-Jyun Lee
Hung-yi Lee
AuLLM
373
4
0
05 Jun 2025
MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs
Zhenyan Lu
Daliang Xu
Dongqi Cai
Zexi Li
Wei Liu
Fangming Liu
Shangguang Wang
Mengwei Xu
KELM
203
1
0
05 Jun 2025
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kejian Zhu
Shangqing Tu
Zhuoran Jin
Lei Hou
Juanzi Li
Jun Zhao
KELM
223
0
0
04 Jun 2025
Beyond Memorization: A Rigorous Evaluation Framework for Medical Knowledge Editing
Shigeng Chen
Linhao Luo
Zhangchi Qiu
Yanan Cao
Carl Yang
Shirui Pan
KELM
368
2
0
04 Jun 2025
Is Random Attention Sufficient for Sequence Modeling? Disentangling Trainable Components in the Transformer
Yihe Dong
Lorenzo Noci
Mikhail Khodak
Mufan Li
441
1
0
01 Jun 2025
Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration
Qinglin Zhu
Runcong Zhao
Hanqi Yan
Yulan He
Yudong Chen
Lin Gui
LRM
386
0
0
30 May 2025
InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing
Shuaiyi Li
Zhisong Zhang
Yang Deng
Chenlong Deng
Tianqing Fang
Hongming Zhang
Haitao Mi
Dong Yu
Wai Lam
KELM
209
0
0
28 May 2025
Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization
Yuxi Zhang
Yueting Li
Xinyu Du
Sibo Wang
DiffM, EGVM
239
0
0
28 May 2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Junyan Zhang
Yubo Gao
Yibo Yan
Jia-Chen Gu
Zhaorui Hou
...
Qi Zheng
Song Dai
Yonghua Hei
Junzhuo Li
Xuming Hu
211
3
0
27 May 2025
Understanding the learned look-ahead behavior of chess neural networks
Diogo Cruz
314
0
0
26 May 2025
A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models
Utkarsh Sahu
Zhisheng Qi
Y. Lei
Ryan Rossi
Franck Dernoncourt
Nesreen K. Ahmed
M. Halappanavar
Yao Ma
Yu Wang
292
0
0
25 May 2025
Benchmarking and Rethinking Knowledge Editing for Large Language Models
Guoxiu He
Xin Song
Futing Wang
Aixin Sun
KELM
218
0
0
24 May 2025
Disentangling Knowledge Representations for Large Language Model Editing
Mengqi Zhang
Zisheng Zhou
Xiaotian Ye
Qiang Liu
Zhaochun Ren
Zhumin Chen
Sudipta Singha Roy
KELM
172
4
0
24 May 2025
TRACE for Tracking the Emergence of Semantic Representations in Transformers
Nura Aljaafari
Danilo S. Carvalho
André Freitas
240
1
0
23 May 2025
Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs
Zeping Yu
Sophia Ananiadou
MoMe, KELM, CLL
254
1
0
22 May 2025
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation
Patrick Kahardipraja
Reduan Achtibat
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
345
4
0
21 May 2025
Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts
Michal Golovanevsky
William Rudman
Michael Lepori
Amir Bar
Ritambhara Singh
Carsten Eickhoff
337
5
0
21 May 2025
Truth Neurons
Haohang Li
Yun Feng
Yangyang Yu
Jordan W. Suchow
Zining Zhu
HILM, MILM, KELM
447
1
0
18 May 2025
EAMET: Robust Massive Model Editing via Embedding Alignment Optimization
Yanbo Dai
Zhenlan Ji
Zongjie Li
Shuai Wang
KELM
231
0
0
17 May 2025
On the Superimposed Noise Accumulation Problem in Sequential Knowledge Editing of Large Language Models
Ding Cao
Yuchen Cai
Yuqing Huang
Xiaoxiao He
Rongxi Guo
Guiquan Liu
Guangzhong Sun
KELM
405
0
0
12 May 2025
Defending against Indirect Prompt Injection by Instruction Detection
Tongyu Wen
Chenglong Wang
Xiyuan Yang
Haoyu Tang
Yueqi Xie
Lingjuan Lyu
Zhicheng Dou
Fangzhao Wu
AAML
315
6
0
08 May 2025
Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces
Michael Pichat
William Pogrund
Paloma Pichat
Judicael Poumay
Armanouche Gasparian
Samuel Demarchi
Martin Corbet
Alois Georgeon
Michael Veillet-Guillem
MILM
287
0
0
30 Apr 2025
SetKE: Knowledge Editing for Knowledge Elements Overlap
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Yifan Wei
Xiaoyan Yu
Ran Song
Hao Peng
Angsheng Li
KELM
271
1
0
29 Apr 2025
Exploring How LLMs Capture and Represent Domain-Specific Knowledge
Mirian Hipolito Garcia
Camille Couturier
Daniel Madrigal Diaz
Ankur Mallick
Anastasios Kyrillidis
Robert Sim
Victor Rühle
Saravan Rajmohan
380
2
0
23 Apr 2025
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
Yixin Cao
Jiahao Ying
Longji Xu
Xipeng Qiu
Qi Zhang
Yugang Jiang
ELM
270
2
0
10 Apr 2025
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression
Hanqi Xiao
Yi-Lin Sung
Elias Stengel-Eskin
Joey Tianyi Zhou
MQ
399
1
0
10 Apr 2025
Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning
Jiahua Lan
Sen Zhang
Haixia Pan
Ruijun Liu
Li Shen
Dacheng Tao
CLL
283
0
0
09 Apr 2025
Les Dissonances: Cross-Tool Harvesting and Polluting in Pool-of-Tools Empowered LLM Agents
Zichuan Li
Jian Cui
Xiaojing Liao
Luyi Xing
LLMAG
286
1
0
04 Apr 2025
Towards Understanding How Knowledge Evolves in Large Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Sudong Wang
Yujiao Shi
Yao Zhu
Jianing Li
Zizhe Wang
Yi Liu
Xiangyang Ji
796
2
0
31 Mar 2025
Intra-neuronal attention within language models Relationships between activation and semantics
Michael Pichat
William Pogrund
Paloma Pichat
Armanouche Gasparian
Samuel Demarchi
Corbet Alois Georgeon
Michael Veillet-Guillem
MILM
256
0
0
17 Mar 2025
Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms
Xiaojian Li
Yongkang Leng
Ruiqing Ding
Hangjie Mo
Shanlin Yang
LRM
192
2
0
15 Mar 2025
Discovering Influential Neuron Path in Vision Transformers
International Conference on Learning Representations (ICLR), 2025
Yifan Wang
Yifei Liu
Yingdong Shi
Chong Li
Anqi Pang
Sibei Yang
Jingyi Yu
Kan Ren
ViT
605
3
0
12 Mar 2025
From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning
Eric Zhao
Pranjal Awasthi
Nika Haghtalab
172
4
0
07 Mar 2025
Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning
International Conference on Learning Representations (ICLR), 2025
Tianci Liu
R. Li
Yunzhe Qi
Hui Liu
Xianfeng Tang
...
Qingyu Yin
Monica Cheng
Jun Huan
Haoyu Wang
Jing Gao
KELM
265
11
0
01 Mar 2025
Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective
Yuko Nakagi
Keigo Tada
Sota Yoshino
Shinji Nishimoto
Yu Takagi
LRM
363
3
0
28 Feb 2025
Capability Localization: Capabilities Can be Localized rather than Individual Knowledge
International Conference on Learning Representations (ICLR), 2025
Xiusheng Huang
Jiaxiang Liu
Yequan Wang
Jun Zhao
Kang Liu
277
1
0
28 Feb 2025
Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries
Tianyi Lorena Yan
Robin Jia
KELM, MU
316
0
0
27 Feb 2025
Synthetic Categorical Restructuring large Or How AIs Gradually Extract Efficient Regularities from Their Experience of the World
Michael Pichat
William Pogrund
Paloma Pichat
Armanouche Gasparian
Samuel Demarchi
Martin Corbet
Alois Georgeon
Theo Dasilva
Michael Veillet-Guillem
244
2
0
25 Feb 2025
Model Lakes
International Conference on Extending Database Technology (EDBT), 2024
Koyena Pal
David Bau
Renée J. Miller
343
2
0
24 Feb 2025
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
Chenlong Wang
Zhaoyang Chu
Zhengxiang Cheng
Xuyi Yang
Kaiyue Qiu
Yao Wan
Zhou Zhao
Xuanhua Shi
Benlin Liu
ALM, SyDa
318
3
0
23 Feb 2025