ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.08696
  4. Cited By
Knowledge Neurons in Pretrained Transformers
v1v2 (latest)

Knowledge Neurons in Pretrained Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
18 April 2021
Damai Dai
Li Dong
Y. Hao
Zhifang Sui
Baobao Chang
Furu Wei
    KELMMU
ArXiv (abs)PDFHTMLGithub (168★)

Papers citing "Knowledge Neurons in Pretrained Transformers"

50 / 410 papers shown
Parameter Importance-Driven Continual Learning for Foundation Models
Parameter Importance-Driven Continual Learning for Foundation Models
LingXiang Wang
Hainan Zhang
Zhiming Zheng
KELMCLL
492
0
0
19 Nov 2025
Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
Zeyu Shi
Ziming Wang
Tianyu Chen
Shiqi Gao
Haoyi Zhou
Qingyun Sun
Jianxin Li
102
0
0
17 Nov 2025
Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective
Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective
Justin Lee
Zheda Mai
Jinsu Yoo
Chongyu Fan
Cheng Zhang
Wei-Lun Chao
DiffMVLM
203
0
0
11 Nov 2025
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
Sanaz Saki Norouzi
Mohammad Masjedi
Pascal Hitzler
128
0
0
09 Nov 2025
ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks
ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks
Chengzhang Yu
Zening Lu
Chenyang Zheng
C. Wang
Yiming Zhang
Zhanpeng Jin
KELM
152
0
0
03 Nov 2025
Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
Jiahao Liu
Zijian Wang
Kuo Zhao
Dong Hu
KELM
148
0
0
31 Oct 2025
Layer of Truth: Probing Belief Shifts under Continual Pre-Training Poisoning
Layer of Truth: Probing Belief Shifts under Continual Pre-Training Poisoning
S. Churina
Niranjan Chebrolu
Kokil Jaidka
KELMHILMCLL
369
0
0
29 Oct 2025
From Memorization to Reasoning in the Spectrum of Loss Curvature
From Memorization to Reasoning in the Spectrum of Loss Curvature
Jack Merullo
Srihita Vatsavaya
Lucius Bushnaq
Owen Lewis
219
1
0
28 Oct 2025
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Jinzhe Liu
Junshu Sun
Shufan Shen
Chenxue Yang
Shuhui Wang
KELMCLL
368
1
0
25 Oct 2025
Probing Neural Combinatorial Optimization Models
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang
Yining Ma
Zhiguang Cao
Hoong Chuin Lau
107
0
0
25 Oct 2025
Model-Aware Tokenizer Transfer
Model-Aware Tokenizer Transfer
Mykola Haltiuk
Aleksander Smywiński-Pohl
122
0
0
24 Oct 2025
A Graph Signal Processing Framework for Hallucination Detection in Large Language Models
A Graph Signal Processing Framework for Hallucination Detection in Large Language Models
Valentin Noël
135
1
0
21 Oct 2025
From Memorization to Generalization: Fine-Tuning Large Language Models for Biomedical Term-to-Identifier Normalization
From Memorization to Generalization: Fine-Tuning Large Language Models for Biomedical Term-to-Identifier Normalization
Suswitha Pericharla
D. B. Hier
Tayo Obafemi-Ajayi
148
1
0
21 Oct 2025
Neuronal Group Communication for Efficient Neural representation
Neuronal Group Communication for Efficient Neural representation
Zhengqi Pei
Qingming Huang
Shuhui Wang
114
0
0
19 Oct 2025
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Tina Behnia
Puneesh Deora
Christos Thrampoulidis
118
0
0
17 Oct 2025
Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain
Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain
Jingmin An
Yilong Song
Ruolin Yang
Nai Ding
Lingxi Lu
Yuxuan Wang
Wei Wang
Chu Zhuang
Q. Wang
Fang Fang
152
1
0
15 Oct 2025
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models
Daniil Gurgurov
Josef van Genabith
Simon Ostermann
MoE
202
0
0
15 Oct 2025
Medical Interpretability and Knowledge Maps of Large Language Models
Medical Interpretability and Knowledge Maps of Large Language Models
Razvan Marinescu
Victoria-Elisabeth Gruber
Diego Fajardo
FAttAI4MH
240
0
0
13 Oct 2025
Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization
Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization
Bowei He
Lihao Yin
Huiling Zhen
Shuqi Liu
Han Wu
Xiaokun Zhang
Mingxuan Yuan
Chen Ma
112
0
0
12 Oct 2025
The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities
The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities
Zixuan Qin
Kunlin Lyu
Qingchen Yu
Yifan Sun
Zhaoxin Fan
AAML
129
2
0
11 Oct 2025
ADEPT: Continual Pretraining via Adaptive Expansion and Dynamic Decoupled Tuning
ADEPT: Continual Pretraining via Adaptive Expansion and Dynamic Decoupled Tuning
Jinyang Zhang
Yue Fang
Hongxin Ding
Weibin Liao
Muyang Ye
Xu Chu
Junfeng Zhao
Yasha Wang
CLL
138
0
0
11 Oct 2025
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Jiayu Yang
Yuxuan Fan
Songning Lai
Shengen Wu
J. Tang
Chun Kang
Zhijiang Guo
Yutao Yue
KELM
83
0
0
09 Oct 2025
Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots
Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots
Damir Nurtdinov
Aliaksei Korshuk
Alexei Kornaev
Alexander Maloletov
84
0
0
09 Oct 2025
POME: Post Optimization Model Edit via Muon-style Projection
POME: Post Optimization Model Edit via Muon-style Projection
Yong Liu
Di Fu
Yang Luo
Zirui Zhu
Minhao Cheng
Cho-Jui Hsieh
Yang You
103
0
0
08 Oct 2025
Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs
Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs
Fatmazohra Rezkellah
Ramzi Dakhmouche
AAMLMU
225
1
0
03 Oct 2025
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models
Ci-Siang Lin
Min-Hung Chen
Yu-Yang Sheng
Y. Wang
VLM
152
0
0
03 Oct 2025
What Drives Compositional Generalization in Visual Generative Models?
What Drives Compositional Generalization in Visual Generative Models?
Karim Farid
Rajat Sahay
Yumna Ali Alnaggar
Simon Schrodi
Volker Fischer
Cordelia Schmid
Thomas Brox
CoGe
328
0
0
03 Oct 2025
Muon Outperforms Adam in Tail-End Associative Memory Learning
Muon Outperforms Adam in Tail-End Associative Memory Learning
Shuche Wang
Fengzhuo Zhang
Jiaxiang Li
Cunxiao Du
C. Du
Tianyu Pang
Zhuoran Yang
Mingyi Hong
Vincent Y. F. Tan
175
3
0
30 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
RALMKELM
249
1
0
29 Sep 2025
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
Yoonah Park
Haesung Pyun
Yohan Jo
KELM
375
0
0
28 Sep 2025
Knowledge Homophily in Large Language Models
Knowledge Homophily in Large Language Models
Utkarsh Sahu
Zhisheng Qi
M. Halappanavar
Nedim Lipka
Ryan Rossi
Franck Dernoncourt
Yu Zhang
Yao Ma
Yu Wang
125
0
0
28 Sep 2025
Timber: Training-free Instruct Model Refining with Base via Effective Rank
Timber: Training-free Instruct Model Refining with Base via Effective Rank
Taiqiang Wu
Runming Yang
Tao Liu
Jiahao Wang
Zenan Xu
Ngai Wong
140
1
0
28 Sep 2025
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
Jiahao Ying
Mingbao Lin
Qianru Sun
Yixin Cao
MoE
73
0
0
28 Sep 2025
Hedonic Neurons: A Mechanistic Mapping of Latent Coalitions in Transformer MLPs
Hedonic Neurons: A Mechanistic Mapping of Latent Coalitions in Transformer MLPs
Tanya Chowdhury
Atharva Nijasure
Yair Zick
James Allan
133
0
0
28 Sep 2025
Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models
Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models
Junjie Yao
Zhi-hai Xu
139
0
0
24 Sep 2025
Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens
Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens
Sohee Kim
Soohyun Ryu
Joonhyung Park
Eunho Yang
153
0
0
03 Sep 2025
Unraveling LLM Jailbreaks Through Safety Knowledge Neurons
Unraveling LLM Jailbreaks Through Safety Knowledge Neurons
Chongwen Zhao
Kaizhu Huang
Kaizhu Huang
AAMLKELM
169
2
0
01 Sep 2025
DFAMS: Dynamic-flow guided Federated Alignment based Multi-prototype Search
DFAMS: Dynamic-flow guided Federated Alignment based Multi-prototype Search
Zhibang Yang
Xinke Jiang
Rihong Qiu
Ruiqing Li
Yihang Zhang
...
Yongxin Xu
Hongxin Ding
Xu Chu
Junfeng Zhao
Yasha Wang
185
1
0
28 Aug 2025
Provable Benefits of In-Tool Learning for Large Language Models
Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston
Ambroise Odonnat
Charles Arnal
Vivien A. Cabannes
RALM
155
1
0
28 Aug 2025
LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation
LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation
Yang Sun
Lixin Zou
Dan Luo
Zhiyong Xie
Liming Dong
Liming Dong
Yunwei Zhao
Y. Lu
Y. Lu
Chenliang Li
OffRL
138
0
0
27 Aug 2025
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Wataru Ikeda
Kazuki Yano
Ryosuke Takahashi
Jaesung Lee
Keigo Shibata
Jun Suzuki
99
1
0
25 Aug 2025
From Confidence to Collapse in LLM Factual Robustness
From Confidence to Collapse in LLM Factual RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Alina Fastowski
Bardh Prenkaj
Gjergji Kasneci
HILMAAML
230
1
0
22 Aug 2025
Side Effects of Erasing Concepts from Diffusion Models
Side Effects of Erasing Concepts from Diffusion Models
Shaswati Saha
Sourajit Saha
Manas Gaur
Tejas Gokhale
DiffM
240
1
0
20 Aug 2025
WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification
WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification
Thang Duc Tran
Thai Hoang Le
MU
129
0
0
06 Aug 2025
Understanding and Mitigating Political Stance Cross-topic Generalization in Large Language Models
Understanding and Mitigating Political Stance Cross-topic Generalization in Large Language Models
J. Zhang
Shu Yang
Junchao Wu
Yang Li
Haiyan Zhao
214
1
0
04 Aug 2025
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations
Dahee Kwon
Sehyun Lee
Jaesik Choi
171
1
0
03 Aug 2025
Prompting Large Language Models with Partial Knowledge for Answering Questions with Unseen Entities
Prompting Large Language Models with Partial Knowledge for Answering Questions with Unseen Entities
Zhichao Yan
Jiapu Wang
Jiaoyan Chen
Yanyan Wang
Hongye Tan
Jiye Liang
Xiaoli Li
Ru Li
Jeff Z.Pan
RALM
136
2
0
02 Aug 2025
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Xin Liu
Qiyang Song
Shaowen Xu
Kerou Zhou
Wenbo Jiang
Xiaoqi Jia
Weijuan Zhang
Heqing Huang
Yakai Li
KELM
190
0
0
01 Aug 2025
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
Rui Jiao
Yue Zhang
Jinku Li
LRM
205
0
0
25 Jul 2025
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Zijun Chen
Wenbo Hu
Richang Hong
LRM
171
0
0
14 Jul 2025
123456789
Next
Page 1 of 9