Knowledge Neurons in Pretrained Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
18 April 2021
Damai Dai
Li Dong
Y. Hao
Zhifang Sui
Baobao Chang
Furu Wei
KELM, MU

Papers citing "Knowledge Neurons in Pretrained Transformers"

50 / 410 papers shown
Revealing and Mitigating Over-Attention in Knowledge Editing
International Conference on Learning Representations (ICLR), 2025
Pinzheng Wang
Zecheng Tang
Keyan Zhou
Junlin Li
Qiaoming Zhu
Hao Fei
KELM
576
4
0
21 Feb 2025
MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models
International Conference on Computational Linguistics (COLING), 2024
Zihao Wei
Jingcheng Deng
Liang Pang
Hanxing Ding
Huawei Shen
Xueqi Cheng
KELM
268
10
0
20 Feb 2025
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Vaidehi Patil
Elias Stengel-Eskin
Joey Tianyi Zhou
MU, CLL
393
6
0
20 Feb 2025
PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery
Bowei He
Lihao Yin
Hui-Ling Zhen
Xiaokun Zhang
Mingxuan Yuan
Chen Ma
412
2
0
18 Feb 2025
Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models
Yiheng Liu
Xiaohui Gao
Haiyang Sun
Bao Ge
Tianming Liu
...
Ning Qiang
Bao Ge
Tianming Liu
Junwei Han
Xintao Hu
164
2
0
13 Feb 2025
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Cunchun Li
Xinze Wang
KELM
613
20
0
09 Feb 2025
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Zhuowei Li
Haizhou Shi
Yunhe Gao
Di Liu
Zhenting Wang
Yuxiao Chen
Ting Liu
Long Zhao
Hao Wang
Dimitris N. Metaxas
MLLM
259
18
0
05 Feb 2025
Discovering Chunks in Neural Embeddings for Interpretability
Shuchen Wu
Stephan Alaniz
Eric Schulz
Zeynep Akata
291
0
0
03 Feb 2025
Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing
Zeping Yu
Sophia Ananiadou
KELM
290
10
0
24 Jan 2025
Episodic Memories Generation and Evaluation Benchmark for Large Language Models
International Conference on Learning Representations (ICLR), 2025
Alexis Huet
Zied Ben-Houidi
Dario Rossi
LLMAG
221
7
0
21 Jan 2025
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
Hanwen Zhong
Jiaxin Chen
Yutong Zhang
Di Huang
Yunhong Wang
MoE
309
0
0
12 Jan 2025
Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong
Runxi Cheng
Wang Chen
Zhanqiu Zhang
Yiwen Guo
Chun Yuan
Ruifeng Xu
MoMe
582
11
0
10 Jan 2025
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Yifei He
Yuzheng Hu
Yong Lin
Tong Zhang
Han Zhao
FedML, MoMe
335
30
0
08 Jan 2025
Foundations of GenIR
Jiaxin Mao
Jingtao Zhan
Wenshu Fan
266
0
0
06 Jan 2025
How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons
Michael Pichat
William Pogrund
Armanush Gasparian
Paloma Pichat
Samuel Demarchi
Michael Veillet-Guillem
276
3
0
26 Dec 2024
Joint Knowledge Editing for Information Enrichment and Probability Promotion
AAAI Conference on Artificial Intelligence (AAAI), 2024
Wenhang Shi
Yiren Chen
Shuqing Bian
Xinyi Zhang
Zhe Zhao
Pengfei Hu
Wei Lu
Xiaoyong Du
KELM
210
1
0
22 Dec 2024
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
A. Feder Cooper
Christopher A. Choquette-Choo
Miranda Bogen
Matthew Jagielski
Katja Filippova
...
Hanna M. Wallach
Amy Cyphert
Katherine Lee
Nicolas Papernot
Katherine Lee
MU, AILaw
352
29
0
09 Dec 2024
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
425
48
0
03 Dec 2024
Continuous Concepts Removal in Text-to-image Diffusion Models
Tingxu Han
Weisong Sun
Yanrong Hu
Chunrong Fang
Yonglong Zhang
Shiqing Ma
Tao Zheng
Zhenyu Chen
Zhenting Wang
DiffM
534
3
0
30 Nov 2024
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao
Yuheng Chen
Zhuoran Jin
Yubo Chen
Kang Liu
Jun Zhao
KELM
310
1
0
26 Nov 2024
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
Computer Vision and Pattern Recognition (CVPR), 2024
Qizhou Chen
Chengyu Wang
Dakan Wang
Taolin Zhang
Wangyue Li
Xiaofeng He
KELM
378
5
0
23 Nov 2024
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Zhen Zeng
Leijiang Gu
Xun Yang
Zhangling Duan
Zenglin Shi
Meng Wang
KELM
250
4
0
19 Nov 2024
Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
Zeping Yu
Sophia Ananiadou
1.1K
9
0
17 Nov 2024
Information Anxiety in Large Language Models
Prasoon Bajpai
Sarah Masud
Tanmoy Chakraborty
154
0
0
16 Nov 2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment
Neural Information Processing Systems (NeurIPS), 2024
Y. Fu
Zhongzhi Yu
Junwei Li
Jiayi Qian
Yongan Zhang
Xiangchi Yuan
Dachuan Shi
Roman Yakunin
Y. Lin
285
7
0
15 Nov 2024
Controllable Context Sensitivity and the Knob Behind It
International Conference on Learning Representations (ICLR), 2024
Julian Minder
Kevin Du
Niklas Stoehr
Giovanni Monea
Chris Wendler
Robert West
Robert Bamler
KELM
625
17
0
11 Nov 2024
Learning Where to Edit Vision Transformers
Neural Information Processing Systems (NeurIPS), 2024
Yunqiao Yang
Long-Kai Huang
Shengzhuang Chen
Kede Ma
Ying Wei
KELM
231
2
0
04 Nov 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Yuqi Luo
Chenyang Song
Xu Han
Yuxiao Chen
Chaojun Xiao
Zhiyuan Liu
Maosong Sun
Jiansheng Wei
Zhiyuan Liu
Maosong Sun
585
14
0
04 Nov 2024
Commonsense Knowledge Editing Based on Free-Text in LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiusheng Huang
Yequan Wang
Jun Zhao
Kang Liu
KELM
182
11
0
31 Oct 2024
Reasons and Solutions for the Decline in Model Performance after Editing
Neural Information Processing Systems (NeurIPS), 2024
Xiusheng Huang
Jiaxiang Liu
Yequan Wang
Kang Liu
KELM
263
8
0
31 Oct 2024
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Zhengfu He
Wentao Shu
Xuyang Ge
Lingjie Chen
Junxuan Wang
...
Qipeng Guo
Xuanjing Huang
Zuxuan Wu
Yu-Gang Jiang
Xipeng Qiu
328
74
0
27 Oct 2024
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Aryo Pradipta Gema
Chen Jin
Ahmed Abdulaal
Tom Diethe
Philip Teare
Beatrice Alex
Pasquale Minervini
Amrutha Saseendran
284
12
0
24 Oct 2024
The Tug of War Within: Mitigating the Fairness-Privacy Conflicts in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Chen Qian
Dongrui Liu
Jie Zhang
Yong Liu
Jing Shao
344
1
0
22 Oct 2024
Catastrophic Failure of LLM Unlearning via Quantization
International Conference on Learning Representations (ICLR), 2024
Zhiwei Zhang
Fali Wang
Xiaomin Li
Zongyu Wu
Xianfeng Tang
Hui Liu
Qi He
Wenpeng Yin
Suhang Wang
MU
330
5
0
21 Oct 2024
Neuron-based Personality Trait Induction in Large Language Models
Jia Deng
Tianyi Tang
Yanbin Yin
Wenhao Yang
Wayne Xin Zhao
Ji-Rong Wen
240
3
0
16 Oct 2024
Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models
International Conference on Learning Representations (ICLR), 2024
Shicheng Xu
Liang Pang
Yunchang Zhu
Huawei Shen
Xueqi Cheng
MLLM
297
14
0
16 Oct 2024
ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability
International Conference on Learning Representations (ICLR), 2024
Zhongxiang Sun
Xiaoxue Zang
Kai Zheng
Yang Song
Jun Xu
Xiao Zhang
Weijie Yu
Yang Song
Han Li
310
58
0
15 Oct 2024
LargePiG: Your Large Language Model is Secretly a Pointer Generator
Zhongxiang Sun
Zihua Si
Xiaoxue Zang
Kai Zheng
Yang Song
Xiao Zhang
Jun Xu
HILM, RALM
227
0
0
15 Oct 2024
MoIN: Mixture of Introvert Experts to Upcycle an LLM
Ajinkya Tejankar
K. Navaneet
Ujjawal Panchal
Kossar Pourahmadi
Hamed Pirsiavash
MoE
337
0
0
13 Oct 2024
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains
International Conference on Learning Representations (ICLR), 2024
Yein Park
Chanwoong Yoon
Jungwoo Park
Donghyeon Lee
Minbyul Jeong
Jaewoo Kang
KELM
478
3
0
13 Oct 2024
Keys to Robust Edits: from Theoretical Insights to Practical Advances
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jianhao Yan
Futing Wang
Yun Luo
Yafu Li
Yue Zhang
KELM
273
1
0
12 Oct 2024
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Sitao Cheng
Liangming Pan
Xunjian Yin
Xinyi Wang
William Yang Wang
KELM
237
10
0
10 Oct 2024
Uncovering Overfitting in Large Language Model Editing
International Conference on Learning Representations (ICLR), 2024
Mengqi Zhang
Xiaotian Ye
Qiang Liu
Sudipta Singha Roy
Shu Wu
Zhumin Chen
KELM
285
24
0
10 Oct 2024
From Tokens to Words: On the Inner Lexicon of LLMs
International Conference on Learning Representations (ICLR), 2024
Guy Kaplan
Matanel Oren
Yuval Reif
Roy Schwartz
441
30
0
08 Oct 2024
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang
Yongqian Li
Zijian Kan
Keyuan Cheng
Lijie Hu
Di Wang
KELM
408
26
0
08 Oct 2024
MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models
Kaichen Huang
Jiahao Huo
Yibo Yan
Kun Wang
Yutao Yue
Xuming Hu
244
2
0
07 Oct 2024
Neuron-Level Sequential Editing for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Houcheng Jiang
Cunchun Li
Tianyu Zhang
An Zhang
Ruipeng Wang
Tao Liang
Xiang Wang
KELM
233
11
0
05 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM, MU
393
8
0
03 Oct 2024
Mitigating Copy Bias in In-Context Learning through Neuron Pruning
Ameen Ali
Lior Wolf
Ivan Titov
199
6
0
02 Oct 2024
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
International Conference on Learning Representations (ICLR), 2024
Jiyeon Kim
Hyunji Lee
Hyowon Cho
Joel Jang
Hyeonbin Hwang
Seungpil Won
Youbin Ahn
Dohaeng Lee
Minjoon Seo
KELM
1.0K
13
0
02 Oct 2024