ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.05262
  4. Cited By
Locating and Editing Factual Associations in GPT
v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
    KELM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown
Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents
Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents
Janika Deborah Gajo
Gerarld Paul Merales
Jerome Escarcha
Brenden Ashley Molina
Gian Nartea
Emmanuel G. Maminta
Juan Carlos Roldan
Rowel O. Atienza
LM&Ro
143
1
0
01 Aug 2025
Unveiling the Influence of Amplifying Language-Specific Neurons
Inaya Rahmanisa
Lyzander Marciano Andrylie
Mahardika Krisna Ihsani
Alfan Farizki Wicaksono
Haryo Akbarianto Wibowo
Alham Fikri Aji
148
0
0
30 Jul 2025
RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning
RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning
Kiseong Hong
Gyeong-hyeon Kim
Eunwoo Kim
CLLVLM
174
0
0
30 Jul 2025
When Truthful Representations Flip Under Deceptive Instructions?
When Truthful Representations Flip Under Deceptive Instructions?
Xianxuan Long
Y. Fu
Runchao Li
Mu Sheng
Haotian Yu
Xiaotian Han
Pan Li
LLMSV
369
4
0
29 Jul 2025
Dissecting Persona-Driven Reasoning in Language Models via Activation Patching
Dissecting Persona-Driven Reasoning in Language Models via Activation Patching
Ansh Poonia
Maeghal Jain
215
0
0
28 Jul 2025
Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition
Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition
Haris Khan
Shumaila Asif
Sadia Asif
MoMeCLL
204
0
0
28 Jul 2025
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
Xiaohua Feng
Jiaming Zhang
Fengyuan Yu
C. Wang
Li Zhang
Kaixiang Li
Yuyuan Li
Chaochao Chen
Jianwei Yin
MU
262
2
0
26 Jul 2025
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
Rui Jiao
Yue Zhang
Jinku Li
LRM
204
0
0
25 Jul 2025
Modality Agnostic Efficient Long Range Encoder
Modality Agnostic Efficient Long Range Encoder
T. Parag
Ahmed Elgammal
158
0
0
25 Jul 2025
CircuitProbe: Dissecting Spatiotemporal Visual Semantics with Circuit Tracing
CircuitProbe: Dissecting Spatiotemporal Visual Semantics with Circuit Tracing
Yiming Zhang
Chengzhang Yu
Zhuokai Zhao
Kun Wang
Qiankun Li
Z. Chen
Yang Liu
Zenghui Ding
Yining Sun
LRM
218
1
0
25 Jul 2025
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Mutian Yang
Jiandong Gao
Ji Wu
186
1
0
24 Jul 2025
NeuralDB: Scaling Knowledge Editing in LLMs to 100,000 Facts with Neural KV Database
NeuralDB: Scaling Knowledge Editing in LLMs to 100,000 Facts with Neural KV Database
Weizhi Fei
Hao Shi
Jing Xu
Jingchen Peng
Jiazheng Li
Jingzhao Zhang
Bo Bai
Wei Han
Z. Chen
Xueyan Niu
KELM
170
0
0
24 Jul 2025
How does Chain of Thought Think? Mechanistic Interpretability of Chain-of-Thought Reasoning with Sparse Autoencoding
How does Chain of Thought Think? Mechanistic Interpretability of Chain-of-Thought Reasoning with Sparse Autoencoding
Xi Chen
Aske Plaat
Niki van Stein
ReLMLRMAI4CE
128
5
0
24 Jul 2025
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
Helena Casademunt
Caden Juang
Adam Karvonen
Samuel Marks
Senthooran Rajamanoharan
Neel Nanda
OODDLLMSV
385
10
0
22 Jul 2025
Beyond Isolated Capabilities: Bridging Long CoT Reasoning and Long-Context Understanding
Beyond Isolated Capabilities: Bridging Long CoT Reasoning and Long-Context Understanding
Yifei Wang
LRM
132
0
0
20 Jul 2025
Linear Relational Decoding of Morphology in Language Models
Linear Relational Decoding of Morphology in Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Eric Xia
Jugal Kalita
192
1
0
19 Jul 2025
Retention analysis of edited knowledge after fine-tuning
Retention analysis of edited knowledge after fine-tuning
Fufang Wen
Shichang Zhang
KELM
194
0
0
14 Jul 2025
Function Induction and Task Generalization: An Interpretability Study with Off-by-One Addition
Function Induction and Task Generalization: An Interpretability Study with Off-by-One Addition
Qinyuan Ye
Robin Jia
Xiang Ren
LRMELM
180
1
0
14 Jul 2025
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Zijun Chen
Wenbo Hu
Richang Hong
LRM
158
0
0
14 Jul 2025
An Exploration of Knowledge Editing for Arabic
An Exploration of Knowledge Editing for Arabic
Basel Mousi
Nadir Durrani
Fahim Dalvi
KELM
186
1
0
13 Jul 2025
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
Cathy Jiao
Yijun Pan
Emily Xiao
Daisy Sheng
Niket Jain
H. C. Zhao
Ishita Dasgupta
Jiaqi W. Ma
Chenyan Xiong
216
0
0
12 Jul 2025
Knowledge Fusion via Bidirectional Information Aggregation
Knowledge Fusion via Bidirectional Information Aggregation
Songlin Zhai
Guilin Qi
Yue Wang
Yuan Meng
134
0
0
11 Jul 2025
Steering Information Utility in Key-Value Memory for Language Model Post-Training
Steering Information Utility in Key-Value Memory for Language Model Post-Training
Chunyuan Deng
Ruidi Chang
Hanjie Chen
LLMSV
369
0
0
07 Jul 2025
Dynamic Injection of Entity Knowledge into Dense Retrievers
Dynamic Injection of Entity Knowledge into Dense Retrievers
Ikuya Yamada
Ryokan Ri
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
171
0
0
05 Jul 2025
MemOS: A Memory OS for AI System
MemOS: A Memory OS for AI System
Ruoyao Xiao
Chenyang Xi
Chunyu Li
Ding Chen
Chen Tang
...
Wentao Zhang
Wentao Zhang
S. Chen
Siheng Chen
Feiyu Xiong
KELMRALM
513
30
0
04 Jul 2025
Controlling Thinking Speed in Reasoning Models
Controlling Thinking Speed in Reasoning Models
Zhengkai Lin
Zhihang Fu
Ze Chen
Chao Chen
Liang Xie
Wenxiao Wang
Deng Cai
Zheng Wang
Jieping Ye
LRM
142
7
0
04 Jul 2025
Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer
Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer
Wenquan Lu
Yuechuan Yang
Kyle Lee
Yanshu Li
Enqi Liu
LRMAI4CE
136
0
0
02 Jul 2025
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training
Aadim Nepal
Safal Shrestha
Anubhav Shrestha
Minwu Kim
Jalal Naghiyev
Ravid Shwartz-Ziv
Keith Ross
LRM
189
0
0
27 Jun 2025
Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning
Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning
Haodong Lu
Chongyang Zhao
Jason Xue
Lina Yao
Kristen Moore
Dong Gong
CLLMoMeMoE
230
2
0
26 Jun 2025
Multiple Streams of Knowledge Retrieval: Enriching and Recalling in Transformers
Multiple Streams of Knowledge Retrieval: Enriching and Recalling in Transformers
Todd Nief
David Reber
Sean Richardson
Ari Holtzman
KELM
192
0
0
25 Jun 2025
Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Baixiang Huang
Zhen Tan
Haoran Wang
Zijie Liu
Dawei Li
Ali Payani
Huan Liu
Tianlong Chen
Kai Shu
KELMLLMSV
281
0
0
25 Jun 2025
Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder
Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder
Yingji Zhang
Danilo S. Carvalho
André Freitas
CoGe
400
0
0
25 Jun 2025
Understanding Reasoning in Thinking Language Models via Steering Vectors
Understanding Reasoning in Thinking Language Models via Steering Vectors
Constantin Venhoff
Iván Arcuschin
Philip Torr
Arthur Conmy
Neel Nanda
LLMSVLRM
195
43
0
22 Jun 2025
Sparse Feature Coactivation Reveals Causal Semantic Modules in Large Language Models
Sparse Feature Coactivation Reveals Causal Semantic Modules in Large Language Models
Ruixuan Deng
Xiaoyang Hu
Miles Gilberti
Shane Storks
Aman Taxali
Mike Angstadt
Chandra S. Sripada
Joyce Chai
189
0
0
22 Jun 2025
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
Jingtong Su
Julia Kempe
Karen Ullrich
274
3
0
20 Jun 2025
Large Language Models as Psychological Simulators: A Methodological Guide
Large Language Models as Psychological Simulators: A Methodological Guide
Zhicheng Lin
LLMAG
246
2
0
20 Jun 2025
Latent Concept Disentanglement in Transformer-based Language Models
Latent Concept Disentanglement in Transformer-based Language Models
Guan Zhe Hong
Bhavya Vasudeva
Willie Neiswanger
Cyrus Rashtchian
Prabhakar Raghavan
Rina Panigrahy
ReLMLRM
340
2
0
20 Jun 2025
Reviving Your MNEME: Predicting The Side Effects of LLM Unlearning and Fine-Tuning via Sparse Model Diffing
Reviving Your MNEME: Predicting The Side Effects of LLM Unlearning and Fine-Tuning via Sparse Model Diffing
Aly M. Kassem
Zhuan Shi
Negar Rostamzadeh
G. Farnadi
MUKELM
155
2
0
19 Jun 2025
Under the Shadow of Babel: How Language Shapes Reasoning in LLMs
Under the Shadow of Babel: How Language Shapes Reasoning in LLMs
Chenxi Wang
Y. Zhang
Lang Gao
Zixiang Xu
Zirui Song
Zixiang Xu
Xiuying Chen
158
1
0
19 Jun 2025
Can structural correspondences ground real world representational content in Large Language Models?
Can structural correspondences ground real world representational content in Large Language Models?
Iwan Williams
154
3
0
19 Jun 2025
Mr. Snuffleupagus at SemEval-2025 Task 4: Unlearning Factual Knowledge from LLMs Using Adaptive RMU
Mr. Snuffleupagus at SemEval-2025 Task 4: Unlearning Factual Knowledge from LLMs Using Adaptive RMU
Arjun Dosajh
Mihika Sanghi
MU
263
0
0
19 Jun 2025
Visual symbolic mechanisms: Emergent symbol processing in vision language models
Visual symbolic mechanisms: Emergent symbol processing in vision language models
Rim Assouel
Declan Campbell
Taylor Webb
Taylor Webb
202
2
0
18 Jun 2025
The Compositional Architecture of Regret in Large Language Models
The Compositional Architecture of Regret in Large Language Models
Xiangxiang Cui
Shu Yang
Tianjin Huang
Wanyu Lin
Lijie Hu
Haiyan Zhao
226
0
0
18 Jun 2025
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Junqi Jiang
Tom Bewley
Salim I. Amoukou
Francesco Leofante
Antonio Rago
Saumitra Mishra
Francesca Toni
196
2
0
18 Jun 2025
Learning-Time Encoding Shapes Unlearning in LLMs
Learning-Time Encoding Shapes Unlearning in LLMs
Ruihan Wu
Konstantin Garov
Kamalika Chaudhuri
MU
221
0
0
18 Jun 2025
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Chenchen Yuan
Zheyu Zhang
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
265
1
0
17 Jun 2025
Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Houcheng Jiang
Zetong Zhao
Junfeng Fang
Haokai Ma
Ruipeng Wang
Yang Deng
Xiang Wang
Xiangnan He
KELMAAML
260
0
0
16 Jun 2025
DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models
DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models
Zhiyi Shi
Binjie Wang
Chongjie Si
Yichen Wu
Junsik Kim
Hanspeter Pfister
KELMVLM
321
1
0
16 Jun 2025
Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness
Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness
Mei-Yen Chen
Thi Thu Uyen Hoang
Michael Hahn
M. Sarfraz
MoMe
246
0
0
16 Jun 2025
TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models
TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models
Yang Dai
Oubo Ma
Longfei Zhang
Xingxing Liang
Xiaochun Cao
Shouling Ji
J. Zhang
Jincai Huang
Li Shen
234
0
0
15 Jun 2025
Previous
123...567...262728
Next