ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.05262
  4. Cited By
Locating and Editing Factual Associations in GPT
v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
    KELM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown
REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model
REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model
Bo Li
Guanzhi Deng
Ronghao Chen
Junrong Yue
Shuo Zhang
Qinghua Zhao
Linqi Song
Lijie Wen
LRM
111
1
0
26 Sep 2025
Fine-tuning Done Right in Model Editing
Fine-tuning Done Right in Model Editing
Wanli Yang
Fei Sun
Rui Tang
Hongyu Zang
Du Su
Qi Cao
Jingang Wang
Huawei Shen
Xueqi Cheng
KELM
183
0
0
26 Sep 2025
Bilinear relational structure fixes reversal curse and enables consistent model editing
Bilinear relational structure fixes reversal curse and enables consistent model editing
Dong-Kyum Kim
Minsung Kim
Jea Kwon
Nakyeong Yang
Meeyoung Cha
KELM
377
0
0
26 Sep 2025
MindCraft: How Concept Trees Take Shape In Deep Models
MindCraft: How Concept Trees Take Shape In Deep Models
Bowei Tian
Yexiao He
Wanghao Ye
Ziyao Wang
Meng Liu
Ang Li
LRM
108
0
0
26 Sep 2025
Towards Transparent AI: A Survey on Explainable Language Models
Towards Transparent AI: A Survey on Explainable Language Models
Avash Palikhe
Sribala Vidyadhari Chinta
Zhipeng Yin
Rui Guo
Qiang Duan
Jie Yang
Wenbin Zhang
178
2
0
25 Sep 2025
Towards Atoms of Large Language Models
Towards Atoms of Large Language Models
Chenhui Hu
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
122
0
0
25 Sep 2025
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Sasha Cui
Zhongren Chen
LLMSV
238
1
0
25 Sep 2025
CLUE: Conflict-guided Localization for LLM Unlearning Framework
CLUE: Conflict-guided Localization for LLM Unlearning Framework
Hang Chen
Jiaying Zhu
Xinyu Yang
Wenya Wang
MU
143
0
0
25 Sep 2025
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Yisong Xiao
Aishan Liu
Siyuan Liang
Zonghao Ying
Xianglong Liu
Dacheng Tao
KELM
153
2
0
24 Sep 2025
Personality Vector: Modulating Personality of Large Language Models by Model Merging
Personality Vector: Modulating Personality of Large Language Models by Model Merging
Seungjong Sun
Seo Yeon Baek
Jang Hyun Kim
MoMe
121
3
0
24 Sep 2025
bi-GRPO: Bidirectional Optimization for Jailbreak Backdoor Injection on LLMs
bi-GRPO: Bidirectional Optimization for Jailbreak Backdoor Injection on LLMs
Wence Ji
Jiancan Wu
Aiying Li
Shuyi Zhang
Junkang Wu
An Zhang
Xiang-Bin Wang
Xiangnan He
AAML
150
0
0
24 Sep 2025
Latent Activation Editing: Inference-Time Refinement of Learned Policies for Safer Multirobot Navigation
Latent Activation Editing: Inference-Time Refinement of Learned Policies for Safer Multirobot Navigation
Satyajeet Das
Darren Chiu
Zhehui Huang
Lars Lindemann
Gaurav Sukhatme
LLMSV
182
0
0
24 Sep 2025
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
Boao Kong
Junzhu Liang
Yuxi Liu
Renjia Deng
Kun Yuan
160
1
0
23 Sep 2025
Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
Xiao Han
Zimo Zhao
Wanyu Wang
Xinjian Zhao
Zitao Liu
Yi Chang
Xiangyu Zhao
CLL
162
1
0
23 Sep 2025
Cyclic Ablation: Testing Concept Localization against Functional Regeneration in AI
Cyclic Ablation: Testing Concept Localization against Functional Regeneration in AI
Eduard Kapelko
21
0
0
23 Sep 2025
When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng
Hanqi Li
Kai Yu
Lu Chen
249
0
0
23 Sep 2025
Consistency-Aware Parameter-Preserving Knowledge Editing Framework for Multi-Hop Question Answering
Consistency-Aware Parameter-Preserving Knowledge Editing Framework for Multi-Hop Question Answering
Lingwen Deng
Yifei Han
Long Zhang
Yue Du
Bin Li
KELM
185
0
0
23 Sep 2025
Memory in Large Language Models: Mechanisms, Evaluation and Evolution
Memory in Large Language Models: Mechanisms, Evaluation and Evolution
D. Zhang
Wendong Li
Kani Song
Jiaye Lu
Gang Li
Liuchun Yang
Sheng Li
KELM
217
1
0
23 Sep 2025
How Persuasive is Your Context?
How Persuasive is Your Context?
Tu Nguyen
Kevin Du
Alexander Miserlis Hoyle
Ryan Cotterell
113
0
0
22 Sep 2025
Diagnosing Model Editing via Knowledge Spectrum
Diagnosing Model Editing via Knowledge Spectrum
Tsung-Hsuan Pan
Chung-Chi Chen
Hen-Hsen Huang
Hsin-Hsi Chen
KELM
117
0
0
22 Sep 2025
Achilles' Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data
Achilles' Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data
Jiahao Huo
Pengxiao Lin
Zhiwei Wang
Zhi-Qin John Xu
Mamba
172
0
0
22 Sep 2025
DISCO: Disentangled Communication Steering for Large Language Models
DISCO: Disentangled Communication Steering for Large Language Models
Max Torop
A. Masoomi
Masih Eskandar
Jennifer Dy
LLMSV
182
0
0
20 Sep 2025
ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models
ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models
Xue Yang
Zhen Wen
Qiqi Jiang
Chenxiao Li
Yuwei Wu
Y. Yang
Yiyao Wang
Xiuqi Huang
Minfeng Zhu
Wei Chen
156
0
0
20 Sep 2025
Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
Tomoya Yamashita
Akira Ito
Yuuki Yamanaka
Masanori Yamada
Takayuki Miura
Toshiki Shibahara
MUKELM
118
1
0
19 Sep 2025
Toward Efficient Influence Function: Dropout as a Compression Tool
Toward Efficient Influence Function: Dropout as a Compression Tool
Yuchen Zhang
Mohammad Mohammadi Amiri
TDI
243
0
0
19 Sep 2025
Concept Unlearning in Large Language Models via Self-Constructed Knowledge Triplets
Concept Unlearning in Large Language Models via Self-Constructed Knowledge Triplets
Tomoya Yamashita
Yuuki Yamanaka
M. Yamada
Takayuki Miura
Toshiki Shibahara
Tomoharu Iwata
MU
98
1
0
19 Sep 2025
Reveal and Release: Iterative LLM Unlearning with Self-generated Data
Reveal and Release: Iterative LLM Unlearning with Self-generated Data
Linxi Xie
Xin Teng
Shichang Ke
Hongyi Wen
Shengjie Wang
MU
166
0
0
18 Sep 2025
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
Zhenlan Ji
Daoyuan Wu
Wenxuan Wang
Pingchuan Ma
Shuai Wang
Lei Ma
74
0
0
18 Sep 2025
V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models
V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models
Qidong Wang
Junjie Hu
Ming Jiang
104
0
0
18 Sep 2025
Real, Fake, or Manipulated? Detecting Machine-Influenced Text
Real, Fake, or Manipulated? Detecting Machine-Influenced Text
Yitong Wang
Zhongping Zhang
Margherita Piana
Zheng Zhou
Peter Gerstoft
Bryan A. Plummer
DeLMO
240
1
0
18 Sep 2025
Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Zhuoxuan Zhang
Jinhao Duan
Edward Kim
Kaidi Xu
112
0
0
17 Sep 2025
Do Natural Language Descriptions of Model Activations Convey Privileged Information?
Do Natural Language Descriptions of Model Activations Convey Privileged Information?
Millicent Li
Alberto Mario Ceballos Arroyo
Giordano Rogers
Naomi Saphra
Byron C. Wallace
181
2
0
16 Sep 2025
Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning
Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning
Filip Sondej
Yushi Yang
MU
381
0
0
15 Sep 2025
Quantifying Compositionality of Classic and State-of-the-Art Embeddings
Quantifying Compositionality of Classic and State-of-the-Art Embeddings
Zhijin Guo
Chenhao Xue
Zhaozhen Xu
Hongbo Bo
Yuxuan Ye
Janet B. Pierrehumbert
Martha Lewis
CoGe
169
0
0
14 Sep 2025
Pathological Truth Bias in Vision-Language Models
Pathological Truth Bias in Vision-Language Models
Yash Thube
97
0
0
14 Sep 2025
Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts
Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts
Zineddine Tighidet
Andrea Mogini
Hedi Ben-younes
Jiali Mei
Patrick Gallinari
Benjamin Piwowarski
229
2
0
12 Sep 2025
All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens
All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens
Siddarth Mamidanna
Daking Rai
Ziyu Yao
Yilun Zhou
LRM
130
1
0
11 Sep 2025
SEDM: Scalable Self-Evolving Distributed Memory for Agents
SEDM: Scalable Self-Evolving Distributed Memory for Agents
Haoran Xu
Jiacong Hu
Ke Zhang
Lei Yu
Yuxin Tang
Xinyuan Song
Yiqun Duan
Lynn Ai
Bill Shi
183
1
0
11 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELMHILM
150
3
0
10 Sep 2025
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Yi Liu
Xiangrong Zhu
Xiangyu Liu
Wei Wei
Wei Hu
KELM
113
0
0
09 Sep 2025
Statistical Methods in Generative AI
Statistical Methods in Generative AI
Edgar Dobriban
289
3
0
08 Sep 2025
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs
Zhaoyu Fan
Kaihang Pan
Mingze Zhou
Bosheng Qin
Juncheng Billy Li
Shengyu Zhang
Wenqiao Zhang
Siliang Tang
Fei Wu
Yueting Zhuang
KELM
158
0
0
06 Sep 2025
Memorization $\neq$ Understanding: Do Large Language Models Have the Ability of Scenario Cognition?
Memorization ≠\neq= Understanding: Do Large Language Models Have the Ability of Scenario Cognition?
Boxiang Ma
Ru Li
Yuanlong Wang
Hongye Tan
Xiaoli Li
137
2
0
05 Sep 2025
Manipulating Transformer-Based Models: Controllability, Steerability, and Robust Interventions
Manipulating Transformer-Based Models: Controllability, Steerability, and Robust Interventions
Faruk Alpay
Taylan Alpay
LM&Ro
82
1
0
04 Sep 2025
Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts
Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts
Rushi Wang
Jiateng Liu
Cheng Qian
Yifan Shen
Yanzhou Pan
Zhaozhuo Xu
Ahmed Abbasi
Heng Ji
D. Zhang
LLMSV
162
2
0
02 Sep 2025
Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs
Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs
Naman D. Singh
Maximilian Müller
Francesco Croce
Matthias Hein
MUKELMCLL
192
4
0
02 Sep 2025
Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
Kaviraj Pather
Elena Hadjigeorgiou
Arben Krasniqi
Claire Schmit
Irina Rusu
Marc Pons
Kabir Khan
LRM
113
0
0
01 Sep 2025
Robust Knowledge Editing via Explicit Reasoning Chains for Distractor-Resilient Multi-Hop QA
Robust Knowledge Editing via Explicit Reasoning Chains for Distractor-Resilient Multi-Hop QA
Yuchen Wu
Liang Ding
Li Shen
Dacheng Tao
KELMLRM
116
1
0
01 Sep 2025
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Mohammad Zbeeb
Hasan Hammoud
Bernard Ghanem
LRM
184
3
0
01 Sep 2025
Causal Consistency Regularization: Training Verifiably Sensitive Reasoning in Large Language Models
Causal Consistency Regularization: Training Verifiably Sensitive Reasoning in Large Language Models
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
ReLMLRM
158
0
0
01 Sep 2025
Previous
12345...262728
Next