Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2202.05262
Cited By

Locating and Editing Factual Associations in GPT

v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022

10 February 2022

Yonatan Belinkov

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown

Latent Causal Probing: A Formal Perspective on Probing with Causal
Models of Data

Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data

219

3

0

18 Jul 2024

NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

Jaden Fiotto-Kaufman

Alexander R. Loftus

Jannik Brinkmann

...

Byron C. Wallace

381

6

0

18 Jul 2024

Retrieval-Augmented Generation for Natural Language Processing: A Survey

Retrieval-Augmented Generation for Natural Language Processing: A Survey

Shangyu Wu

Yufei Cui

...

Xue Liu

465

97

0

18 Jul 2024

Establishing Knowledge Preference in Language Models

Establishing Knowledge Preference in Language Models

Heng Ji

236

0

0

17 Jul 2024

LLM Circuit Analyses Are Consistent Across Training and Scale

LLM Circuit Analyses Are Consistent Across Training and Scale

Stella Biderman

332

32

0

15 Jul 2024

How and where does CLIP process negation?

How and where does CLIP process negation?

Vincent Quantmeyer

246

11

0

15 Jul 2024

Cross-Lingual Multi-Hop Knowledge Editing

Cross-Lingual Multi-Hop Knowledge Editing

Aditi Khandelwal

148

0

0

14 Jul 2024

On Large Language Model Continual Unlearning

On Large Language Model Continual Unlearning

Qi Zhu

268

0

0

14 Jul 2024

A Survey on Symbolic Knowledge Distillation of Large Language Models

A Survey on Symbolic Knowledge Distillation of Large Language Models

Alvaro Velasquez

288

23

0

12 Jul 2024

Transformer Circuit Faithfulness Metrics are not Robust

Transformer Circuit Faithfulness Metrics are not Robust

William Saunders

207

9

0

11 Jul 2024

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Shiji Song

428

16

0

11 Jul 2024

Knowledge Overshadowing Causes Amalgamated Hallucination in Large
Language Models

Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models

Sha Li

381

27

0

10 Jul 2024

Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU
Transformers

Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers

Jesper Anderson

190

0

0

10 Jul 2024

Grounding and Evaluation for Large Language Models: Practical Challenges
and Lessons Learned (Survey)

Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)

219

33

0

10 Jul 2024

Composable Interventions for Language Models

Composable Interventions for Language Models

Arinbjorn Kolbeinsson

...

Anurag J. Vaidya

Thomas Hartvigsen

519

4

0

09 Jul 2024

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Sadhika Malladi

Luke Zettlemoyer

310

149

0

08 Jul 2024

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

405

13

0

08 Jul 2024

Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for
Interpreting Neural Networks

Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks

Aaron Mueller

228

16

0

05 Jul 2024

Concept Bottleneck Models Without Predefined Concepts

Concept Bottleneck Models Without Predefined Concepts

239

16

0

04 Jul 2024

Sheaf Discovery with Joint Computation Graph Pruning and Flexible Granularity

Sheaf Discovery with Joint Computation Graph Pruning and Flexible Granularity

214

9

0

04 Jul 2024

Truth is Universal: Robust Detection of Lies in LLMs

Truth is Universal: Robust Detection of Lies in LLMs

Lennart Bürger

237

51

0

03 Jul 2024

To Forget or Not? Towards Practical Knowledge Unlearning for Large
Language Models

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Bozhong Tian

Xiaozhuan Liang

Mengru Wang

Dianbo Sui

Xi Chen

Huajun Chen

225

21

0

02 Jul 2024

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Abulhair Saparov

640

81

0

02 Jul 2024

Why Does New Knowledge Create Messy Ripple Effects in LLMs?

Why Does New Knowledge Create Messy Ripple Effects in LLMs?

246

19

0

02 Jul 2024

PFME: A Modular Approach for Fine-grained Hallucination Detection and
Editing of Large Language Models

PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models

186

2

0

29 Jun 2024

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

Sheridan Feucht

Byron C. Wallace

243

13

0

28 Jun 2024

LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of
Large Language Models

LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models

285

13

0

28 Jun 2024

SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation

SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Linmei Hu

Juanzi Li

160

22

0

27 Jun 2024

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image
Models

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models

Aishwarya Agarwal

Srikrishna Karanam

Balaji Vasan Srinivasan

253

2

0

27 Jun 2024

The Remarkable Robustness of LLMs: Stages of Inference?

The Remarkable Robustness of LLMs: Stages of Inference?

519

87

0

27 Jun 2024

Evaluating Copyright Takedown Methods for Language Models

Evaluating Copyright Takedown Methods for Language Models

Weijia Shi

Yangsibo Huang

Luke Zettlemoyer

Kai Li

Peter Henderson

458

38

0

26 Jun 2024

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying
and Reweighting Context-Aware Neurons

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

Deyi Xiong

249

26

0

26 Jun 2024

Do LLMs dream of elephants (when told not to)? Latent concept
association and associative memory in transformers

Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers

Goutham Rajendran

Pradeep Ravikumar

277

12

0

26 Jun 2024

Enhancing Data Privacy in Large Language Models through Private
Association Editing

Enhancing Data Privacy in Large Language Models through Private Association Editing

Davide Venditti

Elena Sofia Ruzzetti

Giancarlo A. Xompero

Cristina Giannone

Raniero Romagnoli

Fabio Massimo Zanzotto

209

7

0

26 Jun 2024

Transformer Normalisation Layers and the Independence of Semantic
Subspaces

Transformer Normalisation Layers and the Independence of Semantic Subspaces

231

2

0

25 Jun 2024

BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning

BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning

Hinrich Schütze

494

12

0

25 Jun 2024

How Well Can Knowledge Edit Methods Edit Perplexing Knowledge?

How Well Can Knowledge Edit Methods Edit Perplexing Knowledge?

289

4

0

25 Jun 2024

It Is Not About What You Say, It Is About How You Say It: A Surprisingly
Simple Approach for Improving Reading Comprehension

It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension

Lawrence E Hunter

Katharina von der Wense

269

4

0

24 Jun 2024

Multilingual Knowledge Editing with Language-Agnostic Factual Neurons

Multilingual Knowledge Editing with Language-Agnostic Factual Neurons

Fandong Meng

Yufeng Chen

Jinan Xu

Jie Zhou

166

14

0

24 Jun 2024

MD tree: a model-diagnostic tree grown on loss landscape

MD tree: a model-diagnostic tree grown on loss landscape

Konstantin Schürholt

296

2

0

24 Jun 2024

Confidence Regulation Neurons in Language Models

Confidence Regulation Neurons in Language Models

Alessandro Stolfo

Yonatan Belinkov

Mrinmaya Sachan

242

39

0

24 Jun 2024

What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation

What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation

Michal Golovanevsky

Ritambhara Singh

Carsten Eickhoff

446

10

0

24 Jun 2024

Large Language Models Are Cross-Lingual Knowledge-Free Reasoners

Large Language Models Are Cross-Lingual Knowledge-Free Reasoners

Shujian Huang

405

14

0

24 Jun 2024

FastMem: Fast Memorization of Prompt Improves Context Awareness of Large
Language Models

FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

Bo Tang

Zhiyu Li

Tong Xu

Matthew B. Blaschko

210

6

0

23 Jun 2024

Memorizing Documents with Guidance in Large Language Models

Memorizing Documents with Guidance in Large Language Models

195

1

0

23 Jun 2024

Unveiling LLM Mechanisms Through Neural ODEs and Control Theory

Unveiling LLM Mechanisms Through Neural ODEs and Control Theory

Yukun Zhang

Qi Dong

309

0

0

23 Jun 2024

Beyond the Doors of Perception: Vision Transformers Represent Relations
Between Objects

Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects

Michael A. Lepori

Alexa R. Tartaglini

Brenden M. Lake

223

15

0

22 Jun 2024

Beyond Individual Facts: Investigating Categorical Knowledge Locality of
Taxonomy and Meronomy Concepts in GPT Models

Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models

Christopher Burger

Yifan Hu

185

0

0

22 Jun 2024

Steering Without Side Effects: Improving Post-Deployment Control of
Language Models

Steering Without Side Effects: Improving Post-Deployment Control of Language Models

Asa Cooper Stickland

Alexander Lyzhov

Salsabila Mahdi

Samuel R. Bowman

244

38

0

21 Jun 2024

Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons

Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons

344

26

0

20 Jun 2024

1 2 3...16 17 18...26 27 28