Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2202.05262
Cited By

Locating and Editing Factual Associations in GPT

v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022

10 February 2022

Yonatan Belinkov

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,368 papers shown

Learning without training: The implicit dynamics of in-context learning

Learning without training: The implicit dynamics of in-context learning

644

21

0

24 Dec 2025

EvoEdit: Lifelong Free-Text Knowledge Editing through Latent Perturbation Augmentation and Knowledge-driven Parameter Fusion

EvoEdit: Lifelong Free-Text Knowledge Editing through Latent Perturbation Augmentation and Knowledge-driven Parameter Fusion

378

0

0

04 Dec 2025

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

153

0

0

04 Dec 2025

RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning

RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning

Guoshenghui Zhao

198

0

0

04 Dec 2025

Too Late to Recall: Explaining the Two-Hop Problem in Multimodal Knowledge Retrieval

Too Late to Recall: Explaining the Two-Hop Problem in Multimodal Knowledge Retrieval

Constantin Venhoff

84

0

0

02 Dec 2025

Latent Debate: A Surrogate Framework for Interpreting LLM Thinking

Latent Debate: A Surrogate Framework for Interpreting LLM Thinking

221

0

0

01 Dec 2025

Hybrid-DMKG: A Hybrid Reasoning Framework over Dynamic Multimodal Knowledge Graphs for Multimodal Multihop QA with Knowledge Editing

Hybrid-DMKG: A Hybrid Reasoning Framework over Dynamic Multimodal Knowledge Graphs for Multimodal Multihop QA with Knowledge Editing

Changmeng Zheng

295

0

0

30 Nov 2025

Mechanistic Interpretability for Transformer-based Time Series Classification

Mechanistic Interpretability for Transformer-based Time Series Classification

Matīss Kalnāre

Sofoklis Kitharidis

362

0

0

26 Nov 2025

CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation

CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation

...

266

0

0

25 Nov 2025

Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits

Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits

82

3

0

25 Nov 2025

Emergence and Localisation of Semantic Role Circuits in LLMs

Emergence and Localisation of Semantic Role Circuits in LLMs

Danilo S. Carvalho

90

0

0

25 Nov 2025

Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model

Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model

Payel Mukhopadhyay

525

3

0

25 Nov 2025

Representation Interventions Enable Lifelong Unstructured Knowledge Control

Representation Interventions Enable Lifelong Unstructured Knowledge Control

Zhengzhang Chen

195

0

0

25 Nov 2025

Dissecting the Ledger: Locating and Suppressing "Liar Circuits" in Financial Large Language Models

Dissecting the Ledger: Locating and Suppressing "Liar Circuits" in Financial Large Language Models

186

0

0

24 Nov 2025

No Free Lunch in Language Model Bias Mitigation? Targeted Bias Reduction Can Exacerbate Unmitigated LLM Biases

No Free Lunch in Language Model Bias Mitigation? Targeted Bias Reduction Can Exacerbate Unmitigated LLM Biases

133

1

0

23 Nov 2025

Curvature-Aware Safety Restoration In LLMs Fine-Tuning

Curvature-Aware Safety Restoration In LLMs Fine-Tuning

Thanh Nguyen-Tang

T. Hoang Ngan Le

148

1

0

22 Nov 2025

An Efficient LLM-based Evolutional Recommendation with Locate-Forget-Update Paradigm

220

0

0

20 Nov 2025

BlockCert: Certified Blockwise Extraction of Transformer Mechanisms

BlockCert: Certified Blockwise Extraction of Transformer Mechanisms

84

0

0

20 Nov 2025

Anatomy of an Idiom: Tracing Non-Compositionality in Language Models

199

0

0

20 Nov 2025

Beyond Tokens in Language Models: Interpreting Activations through Text Genre Chunks

Éloïse Benito-Rodriguez

Nicky Pochinkov

115

1

0

20 Nov 2025

Erase to Retain: Low Rank Adaptation Guided Selective Unlearning in Medical Segmentation Networks

Md. Golam Rabiul Alam

335

0

0

20 Nov 2025

From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Niranjan Chebrolu

Gerard Christopher Yeo

225

0

0

16 Nov 2025

MolEdit: Knowledge Editing for Multimodal Molecule Language Models

MolEdit: Knowledge Editing for Multimodal Molecule Language ModelsWeb Search and Data Mining (WSDM), 2025

425

1

0

16 Nov 2025

Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing

Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing

236

0

0

16 Nov 2025

Catastrophic Forgetting in Kolmogorov-Arnold Networks

Catastrophic Forgetting in Kolmogorov-Arnold Networks

Mohammad Marufur Rahman

228

1

0

16 Nov 2025

A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric Knowledge

A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric KnowledgeIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

61

0

0

14 Nov 2025

Forgetting-MarI: LLM Unlearning via Marginal Information Regularization

Forgetting-MarI: LLM Unlearning via Marginal Information Regularization

Stefan Broecker

Thomas Strohmer

432

0

0

14 Nov 2025

Training Language Models to Explain Their Own Computations

Training Language Models to Explain Their Own Computations

Jacob Steinhardt

271

7

0

11 Nov 2025

On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception

On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception

Sanaz Saki Norouzi

Mohammad Masjedi

144

0

0

09 Nov 2025

You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations

You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations

Yaniv Nemcovsky

Ravid Shwartz Ziv

155

2

0

09 Nov 2025

SR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention

SR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention

200

0

0

09 Nov 2025

Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts

Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts

Kowshik Thopalli

178

0

0

08 Nov 2025

Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation

Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation

297

4

0

08 Nov 2025

Can Fine-Tuning Erase Your Edits? On the Fragile Coexistence of Knowledge Editing and Adaptation

Can Fine-Tuning Erase Your Edits? On the Fragile Coexistence of Knowledge Editing and Adaptation

Jorg Schlotterer

639

0

0

08 Nov 2025

Stemming Hallucination in Language Models Using a Licensing Oracle

Stemming Hallucination in Language Models Using a Licensing Oracle

Simeon Emanuilov

Richard Ackermann

198

0

0

08 Nov 2025

APP: Accelerated Path Patching with Task-Specific Pruning

APP: Accelerated Path Patching with Task-Specific Pruning

Frauke Andersen

Carsten Eickhoff

85

0

0

07 Nov 2025

First is Not Really Better Than Last: Evaluating Layer Choice and Aggregation Strategies in Language Model Data Influence Estimation

First is Not Really Better Than Last: Evaluating Layer Choice and Aggregation Strategies in Language Model Data Influence Estimation

Anshuman Chhabra

402

1

0

06 Nov 2025

Addressing divergent representations from causal interventions on neural networks

Addressing divergent representations from causal interventions on neural networks

Simon Jerome Han

Alexa R. Tartaglini

Christopher Potts

498

0

0

06 Nov 2025

Understanding Robustness of Model Editing in Code LLMs: An Empirical Study

Understanding Robustness of Model Editing in Code LLMs: An Empirical Study

183

0

0

05 Nov 2025

Relational Deep Dive: Error-Aware Queries Over Unstructured Data

Relational Deep Dive: Error-Aware Queries Over Unstructured Data

132

1

0

04 Nov 2025

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

173

0

0

03 Nov 2025

Democratizing LLM Efficiency: From Hyperscale Optimizations to Universal Deployability

Democratizing LLM Efficiency: From Hyperscale Optimizations to Universal Deployability

103

0

0

03 Nov 2025

Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs

Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs

159

0

0

31 Oct 2025

ParaScopes: What do Language Models Activations Encode About Future Text?

ParaScopes: What do Language Models Activations Encode About Future Text?

Nicky Pochinkov

Sai V R Chereddy

245

0

0

31 Oct 2025

Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling

Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling

234

1

0

30 Oct 2025

LLMs Process Lists With General Filter Heads

LLMs Process Lists With General Filter Heads

Arnab Sen Sharma

Giordano Rogers

Natalie Shapira

175

2

0

30 Oct 2025

Understanding Hardness of Vision-Language Compositionality from A Token-level Causal Lens

Understanding Hardness of Vision-Language Compositionality from A Token-level Causal Lens

121

0

0

30 Oct 2025

A Survey on Unlearning in Large Language Models

A Survey on Unlearning in Large Language Models

746

1

0

29 Oct 2025

MemEIC: A Step Toward Continual and Compositional Knowledge Editing

MemEIC: A Step Toward Continual and Compositional Knowledge Editing

Wencke Liermann

326

0

0

29 Oct 2025

Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs

Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs

Pranav Bhandari

Sanjeevan Selvaganapathy

223

2

0

29 Oct 2025

1 2 3 4...26 27 28

Page 1 of 28

Pageof 28