Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2202.05262
Cited By

Locating and Editing Factual Associations in GPT

v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022

10 February 2022

Yonatan Belinkov

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown

Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers

Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers

Cile van Marken

172

0

0

08 Oct 2025

POME: Post Optimization Model Edit via Muon-style Projection

POME: Post Optimization Model Edit via Muon-style Projection

97

0

0

08 Oct 2025

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

145

0

0

07 Oct 2025

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Somayajulu G Sripada

276

1

0

07 Oct 2025

LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization

LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization

Nishant Subramani

151

0

0

05 Oct 2025

Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion

Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion

Jingxiang Zhang

212

1

0

05 Oct 2025

Mechanistic Interpretability of Socio-Political Frames in Language Models

Mechanistic Interpretability of Socio-Political Frames in Language Models

94

0

0

04 Oct 2025

Allocation of Parameters in Transformers

Allocation of Parameters in Transformers

160

0

0

04 Oct 2025

Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models

Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models

Sagnik Ray Choudhury

Sekh Mainul Islam

Isabelle Augenstein

197

0

0

03 Oct 2025

What Drives Compositional Generalization in Visual Generative Models?

What Drives Compositional Generalization in Visual Generative Models?

Yumna Ali Alnaggar

Cordelia Schmid

324

0

0

03 Oct 2025

Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation

Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation

64

0

0

03 Oct 2025

Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs

Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs

Fatmazohra Rezkellah

Ramzi Dakhmouche

224

1

0

03 Oct 2025

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

132

1

0

02 Oct 2025

Multimodal Function Vectors for Spatial Relations

Multimodal Function Vectors for Spatial Relations

Esther Goldberg

86

0

0

02 Oct 2025

Diagnosing Bottlenecks in Data Visualization Understanding by Vision-Language Models

Diagnosing Bottlenecks in Data Visualization Understanding by Vision-Language Models

Alexa R. Tartaglini

Christopher Potts

100

0

0

02 Oct 2025

Unraveling Syntax: How Language Models Learn Context-Free Grammars

Unraveling Syntax: How Language Models Learn Context-Free Grammars

Laura Ying Schulz

Daniel Mitropolsky

107

0

0

02 Oct 2025

Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving

Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving

213

0

0

02 Oct 2025

Auditing Algorithmic Bias in Transformer-Based Trading

Auditing Algorithmic Bias in Transformer-Based Trading

199

1

0

01 Oct 2025

Microsaccade-Inspired Probing: Positional Encoding Perturbations Reveal LLM Misbehaviours

Microsaccade-Inspired Probing: Positional Encoding Perturbations Reveal LLM Misbehaviours

148

0

0

01 Oct 2025

Energy-Regularized Sequential Model Editing on Hyperspheres

Energy-Regularized Sequential Model Editing on Hyperspheres

220

0

0

01 Oct 2025

Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG

Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG

François Portet

166

1

0

01 Oct 2025

On Predictability of Reinforcement Learning Dynamics for Large Language Models

On Predictability of Reinforcement Learning Dynamics for Large Language Models

152

0

0

01 Oct 2025

Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation

Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation

128

0

0

01 Oct 2025

KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning

KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning

Marios Savvides

189

0

0

01 Oct 2025

Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document

Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document

Adnan Ben Mansour

131

0

0

30 Sep 2025

Muon Outperforms Adam in Tail-End Associative Memory Learning

Muon Outperforms Adam in Tail-End Associative Memory Learning

Vincent Y. F. Tan

156

3

0

30 Sep 2025

Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions

Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions

145

1

0

30 Sep 2025

Pretraining with hierarchical memories: separating long-tail and common knowledge

Pretraining with hierarchical memories: separating long-tail and common knowledge

Hadi Pouransari

Michael Kirchhof

243

1

0

29 Sep 2025

Inducing Dyslexia in Vision Language Models

Inducing Dyslexia in Vision Language Models

Melika Honarmand

Badr AlKhamissi

Johannes Mehrer

Martin Schrimpf

304

0

0

29 Sep 2025

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

167

1

0

29 Sep 2025

TDHook: A Lightweight Framework for Interpretability

TDHook: A Lightweight Framework for Interpretability

131

0

0

29 Sep 2025

Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs

Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs

Hemanth Saratchandran

121

1

0

29 Sep 2025

Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models

Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models

144

1

0

29 Sep 2025

Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models

Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models

226

0

0

29 Sep 2025

Knowledge Editing with Subspace-Aware Key-Value Mappings

Knowledge Editing with Subspace-Aware Key-Value Mappings

294

0

0

29 Sep 2025

Circuit Distillation

Circuit Distillation

Byron C. Wallace

148

0

0

29 Sep 2025

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

138

1

0

29 Sep 2025

Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models

Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models

Vidhata Arjun Jayaraman

Moulik Choraria

Akhil Bhimaraju

383

0

0

29 Sep 2025

Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms

Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms

55

0

0

28 Sep 2025

Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

135

0

0

28 Sep 2025

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

374

0

0

28 Sep 2025

Knowledge Homophily in Large Language Models

Knowledge Homophily in Large Language Models

M. Halappanavar

Franck Dernoncourt

110

0

0

28 Sep 2025

Uncovering Grounding IDs: How External Cues Shape Multimodal Binding

Uncovering Grounding IDs: How External Cues Shape Multimodal Binding

Amirmohammad Izadi

Mobin Bagherian

Sadegh Mohammadian

323

0

0

28 Sep 2025

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement

183

0

0

28 Sep 2025

From Reasoning to Answer: Empirical, Attention-Based and Mechanistic Insights into Distilled DeepSeek R1 Models

From Reasoning to Answer: Empirical, Attention-Based and Mechanistic Insights into Distilled DeepSeek R1 Models

Saravan Rajmohan

108

0

0

28 Sep 2025

Language Model Planning from an Information Theoretic Perspective

Language Model Planning from an Information Theoretic Perspective

Muhammed Ustaomeroglu

Carlee Joe-Wong

133

0

0

28 Sep 2025

Fact Grounded Attention: Eliminating Hallucination in Large Language Models Through Attention Level Knowledge Integration

Fact Grounded Attention: Eliminating Hallucination in Large Language Models Through Attention Level Knowledge Integration

195

0

0

27 Sep 2025

Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2

Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2

133

0

0

27 Sep 2025

LLM Interpretability with Identifiable Temporal-Instantaneous Representation

LLM Interpretability with Identifiable Temporal-Instantaneous Representation

128

0

0

27 Sep 2025

Bilinear relational structure fixes reversal curse and enables consistent model editing

Bilinear relational structure fixes reversal curse and enables consistent model editing

377

0

0

26 Sep 2025

1 2 3 4 5 6...26 27 28