Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2202.05262
Cited By

Locating and Editing Factual Associations in GPT

v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022

10 February 2022

Yonatan Belinkov

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown

Understanding Multi-View Transformers

Understanding Multi-View Transformers

Vincent Sitzmann

94

1

0

28 Oct 2025

The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?

The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?

Costas Mavromatis

Huzefa Rangwala

98

0

0

28 Oct 2025

Sequences of Logits Reveal the Low Rank Structure of Language Models

Sequences of Logits Reveal the Low Rank Structure of Language Models

Abhishek Shetty

87

2

0

28 Oct 2025

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

Benjamin Bergen

131

0

0

28 Oct 2025

PAHQ: Accelerating Automated Circuit Discovery through Mixed-Precision Inference Optimization

PAHQ: Accelerating Automated Circuit Discovery through Mixed-Precision Inference Optimization

194

2

0

27 Oct 2025

Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs

Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs

358

1

0

25 Oct 2025

Probing Neural Combinatorial Optimization Models

Probing Neural Combinatorial Optimization Models

Hoong Chuin Lau

107

0

0

25 Oct 2025

Dynamic Retriever for In-Context Knowledge Editing via Policy Optimization

Dynamic Retriever for In-Context Knowledge Editing via Policy Optimization

Mahmud Wasif Nafee

185

3

0

24 Oct 2025

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Valentino Maiorca

Francesco Locatello

Alberto Cazzaniga

124

2

0

24 Oct 2025

Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples

Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples

Pratyusha Sharma

118

0

0

23 Oct 2025

The Impact of Negated Text on Hallucination with Large Language Models

The Impact of Negated Text on Hallucination with Large Language Models

144

0

0

23 Oct 2025

Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention

Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention

José Luis Redondo García

Konstantina Palla

Hugues Bouchard

96

0

0

22 Oct 2025

ToMMeR -- Efficient Entity Mention Detection from Large Language Models

ToMMeR -- Efficient Entity Mention Detection from Large Language Models

Benjamin Piwowarski

188

0

0

22 Oct 2025

Restoring Pruned Large Language Models via Lost Component Compensation

Restoring Pruned Large Language Models via Lost Component Compensation

Jia Jim Deryl Chua

141

0

0

22 Oct 2025

When Do Transformers Learn Heuristics for Graph Connectivity?

When Do Transformers Learn Heuristics for Graph Connectivity?

159

0

0

22 Oct 2025

How Do LLMs Use Their Depth?

How Do LLMs Use Their Depth?

Gopala Anumanchipalli

81

0

0

21 Oct 2025

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

Cameron Churchwell

Mitchell Hermon

Yekaterina Yegorova

Mark Hasegawa-Johnson

133

0

0

21 Oct 2025

DePass: Unified Feature Attributing by Simple Decomposed Forward Pass

DePass: Unified Feature Attributing by Simple Decomposed Forward Pass

162

0

0

21 Oct 2025

How role-play shapes relevance judgment in zero-shot LLM rankers

How role-play shapes relevance judgment in zero-shot LLM rankers

Panagiotis Eustratiadis

90

0

0

20 Oct 2025

Atomic Literary Styling: Mechanistic Manipulation of Prose Generation in Neural Language Models

Atomic Literary Styling: Mechanistic Manipulation of Prose Generation in Neural Language Models

Tsogt-Ochir Enkhbayar

135

0

0

19 Oct 2025

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

...

Sung-Feng Huang

Chao-Han Huck Yang

184

0

0

19 Oct 2025

EditMark: Watermarking Large Language Models based on Model Editing

EditMark: Watermarking Large Language Models based on Model Editing

234

0

0

18 Oct 2025

Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization

Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization

Christos Thrampoulidis

109

0

0

17 Oct 2025

Rethinking Cross-lingual Gaps from a Statistical Viewpoint

Rethinking Cross-lingual Gaps from a Statistical Viewpoint

Partha Talukdar

112

0

0

17 Oct 2025

Emergence of Linear Truth Encodings in Language Models

Emergence of Linear Truth Encodings in Language Models

Shauli Ravfogel

144

2

0

17 Oct 2025

Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs

Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs

Alireza S. Ziabari

Morteza Dehghani

137

0

0

16 Oct 2025

Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks

Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks

92

0

0

16 Oct 2025

Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests

Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests

Fitim Abdullahu

85

0

0

15 Oct 2025

The Mechanistic Emergence of Symbol Grounding in Language Models

The Mechanistic Emergence of Symbol Grounding in Language Models

Josue Torres-Fonseca

184

2

0

15 Oct 2025

MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts

MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts

...

461

0

0

15 Oct 2025

DSCD: Large Language Model Detoxification with Self-Constrained Decoding

DSCD: Large Language Model Detoxification with Self-Constrained Decoding

107

1

0

15 Oct 2025

Position: Require Frontier AI Labs To Release Small "Analog" Models

Position: Require Frontier AI Labs To Release Small "Analog" Models

Shriyash Upadhyay

Chaithanya Bandi

72

0

0

15 Oct 2025

Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability

Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability

Bianca Raimondi

Daniela Dalbagno

Maurizio Gabbrielli

100

0

0

14 Oct 2025

Exploring and Leveraging Class Vectors for Classifier Editing

Exploring and Leveraging Class Vectors for Classifier Editing

193

0

0

13 Oct 2025

CoSPED: Consistent Soft Prompt Targeted Data Extraction and Defense

CoSPED: Consistent Soft Prompt Targeted Data Extraction and Defense

250

0

0

13 Oct 2025

The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers

The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers

Saad Obaid ul Islam

230

0

0

13 Oct 2025

Medical Interpretability and Knowledge Maps of Large Language Models

Medical Interpretability and Knowledge Maps of Large Language Models

Razvan Marinescu

Victoria-Elisabeth Gruber

239

0

0

13 Oct 2025

Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning

Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning

Martina G. Vilas

Safoora Yousefi

Vidhisha Balachandran

115

1

0

12 Oct 2025

STEAM: A Semantic-Level Knowledge Editing Framework for Large Language Models

STEAM: A Semantic-Level Knowledge Editing Framework for Large Language Models

Geunyeong Jeong

155

0

0

12 Oct 2025

PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration

PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration

301

4

0

11 Oct 2025

EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

81

0

0

11 Oct 2025

The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities

The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities

129

1

0

11 Oct 2025

Large Language Models Do NOT Really Know What They Don't Know

Large Language Models Do NOT Really Know What They Don't Know

158

0

0

10 Oct 2025

On the Representations of Entities in Auto-regressive Large Language Models

On the Representations of Entities in Auto-regressive Large Language Models

Benjamin Piwowarski

121

0

0

10 Oct 2025

Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs

Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs

191

1

0

10 Oct 2025

Transmuting prompts into weights

Transmuting prompts into weights

Javier Gonzalvo

167

0

0

09 Oct 2025

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

78

0

0

09 Oct 2025

SIMU: Selective Influence Machine Unlearning

SIMU: Selective Influence Machine Unlearning

Dilek Hakkani-Tur

118

0

0

09 Oct 2025

Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots

Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots

Damir Nurtdinov

Aliaksei Korshuk

Alexander Maloletov

84

0

0

09 Oct 2025

How to Teach Large Multimodal Models New Skills

How to Teach Large Multimodal Models New Skills

173

0

0

09 Oct 2025

1 2 3 4 5...26 27 28

Page 2 of 28

Pageof 28