ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.05262
  4. Cited By
Locating and Editing Factual Associations in GPT
v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
    KELM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown
Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
Cile van Marken
Roxana Petcu
CML
169
0
0
08 Oct 2025
POME: Post Optimization Model Edit via Muon-style Projection
POME: Post Optimization Model Edit via Muon-style Projection
Yong Liu
Di Fu
Yang Luo
Zirui Zhu
Minhao Cheng
Cho-Jui Hsieh
Yang You
97
0
0
08 Oct 2025
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
Xinye Cao
Hongcan Guo
Jiawen Qian
Guoshun Nan
Chao Wang
Yuqi Pan
Tianhao Hou
X. Wang
Yutong Gao
VGen
141
0
0
07 Oct 2025
Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models
Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models
Gagan Bhatia
Somayajulu G Sripada
Kevin Allan
Jacobo Azcona
HILMLRM
274
1
0
07 Oct 2025
LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization
LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization
Jiarui Liu
Jivitesh Jain
Mona T. Diab
Nishant Subramani
148
0
0
05 Oct 2025
Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion
Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion
Jingxiang Zhang
Lujia Zhong
211
1
0
05 Oct 2025
Mechanistic Interpretability of Socio-Political Frames in Language Models
Mechanistic Interpretability of Socio-Political Frames in Language Models
Hadi Asghari
Sami Nenno
94
0
0
04 Oct 2025
Allocation of Parameters in Transformers
Allocation of Parameters in Transformers
Ruoxi Yu
Haotian Jiang
Jingpu Cheng
Penghao Yu
Qianxiao Li
Zhong Li
MoE
160
0
0
04 Oct 2025
Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models
Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models
Jingyi Sun
Pepa Atanasova
Sagnik Ray Choudhury
Sekh Mainul Islam
Isabelle Augenstein
197
0
0
03 Oct 2025
What Drives Compositional Generalization in Visual Generative Models?
What Drives Compositional Generalization in Visual Generative Models?
Karim Farid
Rajat Sahay
Yumna Ali Alnaggar
Simon Schrodi
Volker Fischer
Cordelia Schmid
Thomas Brox
CoGe
321
0
0
03 Oct 2025
Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation
Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation
Faruk Alpay
Hamdi Alakkad
64
0
0
03 Oct 2025
Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs
Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs
Fatmazohra Rezkellah
Ramzi Dakhmouche
AAMLMU
212
1
0
03 Oct 2025
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration
Yisu Wang
Ming Wang
Haoyuan Song
Wenjie Huang
Chaozheng Wang
Yi Xie
Xuming Ran
KELMMoMeCLL
125
1
0
02 Oct 2025
Multimodal Function Vectors for Spatial Relations
Multimodal Function Vectors for Spatial Relations
Shuhao Fu
Esther Goldberg
Ying Nian Wu
Hongjing Lu
85
0
0
02 Oct 2025
Diagnosing Bottlenecks in Data Visualization Understanding by Vision-Language Models
Diagnosing Bottlenecks in Data Visualization Understanding by Vision-Language Models
Alexa R. Tartaglini
Satchel Grant
Daniel Wurgaft
Christopher Potts
Judith E. Fan
96
0
0
02 Oct 2025
Unraveling Syntax: How Language Models Learn Context-Free Grammars
Unraveling Syntax: How Language Models Learn Context-Free Grammars
Laura Ying Schulz
Daniel Mitropolsky
Tomaso Poggio
ReLMLRMAI4CE
104
0
0
02 Oct 2025
Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving
Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving
Haibo Hu
Lianming Huang
X. Wang
Yufei Cui
Shangyu Wu
Nan Guan
Chun Jason Xue
VLM
207
0
0
02 Oct 2025
Auditing Algorithmic Bias in Transformer-Based Trading
Auditing Algorithmic Bias in Transformer-Based Trading
Armin Gerami
R. Duraiswami
190
0
0
01 Oct 2025
Microsaccade-Inspired Probing: Positional Encoding Perturbations Reveal LLM Misbehaviours
Microsaccade-Inspired Probing: Positional Encoding Perturbations Reveal LLM Misbehaviours
Rui Melo
Rui Abreu
C. Păsăreanu
141
0
0
01 Oct 2025
Energy-Regularized Sequential Model Editing on Hyperspheres
Energy-Regularized Sequential Model Editing on Hyperspheres
Qingyuan Liu
Jia-Chen Gu
Yunzhi Yao
Hong Wang
Nanyun Peng
KELM
220
0
0
01 Oct 2025
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG
Maxime Méloux
François Portet
Maxime Peyrard
166
1
0
01 Oct 2025
On Predictability of Reinforcement Learning Dynamics for Large Language Models
On Predictability of Reinforcement Learning Dynamics for Large Language Models
Yuchen Cai
Ding Cao
Xin Xu
Zijun Yao
Yuqing Huang
Zhenyu Tan
Benyi Zhang
Guiquan Liu
Junfeng Fang
136
0
0
01 Oct 2025
Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation
Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation
Wei Liu
Haomei Xu
Bingqing Liu
Zhiying Deng
H. Wang
Jun Wang
Ruixuan Li
Yee Whye Teh
Wee Sun Lee
KELM
124
0
0
01 Oct 2025
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
Yinyi Luo
Z. Zhou
Hao Chen
Kai Qiu
Marios Savvides
Shouqing Yang
James Evans
KELMMU
188
0
0
01 Oct 2025
Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document
Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document
Adnan Ben Mansour
Ayoub Karine
D. Naccache
130
0
0
30 Sep 2025
Muon Outperforms Adam in Tail-End Associative Memory Learning
Muon Outperforms Adam in Tail-End Associative Memory Learning
Shuche Wang
Fengzhuo Zhang
Jiaxiang Li
Cunxiao Du
C. Du
Tianyu Pang
Zhuoran Yang
Mingyi Hong
Vincent Y. F. Tan
132
2
0
30 Sep 2025
Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions
Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions
Junbeom Kim
Kyuyoung Kim
Jihoon Tack
Dongha Lim
Jinwoo Shin
MUKELM
145
1
0
30 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
RALMKELM
240
1
0
29 Sep 2025
Inducing Dyslexia in Vision Language Models
Inducing Dyslexia in Vision Language Models
Melika Honarmand
Ayati Sharma
Badr AlKhamissi
Johannes Mehrer
Martin Schrimpf
AI4Ed
296
0
0
29 Sep 2025
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
Haolei Xu
Xinyu Mei
Yuchen Yan
Rui Zhou
Wenqi Zhang
Weiming Lu
Yueting Zhuang
Yongliang Shen
LLMSV
164
1
0
29 Sep 2025
TDHook: A Lightweight Framework for Interpretability
TDHook: A Lightweight Framework for Interpretability
Yoann Poupart
AI4CE
128
0
0
29 Sep 2025
Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs
Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs
Arpit Garg
Hemanth Saratchandran
Ravi Garg
Simon Lucey
MUCLL
118
1
0
29 Sep 2025
Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models
Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models
Yuhui Wang
Changjiang Li
Guangke Chen
Jiacheng Liang
Ting Wang
ReLMKELMLRM
141
1
0
29 Sep 2025
Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models
Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models
Jitai Hao
Hao Liu
Xinyan Xiao
Qiang Huang
Jun Yu
220
0
0
29 Sep 2025
Knowledge Editing with Subspace-Aware Key-Value Mappings
Knowledge Editing with Subspace-Aware Key-Value Mappings
Haewon Park
Sangwoo Kim
Yohan Jo
KELM
294
0
0
29 Sep 2025
Circuit Distillation
Circuit Distillation
Somin Wadhwa
Silvio Amir
Byron C. Wallace
141
0
0
29 Sep 2025
How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models
How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models
Minsung Kim
Dong-Kyum Kim
Jea Kwon
Nakyeong Yang
Kyomin Jung
Meeyoung Cha
137
1
0
29 Sep 2025
Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models
Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models
Max Hartman
Vidhata Arjun Jayaraman
Moulik Choraria
Akhil Bhimaraju
Lav Varshney
VLM
380
0
0
29 Sep 2025
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
Jiahao Ying
Mingbao Lin
Qianru Sun
Yixin Cao
MoE
55
0
0
28 Sep 2025
Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer
Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer
Simon Schrodi
Elias Kempf
Fazl Barez
Thomas Brox
FedML
133
0
0
28 Sep 2025
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
Yoonah Park
Haesung Pyun
Yohan Jo
KELM
368
0
0
28 Sep 2025
Knowledge Homophily in Large Language Models
Knowledge Homophily in Large Language Models
Utkarsh Sahu
Zhisheng Qi
M. Halappanavar
Nedim Lipka
Ryan Rossi
Franck Dernoncourt
Yu Zhang
Yao Ma
Yu Wang
102
0
0
28 Sep 2025
Uncovering Grounding IDs: How External Cues Shape Multimodal Binding
Uncovering Grounding IDs: How External Cues Shape Multimodal Binding
Hosein Hasani
Amirmohammad Izadi
Fatemeh Askari
Mobin Bagherian
Sadegh Mohammadian
Mohammad Izadi
M. Baghshah
323
0
0
28 Sep 2025
Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Anyi Wang
Xuansheng Wu
Dong Shu
Yunpu Ma
Ninghao Liu
LLMSV
178
0
0
28 Sep 2025
From Reasoning to Answer: Empirical, Attention-Based and Mechanistic Insights into Distilled DeepSeek R1 Models
From Reasoning to Answer: Empirical, Attention-Based and Mechanistic Insights into Distilled DeepSeek R1 Models
Jue Zhang
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
LRM
108
0
0
28 Sep 2025
Language Model Planning from an Information Theoretic Perspective
Language Model Planning from an Information Theoretic Perspective
Muhammed Ustaomeroglu
Baris Askin
Gauri Joshi
Carlee Joe-Wong
Guannan Qu
133
0
0
28 Sep 2025
Fact Grounded Attention: Eliminating Hallucination in Large Language Models Through Attention Level Knowledge Integration
Fact Grounded Attention: Eliminating Hallucination in Large Language Models Through Attention Level Knowledge Integration
Aayush Gupta
HILM
194
0
0
27 Sep 2025
Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2
Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2
Stefan Arnold
Rene Gröbner
LLMSV
130
0
0
27 Sep 2025
LLM Interpretability with Identifiable Temporal-Instantaneous Representation
LLM Interpretability with Identifiable Temporal-Instantaneous Representation
Xiangchen Song
Jiaqi Sun
Zijian Li
Yujia Zheng
Kun Zhang
127
0
0
27 Sep 2025
Bilinear relational structure fixes reversal curse and enables consistent model editing
Bilinear relational structure fixes reversal curse and enables consistent model editing
Dong-Kyum Kim
Minsung Kim
Jea Kwon
Nakyeong Yang
Meeyoung Cha
KELM
368
0
0
26 Sep 2025
Previous
123456...262728
Next