Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2202.05262
Cited By
v1
v2
v3
v4
v5 (latest)
Locating and Editing Factual Associations in GPT
Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Locating and Editing Factual Associations in GPT"
50 / 1,361 papers shown
Understanding Multi-View Transformers
Michal Stary
Julien Gaubil
A. Tewari
Vincent Sitzmann
ViT
87
1
0
28 Oct 2025
The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?
Zihan Pengmei
Costas Mavromatis
Zhengyuan Shen
Yunyi Zhang
V. Ioannidis
Huzefa Rangwala
LRM
95
0
0
28 Oct 2025
Sequences of Logits Reveal the Low Rank Structure of Language Models
Noah Golowich
Allen Liu
Abhishek Shetty
80
2
0
28 Oct 2025
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
J. Michaelov
Roger P. Levy
Benjamin Bergen
AI4TS
128
0
0
28 Oct 2025
PAHQ: Accelerating Automated Circuit Discovery through Mixed-Precision Inference Optimization
Xinhai Wang
Shu Yang
Liangyu Wang
L. Zhang
Huanyi Xie
Lijie Hu
Di Wang
188
2
0
27 Oct 2025
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Jinzhe Liu
Junshu Sun
Shufan Shen
Chenxue Yang
Shuhui Wang
KELM
CLL
352
1
0
25 Oct 2025
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang
Yining Ma
Zhiguang Cao
Hoong Chuin Lau
104
0
0
25 Oct 2025
Dynamic Retriever for In-Context Knowledge Editing via Policy Optimization
Mahmud Wasif Nafee
Maiqi Jiang
Haipeng Chen
Yanfu Zhang
KELM
166
3
0
24 Oct 2025
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile
Valentino Maiorca
Diego Doimo
Francesco Locatello
Alberto Cazzaniga
118
2
0
24 Oct 2025
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
Shiva Sreeram
Alaa Maalouf
Pratyusha Sharma
Daniela Rus
112
0
0
23 Oct 2025
The Impact of Negated Text on Hallucination with Large Language Models
Jaehyung Seo
Hyeonseok Moon
Heuiseok Lim
143
0
0
23 Oct 2025
Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention
J Rosser
José Luis Redondo García
Gustavo Penha
Konstantina Palla
Hugues Bouchard
90
0
0
22 Oct 2025
ToMMeR -- Efficient Entity Mention Detection from Large Language Models
Victor Morand
Nadi Tomeh
Josiane Mothe
Benjamin Piwowarski
MoE
VLM
182
0
0
22 Oct 2025
Restoring Pruned Large Language Models via Lost Component Compensation
Zijian Feng
Hanzhang Zhou
Zixiao Zhu
Tianjiao Li
Jia Jim Deryl Chua
Lee Onn Mak
Gee Wah Ng
Kezhi Mao
141
0
0
22 Oct 2025
When Do Transformers Learn Heuristics for Graph Connectivity?
Qilin Ye
Deqing Fu
Robin Jia
Vatsal Sharan
156
0
0
22 Oct 2025
How Do LLMs Use Their Depth?
Akshat Gupta
Jay Yeung
Gopala Anumanchipalli
Anna Ivanova
81
0
0
21 Oct 2025
That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation
Jaesung Bae
Cameron Churchwell
Mitchell Hermon
Tsun-An Hsieh
Jocelyn Xu
Yekaterina Yegorova
Mark Hasegawa-Johnson
Heng Ji
124
0
0
21 Oct 2025
DePass: Unified Feature Attributing by Simple Decomposed Forward Pass
Xiangyu Hong
Che Jiang
Kai Tian
Biqing Qi
Youbang Sun
Ning Ding
Bowen Zhou
154
0
0
21 Oct 2025
How role-play shapes relevance judgment in zero-shot LLM rankers
Yumeng Wang
Jirui Qi
Catherine Chen
Panagiotis Eustratiadis
Suzan Verberne
78
0
0
20 Oct 2025
Atomic Literary Styling: Mechanistic Manipulation of Prose Generation in Neural Language Models
Tsogt-Ochir Enkhbayar
129
0
0
19 Oct 2025
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
Chih-Kai Yang
Yen-Ting Piao
Tzu-wen Hsu
Szu-Wei Fu
Zhehuai Chen
...
Sung-Feng Huang
Chao-Han Huck Yang
Y. Wang
Yun-Nung Chen
Hung-yi Lee
KELM
AuLLM
181
0
0
19 Oct 2025
EditMark: Watermarking Large Language Models based on Model Editing
Shuai Li
Kejiang Chen
Jun Jiang
Jie Zhang
Qiyi Yao
K. Zeng
W. Zhang
N. Yu
WaLM
KELM
226
0
0
18 Oct 2025
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Tina Behnia
Puneesh Deora
Christos Thrampoulidis
105
0
0
17 Oct 2025
Rethinking Cross-lingual Gaps from a Statistical Viewpoint
Vihari Piratla
Purvam Jain
Darshan Singh
Partha Talukdar
Trevor Cohn
112
0
0
17 Oct 2025
Emergence of Linear Truth Encodings in Language Models
Shauli Ravfogel
Gilad Yehudai
Tal Linzen
Joan Bruna
A. Bietti
KELM
141
2
0
17 Oct 2025
Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs
Parsa Hejabi
Elnaz Rahmati
Alireza S. Ziabari
Morteza Dehghani
AAML
LRM
132
0
0
16 Oct 2025
Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks
Kirill Semenov
Rico Sennrich
88
0
0
16 Oct 2025
Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests
Fitim Abdullahu
Helmut Grabner
78
0
0
15 Oct 2025
The Mechanistic Emergence of Symbol Grounding in Language Models
Shuyu Wu
Ziqiao Ma
Xiaoxi Luo
Yidong Huang
Josue Torres-Fonseca
Freda Shi
Joyce Chai
LRM
183
2
0
15 Oct 2025
MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts
Shujun Xia
Haokun Lin
Yichen Wu
Yinan Zhou
Zixuan Li
...
Yefeng Zheng
Xiang Li
Caifeng Shan
Zhenan Sun
Quanzheng Li
KELM
454
0
0
15 Oct 2025
DSCD: Large Language Model Detoxification with Self-Constrained Decoding
Ming Dong
Jinkui Zhang
Bolong Zheng
Xinhui Tu
Po Hu
Tingting He
101
1
0
15 Oct 2025
Position: Require Frontier AI Labs To Release Small "Analog" Models
Shriyash Upadhyay
Chaithanya Bandi
Narmeen Oozeer
Philip Quirke
63
0
0
15 Oct 2025
Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
Bianca Raimondi
Daniela Dalbagno
Maurizio Gabbrielli
AI4CE
95
0
0
14 Oct 2025
Exploring and Leveraging Class Vectors for Classifier Editing
Jaeik Kim
Jaeyoung Do
VLM
190
0
0
13 Oct 2025
CoSPED: Consistent Soft Prompt Targeted Data Extraction and Defense
Yang Zhuochen
Fok Kar Wai
Thing Vrizlynn
AAML
SILM
250
0
0
13 Oct 2025
The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers
Saad Obaid ul Islam
Anne Lauscher
Goran Glavaš
HILM
212
0
0
13 Oct 2025
Medical Interpretability and Knowledge Maps of Large Language Models
Razvan Marinescu
Victoria-Elisabeth Gruber
Diego Fajardo
FAtt
AI4MH
238
0
0
13 Oct 2025
Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning
Martina G. Vilas
Safoora Yousefi
Besmira Nushi
Eric Horvitz
Vidhisha Balachandran
LRM
111
1
0
12 Oct 2025
STEAM: A Semantic-Level Knowledge Editing Framework for Large Language Models
Geunyeong Jeong
Juoh Sun
Seonghee Lee
Harksoo Kim
KELM
148
0
0
12 Oct 2025
PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration
Manjiang Yu
Hongji Li
Priyanka Singh
X. Li
Di Wang
Lijie Hu
LLMSV
300
4
0
11 Oct 2025
EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing
Sicheng Lyu
Yu Gu
Xinyu Wang
Jerry Huang
Sitao Luan
Yufei Cui
Xiao-Wen Chang
Peng Lu
KELM
80
0
0
11 Oct 2025
The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities
Zixuan Qin
Kunlin Lyu
Qingchen Yu
Yifan Sun
Zhaoxin Fan
AAML
124
1
0
11 Oct 2025
Large Language Models Do NOT Really Know What They Don't Know
C. Cheang
Hou Pong Chan
Wenxuan Zhang
Yang Deng
HILM
154
0
0
10 Oct 2025
On the Representations of Entities in Auto-regressive Large Language Models
Victor Morand
Josiane Mothe
Benjamin Piwowarski
120
0
0
10 Oct 2025
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Xu Pan
Ely Hahami
Jingxuan Fan
Ziqian Xie
H. Sompolinsky
166
1
0
10 Oct 2025
Transmuting prompts into weights
Hanna Mazzawi
Benoit Dherin
Michael Munn
Michael Wunder
Javier Gonzalvo
LM&Ro
158
0
0
09 Oct 2025
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Jiayu Yang
Yuxuan Fan
Songning Lai
Shengen Wu
J. Tang
Chun Kang
Zhijiang Guo
Yutao Yue
KELM
73
0
0
09 Oct 2025
SIMU: Selective Influence Machine Unlearning
Anu Agarwal
Mihir Pamnani
Dilek Hakkani-Tur
MU
116
0
0
09 Oct 2025
Evaluation of a Robust Control System in Real-World Cable-Driven Parallel Robots
Damir Nurtdinov
Aliaksei Korshuk
Alexei Kornaev
Alexander Maloletov
73
0
0
09 Oct 2025
How to Teach Large Multimodal Models New Skills
Zhen Zhu
Yiming Gong
Yao Xiao
Yaoyao Liu
Derek Hoiem
MLLM
CLL
KELM
173
0
0
09 Oct 2025
Previous
1
2
3
4
5
...
26
27
28
Next