Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2202.05262
Cited By
v1
v2
v3
v4
v5 (latest)
Locating and Editing Factual Associations in GPT
Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Locating and Editing Factual Associations in GPT"
50 / 1,361 papers shown
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models
Shijie Han
Zhenyu Zhang
Andrei Arsene Simion
171
2
0
20 Jun 2024
Locating and Extracting Relational Concepts in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zijian Wang
Britney White
Chang Xu
KELM
211
1
0
19 Jun 2024
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries
Hitesh Wadhwa
Rahul Seetharaman
Somyaa Aggarwal
Reshmi Ghosh
Samyadeep Basu
Soundararajan Srinivasan
Wenlong Zhao
Shreyas Chaudhari
Ehsan Aghazadeh
RALM
157
15
0
18 Jun 2024
Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Eden Biran
Daniela Gottesman
Sohee Yang
Mor Geva
Amir Globerson
LRM
256
69
0
18 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daniela Gottesman
Mor Geva
269
28
0
18 Jun 2024
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Marius Mosbach
Vagrant Gautam
Tomás Vergara-Browne
Dietrich Klakow
Mor Geva
AI4CE
245
17
0
18 Jun 2024
Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
Baolong Bi
Shenghua Liu
Yiwei Wang
Lingrui Mei
Hongcheng Gao
Yilong Xu
Xueqi Cheng
KELM
190
15
0
18 Jun 2024
Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding
Weizhi Fei
Xueyan Niu
Guoqing Xie
Yanhua Zhang
Bo Bai
Lei Deng
Wei Han
LRM
KELM
RALM
246
11
0
18 Jun 2024
An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs
Daking Rai
Ziyu Yao
LRM
284
20
0
18 Jun 2024
SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models
Somnath Banerjee
Soham Tripathy
Sayan Layek
Shanu Kumar
Animesh Mukherjee
Rima Hazra
211
13
0
18 Jun 2024
A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Lijie Hu
Liang Liu
Shu Yang
Xin Chen
Songning Lai
Mengdi Li
Pan Zhou
Muhammad Asif Ali
Di Wang
LRM
317
12
0
18 Jun 2024
Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport
Minseok Choi
Daniel Rim
Dohyun Lee
Jaegul Choo
MU
KELM
264
1
0
18 Jun 2024
InternalInspector
I
2
I^2
I
2
: Robust Confidence Estimation in LLMs through Internal States
Mohammad Beigi
Ying Shen
Runing Yang
Zihao Lin
Qifan Wang
Ankith Mohan
Jianfeng He
Ming Jin
Chang-Tien Lu
Lifu Huang
HILM
253
20
0
17 Jun 2024
Soft Prompting for Unlearning in Large Language Models
Karuna Bhaila
Minh-Hao Van
Xintao Wu
MU
KELM
276
22
0
17 Jun 2024
Language Modeling with Editable External Knowledge
Belinda Z. Li
Emmy Liu
Alexis Ross
Abbas Zeitoun
Graham Neubig
Jacob Andreas
KELM
264
8
0
17 Jun 2024
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations
Rima Hazra
Sayan Layek
Somnath Banerjee
Soujanya Poria
KELM
LLMSV
267
23
0
17 Jun 2024
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
Jiakuan Xie
Pengfei Cao
Yuheng Chen
Yubo Chen
Kang Liu
Jun Zhao
KELM
256
5
0
17 Jun 2024
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
Boyi Deng
Wenjie Wang
Fengbin Zhu
Qifan Wang
Fuli Feng
246
19
0
17 Jun 2024
A Complete Survey on LLM-based AI Chatbots
Sumit Kumar Dam
Choong Seon Hong
Yu Qiao
Chaoning Zhang
288
131
0
17 Jun 2024
Self-training Large Language Models through Knowledge Detection
Wei Jie Yeo
Teddy Ferdinan
Przemyslaw Kazienko
Frank Xing
Erik Cambria
241
14
0
17 Jun 2024
The Fall of ROME: Understanding the Collapse of LLMs in Model Editing
Wanli Yang
Fei Sun
Jiajun Tan
Xinyu Ma
Du Su
D. Yin
Huawei Shen
KELM
133
0
0
17 Jun 2024
SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
CoGe
407
25
0
17 Jun 2024
Intrinsic Test of Unlearning Using Parametric Knowledge Traces
Yihuai Hong
Lei Yu
Haiqin Yang
Shauli Ravfogel
Mor Geva
KELM
MU
350
26
0
17 Jun 2024
In-Context Editing: Learning Knowledge from Self-Induced Distributions
Siyuan Qi
Bangcheng Yang
Kailin Jiang
Xiaobo Wang
Jiaqi Li
Yifan Zhong
Yaodong Yang
Zilong Zheng
KELM
575
15
0
17 Jun 2024
Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance
Somnath Banerjee
Avik Halder
Rajarshi Mandal
Sayan Layek
Ian Soboroff
Rima Hazra
Animesh Mukherjee
575
2
0
17 Jun 2024
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
Zhuoran Jin
Pengfei Cao
Chenhao Wang
Zhitao He
Hongbang Yuan
Jiachun Li
Yubo Chen
Kang Liu
Jun Zhao
KELM
MU
339
51
0
16 Jun 2024
Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals
Lida Chen
Zujie Liang
Xintao Wang
Jiaqing Liang
Yanghua Xiao
Feng Wei
Jinglei Chen
Zhenghong Hao
Bing Han
Wei Wang
212
26
0
16 Jun 2024
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning
Haoyu Wang
Tianci Liu
Ruirui Li
Monica Cheng
Tuo Zhao
Jing Gao
282
18
0
16 Jun 2024
DIEKAE: Difference Injection for Efficient Knowledge Augmentation and Editing of Large Language Models
Alessio Galatolo
Meriem Beloucif
Katie Winkle
163
0
0
15 Jun 2024
Knowledge Editing in Language Models via Adapted Direct Preference Optimization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Amit Rozner
Barak Battash
Lior Wolf
Ofir Lindenbaum
KELM
188
13
0
14 Jun 2024
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Tomer Ashuach
Martin Tutek
Yonatan Belinkov
MU
KELM
666
12
0
13 Jun 2024
Research Trends for the Interplay between Large Language Models and Knowledge Graphs
H. Khorashadizadeh
Fatima Zahra Amara
Morteza Ezzabady
Frédéric Ieng
Sanju Tiwari
Nandana Mihindukulasooriya
Jinghua Groppe
S. Sahri
Farah Benamara
Sven Groppe
409
19
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Hao-Heng Chen
Feiran Huang
Xiao Huang
853
144
0
12 Jun 2024
Towards Lifelong Learning of Large Language Models: A Survey
Junhao Zheng
Shengjie Qiu
Chengming Shi
Qianli Ma
KELM
CLL
286
66
0
10 Jun 2024
The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models
Ryosuke Takahashi
Go Kamoda
Benjamin Heinzerling
Keisuke Sakaguchi
Kentaro Inui
MU
KELM
181
1
0
10 Jun 2024
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jitai Hao
Weiwei Sun
Xin Xin
Qi Meng
Zhumin Chen
Sudipta Singha Roy
Zhaochun Ren
MoE
190
9
0
07 Jun 2024
Time Sensitive Knowledge Editing through Efficient Finetuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiou Ge
Ali Mousavi
Edouard Grave
Armand Joulin
Kun Qian
Benjamin Han
Mostafa Arefiyan
Yunyao Li
KELM
346
11
0
06 Jun 2024
Improving Alignment and Robustness with Circuit Breakers
Neural Information Processing Systems (NeurIPS), 2024
Andy Zou
Long Phan
Justin Wang
Derek Duenas
Maxwell Lin
Maksym Andriushchenko
Rowan Wang
Zico Kolter
Matt Fredrikson
Dan Hendrycks
AAML
624
210
0
06 Jun 2024
Understanding Information Storage and Transfer in Multi-modal Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Samyadeep Basu
Martin Grayson
C. Morrison
Besmira Nushi
Soheil Feizi
Daniela Massiceti
299
31
0
06 Jun 2024
Memorization in deep learning: A survey
Jiaheng Wei
Yanjun Zhang
Leo Yu Zhang
Ming Ding
Chao Chen
Kok-Leong Ong
Jun Zhang
Yang Xiang
303
18
0
06 Jun 2024
Interpreting the Second-Order Effects of Neurons in CLIP
Yossi Gandelsman
Alexei A. Efros
Jacob Steinhardt
MILM
450
32
0
06 Jun 2024
Outdated Issue Aware Decoding for Reasoning Questions on Edited Knowledge
Zengkui Sun
Yijin Liu
Jiaan Wang
Fandong Meng
Jinan Xu
Jinan Xu
Jie Zhou
KELM
209
0
0
05 Jun 2024
Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers
Brian K Chen
Tianyang Hu
Hui Jin
Hwee Kuan Lee
Kenji Kawaguchi
229
5
0
05 Jun 2024
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Min Cai
Yuchen Zhang
Shichang Zhang
Fan Yin
Difan Zou
Yisong Yue
Ziniu Hu
319
3
0
04 Jun 2024
LoFiT: Localized Fine-tuning on LLM Representations
Fangcong Yin
Xi Ye
Greg Durrett
268
41
0
03 Jun 2024
Decoupled Alignment for Robust Plug-and-Play Adaptation
Haozheng Luo
Jiahao Yu
Wenxin Zhang
Jialong Li
Jerry Yao-Chieh Hu
Xingyu Xing
Han Liu
390
11
0
03 Jun 2024
Understanding Token Probability Encoding in Output Embeddings
Hakaze Cho
Yoshihiro Sakai
Kenshiro Tanaka
Mariko Kato
Naoya Inoue
296
3
0
03 Jun 2024
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
Cheng-Hsun Hsueh
Paul Kuo-Ming Huang
Tzu-Han Lin
Che-Wei Liao
Hung-Chieh Fang
Chao-Wei Huang
Yun-Nung Chen
KELM
230
9
0
03 Jun 2024
From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation
Géraldin Nanfack
Michael Eickenberg
Eugene Belilovsky
FAtt
AAML
GNN
311
1
0
03 Jun 2024
Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience
Martina G. Vilas
Federico Adolfi
David Poeppel
Gemma Roig
312
10
0
03 Jun 2024
Previous
1
2
3
...
17
18
19
...
26
27
28
Next
Page 18 of 28
Page
of 28
Go