v1v2v3v4v5 (latest)

Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022

10 February 2022

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Locating and Editing Factual Associations in GPT"

13 / 1,363 papers shown

Causal Analysis of Syntactic Agreement Neurons in Multilingual Language ModelsConference on Computational Natural Language Learning (CoNLL), 2022

246

25 Oct 2022

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

391

21 Oct 2022

Revision Transformers: Instructing Language Models to Change their ValuesEuropean Conference on Artificial Intelligence (ECAI), 2022

268

19 Oct 2022

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable SurveyConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

Sachin Kumar

Vidhisha Balachandran

Lucille Njoo

Antonios Anastasopoulos

Yulia Tsvetkov

ELM

452

106

14 Oct 2022

Mass-Editing Memory in a TransformerInternational Conference on Learning Representations (ICLR), 2022

437

809

13 Oct 2022

Improving Data-Efficient Fossil Segmentation via Model Editing

198

08 Oct 2022

Learning by Distilling Context

Charles Burton Snell

Dan Klein

Ruiqi Zhong

ReLM LRM

615

30 Sep 2022

Extremely Simple Activation Shaping for Out-of-Distribution DetectionInternational Conference on Learning Representations (ICLR), 2022

Andrija Djurisic

452

206

20 Sep 2022

The Alignment Problem from a Deep Learning PerspectiveInternational Conference on Learning Representations (ICLR), 2022

Richard Ngo

Lawrence Chan

Sören Mindermann

546

250

30 Aug 2022

Net2Brain: A Toolbox to compare artificial vision models with human brain responses

Domenic Bersch

Kshitij Dwivedi

Martina G. Vilas

Radoslaw Martin Cichy

Gemma Roig

253

20 Aug 2022

Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks

Tilman Raukur

A. Ho

Stephen Casper

Dylan Hadfield-Menell

AAML AI4CE

782

170

27 Jul 2022

Memory-Based Model Editing at ScaleInternational Conference on Machine Learning (ICML), 2022

E. Mitchell

Charles Lin

Antoine Bosselut

Christopher D. Manning

Chelsea Finn

KELM

372

465

13 Jun 2022

Inducing Causal Structure for Interpretable Neural Networks

384

01 Dec 2021