Locating and Editing Factual Associations in GPT

Neural Information Processing Systems (NeurIPS), 2022
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
    KELM

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,361 papers shown
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRM HILM
458
1,998
0
09 Nov 2023
Future Lens: Anticipating Subsequent Tokens from a Single Hidden State
Koyena Pal
Jiuding Sun
Andrew Yuan
Byron C. Wallace
David Bau
204
91
0
08 Nov 2023
Massive Editing for Large Language Models via Meta Learning
Chenmien Tan
Ge Zhang
Jie Fu
KELM
284
58
0
08 Nov 2023
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
Michael Lan
Phillip H. S. Torr
Fazl Barez
LRM
379
8
0
07 Nov 2023
The Linear Representation Hypothesis and the Geometry of Large Language Models
International Conference on Machine Learning (ICML), 2023
Kiho Park
Yo Joong Choe
Victor Veitch
LLMSV MILM
485
335
0
07 Nov 2023
In-Context Exemplars as Clues to Retrieving from Large Associative Memory
Jiachen Zhao
292
15
0
06 Nov 2023
The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lovisa Hagström
Denitsa Saynova
Tobias Norlund
Moa Johansson
Richard Johansson
KELM HILM
175
12
0
02 Nov 2023
Training Dynamics of Contextual N-Grams in Language Models
Lucia Quirke
Lovis Heindrich
Wes Gurnee
Neel Nanda
261
6
0
01 Nov 2023
Defining a New NLP Playground
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sha Li
Chi Han
Pengfei Yu
Carl Edwards
Pengfei Yu
...
Yi R. Fung
Charles Yu
Joel R. Tetreault
Eduard H. Hovy
Heng Ji
386
5
0
31 Oct 2023
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xinwei Wu
Junzhuo Li
Minghui Xu
Weilong Dong
Shuangzhi Wu
Chao Bian
Deyi Xiong
MU KELM
382
83
0
31 Oct 2023
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
326
6
0
30 Oct 2023
A Survey on Knowledge Editing of Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
416
40
0
30 Oct 2023
Debiasing Algorithm through Model Adaptation
International Conference on Learning Representations (ICLR), 2023
Tomasz Limisiewicz
David Marecek
Tomáš Musil
476
21
0
29 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
International Conference on Machine Learning (ICML), 2023
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
217
41
0
26 Oct 2023
How do Language Models Bind Entities in Context?
International Conference on Learning Representations (ICLR), 2023
Jiahai Feng
Jacob Steinhardt
325
64
0
26 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Paul Youssef
Osman Alperen Koraş
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
315
28
0
25 Oct 2023
Attention Lens: A Tool for Mechanistically Interpreting the Attention Head Information Retrieval Mechanism
Mansi Sakarvadia
Arham Khan
Aswathy Ajith
Daniel Grzenda
Nathaniel Hudson
André Bauer
Kyle Chard
Ian Foster
484
18
0
25 Oct 2023
Knowledge Editing for Large Language Models: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Song Wang
Yaochen Zhu
Haochen Liu
Zaiyi Zheng
Chen Chen
Wenlin Yao
KELM
463
204
0
24 Oct 2023
In-Context Learning Creates Task Vectors
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Roee Hendel
Mor Geva
Amir Globerson
345
244
0
24 Oct 2023
Characterizing Mechanisms for Factual Recall in Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Qinan Yu
Jack Merullo
Ellie Pavlick
KELM
284
44
0
24 Oct 2023
SoK: Memorization in General-Purpose Large Language Models
Valentin Hartmann
Anshuman Suri
Vincent Bindschaedler
David Evans
Shruti Tople
Robert West
KELM LLMAG
328
37
0
24 Oct 2023
Unnatural language processing: How do language models handle machine-generated prompts?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Corentin Kervadec
Francesca Franzon
Marco Baroni
251
7
0
24 Oct 2023
Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Sunit Bhattacharya
Ondrej Bojar
171
16
0
24 Oct 2023
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
International Conference on Learning Representations (ICLR), 2023
Marah Abdin
Suriya Gunasekar
Varun Chandrasekaran
Jerry Li
Mert Yuksekgonul
Rahee Peshawaria
Ranjita Naik
Besmira Nushi
197
14
0
24 Oct 2023
Function Vectors in Large Language Models
International Conference on Learning Representations (ICLR), 2023
Eric Todd
Millicent Li
Arnab Sen Sharma
Aaron Mueller
Byron C. Wallace
David Bau
332
182
0
23 Oct 2023
Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT
Soo Hyun Ryu
171
1
0
20 Oct 2023
Understanding Addition in Transformers
Abir Harrasse
Fazl Barez
626
30
0
19 Oct 2023
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-Xiong Wang
431
49
0
19 Oct 2023
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
International Conference on Learning Representations (ICLR), 2023
Chongyu Fan
Jiancheng Liu
Yihua Zhang
Eric Wong
Dennis Wei
Sijia Liu
MU
550
263
0
19 Oct 2023
Getting aligned on representational alignment
Ilia Sucholutsky
Lukas Muttenthaler
Adrian Weller
Andi Peng
Andreea Bobu
...
Thomas Unterthiner
Andrew Kyle Lampinen
Klaus-Robert Muller
M. Toneva
Thomas Griffiths
333
138
0
18 Oct 2023
Emptying the Ocean with a Spoon: Should We Edit Models?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuval Pinter
Michael Elhadad
KELM
263
29
0
18 Oct 2023
From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks
Jae Hee Lee
Sergio Lanza
Stefan Wermter
238
18
0
18 Oct 2023
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
International Conference on Learning Representations (ICLR), 2023
Ming Zhong
Chenxin An
Weizhu Chen
Jiawei Han
Pengcheng He
367
16
0
17 Oct 2023
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations
International Conference on Learning Representations (ICLR), 2023
Tianyu Guo
Wei Hu
Song Mei
Huan Wang
Caiming Xiong
Silvio Savarese
Yu Bai
264
76
0
16 Oct 2023
Interpreting and Controlling Vision Foundation Models via Text Explanations
Haozhe Chen
Junfeng Yang
Carl Vondrick
Chengzhi Mao
206
8
0
16 Oct 2023
Attribution Patching Outperforms Automated Circuit Discovery
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Aaquib Syed
Can Rager
Arthur Conmy
369
102
0
16 Oct 2023
Untying the Reversal Curse via Bidirectional Language Model Editing
Jun-Yu Ma
Jia-Chen Gu
Zhen-Hua Ling
Quan Liu
Cong Liu
KELM
318
43
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM HILM
725
107
0
16 Oct 2023
VLIS: Unimodal Language Models Guide Multimodal Language Generation
Jiwan Chung
Youngjae Yu
VLM
253
2
0
15 Oct 2023
Measuring Feature Sparsity in Language Models
Mingyang Deng
Lucas Tao
Joe Benton
243
2
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Yongfeng Zhang
Xing Xie
Zheng Zhang
Yue Zhang
HILM KELM
465
261
0
11 Oct 2023
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zihan Zhang
Meng Fang
Lingxi Chen
Mohammad-Reza Namazi-Rad
Jun Wang
KELM
241
40
0
11 Oct 2023
An Adversarial Example for Direct Logit Attribution: Memory Management in gelu-4l
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
James Dao
Yeu-Tong Lau
Can Rager
Jett Janiak
384
5
0
11 Oct 2023
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
Samuel Marks
Max Tegmark
HILM
486
360
0
10 Oct 2023
A Meta-Learning Perspective on Transformers for Causal Language Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Xinbo Wu
Lav Varshney
304
8
0
09 Oct 2023
Factuality Challenges in the Era of Large Language Models
Isabelle Augenstein
Timothy Baldwin
Meeyoung Cha
Tanmoy Chakraborty
Giovanni Luca Ciampaglia
...
Rubén Míguez
Preslav Nakov
Dietram A. Scheufele
Shivam Sharma
Giovanni Zagni
HILM
412
54
0
08 Oct 2023
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Tian Jin
Nolan Clement
Xin Dong
Vaishnavh Nagarajan
Michael Carbin
Jonathan Ragan-Kelley
Gintare Karolina Dziugaite
LRM
342
5
0
07 Oct 2023
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
International Conference on Machine Learning (ICML), 2023
Arshia Soltani Moakhar
Eugenia Iofinova
Elias Frantar
Dan Alistarh
332
2
0
06 Oct 2023
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
Anna Langedijk
Hosein Mohebbi
Gabriele Sarti
Willem H. Zuidema
Jaap Jumelet
254
15
0
05 Oct 2023
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Deniz Bayazit
Negar Foroutan
Zeming Chen
Gail Weiss
Antoine Bosselut
KELM
261
19
0
04 Oct 2023