Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.13417
Cited By
VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers
22 May 2023
Shahar Katz
Yonatan Belinkov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers"
6 / 6 papers shown
Title
Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models
Xiyu Liu
Zhengxiao Liu
Naibin Gu
Zheng-Shen Lin
Wanli Ma
Ji Xiang
Weiping Wang
KELM
44
0
0
27 Aug 2024
Finding Transformer Circuits with Edge Pruning
Adithya Bhaskar
Alexander Wettig
Dan Friedman
Danqi Chen
53
14
0
24 Jun 2024
Knowledge Circuits in Pretrained Transformers
Yunzhi Yao
Ningyu Zhang
Zekun Xi
Meng Wang
Ziwen Xu
Shumin Deng
Huajun Chen
KELM
64
19
0
28 May 2024
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small
Kevin Wang
Alexandre Variengien
Arthur Conmy
Buck Shlegeris
Jacob Steinhardt
210
486
0
01 Nov 2022
Outliers Dimensions that Disrupt Transformers Are Driven by Frequency
Giovanni Puccetti
Anna Rogers
Aleksandr Drozd
F. Dell’Orletta
68
42
0
23 May 2022
All Bark and No Bite: Rogue Dimensions in Transformer Language Models Obscure Representational Quality
William Timkey
Marten van Schijndel
213
110
0
09 Sep 2021
1