Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.13151
Cited By
MIB: A Mechanistic Interpretability Benchmark
17 April 2025
Aaron Mueller
Atticus Geiger
Sarah Wiegreffe
Dana Arad
Iván Arcuschin
Adam Belfki
Yik Siu Chan
Jaden Fiotto-Kaufman
Tal Haklay
Michael Hanna
Jing Huang
Rohan Gupta
Yaniv Nikankin
Hadas Orgad
Nikhil Prakash
Anja Reusch
Aruna Sankaranarayanan
Shun Shao
Alessandro Stolfo
Martin Tutek
Amir Zur
David Bau
Yonatan Belinkov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MIB: A Mechanistic Interpretability Benchmark"
1 / 1 papers shown
Title
Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models
Tyler A. Chang
Benjamin Bergen
38
0
0
21 Apr 2025
1