arXiv:2408.08590
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models
16 August 2024
Geonhee Kim, Marco Valentino, André Freitas
LRM
AI4CE
Papers citing "A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models"
3 / 3 papers shown
Title
Quantifying Logical Consistency in Transformers via Query-Key Alignment
Eduard Tulchinskii, Anastasia Voznyuk, Laida Kushnareva, Andrei Andriiainen, Irina Piontkovskaya, Evgeny Burnaev, Serguei Barannikov
LRM
61
0
0
24 Feb 2025
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small
Kevin Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt
210
486
0
01 Nov 2022
Scaling Laws for Neural Language Models
Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei
220
4,424
0
23 Jan 2020