Papers citing 'Position: Understanding LLMs Requires More Than Statistical Generalization'

Title
Out-of-distribution Tests Reveal Compositionality in Chess Transformers Anna Mészáros Patrik Reizinger Ferenc Huszár CoGe 92 0 0 23 Oct 2025
When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective Beatrix M. G. Nielsen Emanuele Marconato Andrea Dittadi Luigi Gresele 195 2 0 04 Jun 2025
Position: An Empirically Grounded Identifiability Theory Will Accelerate Self-Supervised Learning Research Patrik Reizinger Randall Balestriero David Klindt Wieland Brendel 531 2 0 17 Apr 2025
Out-of-distribution generalization via composition: a lens through induction heads in Transformers. Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024 Jiajun Song Zhuoyan Xu Yiqiao Zhong 276 19 0 31 Dec 2024
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling. International Conference on Artificial Intelligence and Statistics (AISTATS), 2024 Emanuele Marconato Sébastien Lachapelle Sebastian Weichwald Luigi Gresele 308 6 0 30 Oct 2024
Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models Tyler Bell Avinash Mudireddy R. Mudumbai Soura Dasgupta 207 3 0 22 May 2024