Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

24 September 2023

Papers citing "Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve"

28 / 28 papers shown

Title
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback Dongwei Jiang Alvin Zhang Andrew Wang Nicholas Andrews Daniel Khashabi LRM 36 0 0 13 Jun 2025
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Parshin Shojaee Iman Mirzadeh Keivan Alizadeh Maxwell Horton Samy Bengio Mehrdad Farajtabar LRM 40 9 0 07 Jun 2025
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs Jakub Podolak Rajeev Verma ReLM LRM 34 0 0 28 May 2025
Large Language Models' Reasoning Stalls: An Investigation into the Capabilities of Frontier Models Lachlan McGinness Peter Baumgartner ReLM LRM ELM 94 1 0 26 May 2025
The Price of Format: Diversity Collapse in LLMs Longfei Yun Chenyang An Zilong Wang Letian Peng Jingbo Shang 51 0 0 25 May 2025
Base Models Beat Aligned Models at Randomness and Creativity Peter West Christopher Potts 476 4 0 30 Apr 2025
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Vaishnavh Nagarajan Chen Henry Wu Charles Ding Aditi Raghunathan 126 0 0 21 Apr 2025
Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning Jonathan Shaki Emanuele La Malfa Michael Wooldridge Sarit Kraus LRM ReLM 160 0 0 13 Mar 2025
Learning richness modulates equality reasoning in neural networks William L. Tong Cengiz Pehlevan 71 0 0 12 Mar 2025
Constructions are Revealed in Word Distributions J. Rozner Leonie Weissweiler Kyle Mahowald Cory Shain 89 1 0 08 Mar 2025
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models Yukang Yang Declan Campbell Kaixuan Huang Mengdi Wang Jonathan D. Cohen Taylor Webb LRM 195 5 0 27 Feb 2025
Exploring and Controlling Diversity in LLM-Agent Conversation Kuanchao Chu Yi-Pei Chen Hideki Nakayama LLMAG 156 1 0 24 Feb 2025
Reasoning about Affordances: Causal and Compositional Reasoning in LLMs Magnus F. Gjerde Vanessa Cheung David Lagnado ReLM LRM 109 0 0 23 Feb 2025
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models Martina Miliani S. Auriemma Alessandro Bondielli Emmanuele Chersoni Lucia Passaro Irene Sucameli Alessandro Lenci LRM ELM 90 0 0 21 Feb 2025
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Rose E. Wang Ana T. Ribeiro Carly Robinson Susanna Loeb Dora Demszky 162 17 0 28 Jan 2025
Quantifying artificial intelligence through algorithmic generalization Takuya Ito Murray Campbell L. Horesh Tim Klinger Parikshit Ram ELM 126 0 0 08 Nov 2024
United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections Leah von der Heyde Anna Haensch Alexander Wenz Bolei Ma 153 2 0 29 Aug 2024
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience Zhonghao He Jascha Achterberg Katie Collins Kevin K. Nejad Danyal Akarca ... Chole Li Kai J. Sandbrink Stephen Casper Anna Ivanova Grace W. Lindsay AI4CE 110 2 0 22 Aug 2024
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding Renato Vukovic David Arps Carel van Niekerk Benjamin Matthias Ruppik Hsien-chin Lin Michael Heck Milica Gašić 107 1 0 05 Aug 2024
The Remarkable Robustness of LLMs: Stages of Inference? Vedang Lad Wes Gurnee Max Tegmark Max Tegmark 138 53 0 27 Jun 2024
What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering Federico Errica G. Siracusano D. Sanvito Roberto Bifulco 164 26 0 18 Jun 2024
Does learning the right latent variables necessarily improve in-context learning? Sarthak Mittal Eric Elmoznino Léo Gagnon Sangnie Bhardwaj Tom Marty Dhanya Sridhar Guillaume Lajoie 96 7 0 29 May 2024
Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model Siyang Wang Éva Székely 107 6 0 16 May 2024
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs Aaditya K. Singh DJ Strouse 122 61 0 22 Feb 2024
Properties and Challenges of LLM-Generated Explanations Jenny Kunz Marco Kuhlmann 97 24 0 16 Feb 2024
Incoherent Probability Judgments in Large Language Models Jian-Qiao Zhu Thomas Griffiths 171 8 0 30 Jan 2024
Physics simulation capabilities of LLMs M. Ali-Dib Kristen Menou ELM AI4CE 48 0 0 04 Dec 2023
Large language models predict human sensory judgments across six modalities Raja Marjieh Ilia Sucholutsky Pol van Rijn Nori Jacoby Thomas Griffiths VLM 98 44 0 02 Feb 2023