arXiv: 2309.13638
Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

24 September 2023
R. Thomas McCoy
Shunyu Yao
Dan Friedman
Matthew Hardy
Thomas Griffiths

Papers citing "Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve"

28 / 28 papers shown
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Dongwei Jiang
Alvin Zhang
Andrew Wang
Nicholas Andrews
Daniel Khashabi
LRM
36
0
0
13 Jun 2025
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Parshin Shojaee
Iman Mirzadeh
Keivan Alizadeh
Maxwell Horton
Samy Bengio
Mehrdad Farajtabar
LRM
40
9
0
07 Jun 2025
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
Jakub Podolak
Rajeev Verma
ReLM, LRM
34
0
0
28 May 2025
Large Language Models' Reasoning Stalls: An Investigation into the Capabilities of Frontier Models
Lachlan McGinness
Peter Baumgartner
ReLM, LRM, ELM
94
1
0
26 May 2025
The Price of Format: Diversity Collapse in LLMs
Longfei Yun
Chenyang An
Zilong Wang
Letian Peng
Jingbo Shang
51
0
0
25 May 2025
Base Models Beat Aligned Models at Randomness and Creativity
Peter West
Christopher Potts
476
4
0
30 Apr 2025
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
Vaishnavh Nagarajan
Chen Henry Wu
Charles Ding
Aditi Raghunathan
126
0
0
21 Apr 2025
Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning
Jonathan Shaki
Emanuele La Malfa
Michael Wooldridge
Sarit Kraus
LRM, ReLM
160
0
0
13 Mar 2025
Learning richness modulates equality reasoning in neural networks
William L. Tong
Cengiz Pehlevan
71
0
0
12 Mar 2025
Constructions are Revealed in Word Distributions
J. Rozner
Leonie Weissweiler
Kyle Mahowald
Cory Shain
89
1
0
08 Mar 2025
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models
Yukang Yang
Declan Campbell
Kaixuan Huang
Mengdi Wang
Jonathan D. Cohen
Taylor Webb
LRM
195
5
0
27 Feb 2025
Exploring and Controlling Diversity in LLM-Agent Conversation
Kuanchao Chu
Yi-Pei Chen
Hideki Nakayama
LLMAG
156
1
0
24 Feb 2025
Reasoning about Affordances: Causal and Compositional Reasoning in LLMs
Magnus F. Gjerde
Vanessa Cheung
David Lagnado
ReLM, LRM
109
0
0
23 Feb 2025
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models
Martina Miliani
S. Auriemma
Alessandro Bondielli
Emmanuele Chersoni
Lucia Passaro
Irene Sucameli
Alessandro Lenci
LRM, ELM
90
0
0
21 Feb 2025
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Rose E. Wang
Ana T. Ribeiro
Carly Robinson
Susanna Loeb
Dora Demszky
162
17
0
28 Jan 2025
Quantifying artificial intelligence through algorithmic generalization
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
126
0
0
08 Nov 2024
United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections
Leah von der Heyde
Anna Haensch
Alexander Wenz
Bolei Ma
153
2
0
29 Aug 2024
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
Zhonghao He
Jascha Achterberg
Katie Collins
Kevin K. Nejad
Danyal Akarca
...
Chole Li
Kai J. Sandbrink
Stephen Casper
Anna Ivanova
Grace W. Lindsay
AI4CE
110
2
0
22 Aug 2024
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding
Renato Vukovic
David Arps
Carel van Niekerk
Benjamin Matthias Ruppik
Hsien-chin Lin
Michael Heck
Milica Gašić
107
1
0
05 Aug 2024
The Remarkable Robustness of LLMs: Stages of Inference?
Vedang Lad
Wes Gurnee
Max Tegmark
138
53
0
27 Jun 2024
What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering
Federico Errica
G. Siracusano
D. Sanvito
Roberto Bifulco
164
26
0
18 Jun 2024
Does learning the right latent variables necessarily improve in-context learning?
Sarthak Mittal
Eric Elmoznino
Léo Gagnon
Sangnie Bhardwaj
Tom Marty
Dhanya Sridhar
Guillaume Lajoie
96
7
0
29 May 2024
Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model
Siyang Wang
Éva Székely
107
6
0
16 May 2024
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs
Aaditya K. Singh
DJ Strouse
122
61
0
22 Feb 2024
Properties and Challenges of LLM-Generated Explanations
Jenny Kunz
Marco Kuhlmann
97
24
0
16 Feb 2024
Incoherent Probability Judgments in Large Language Models
Jian-Qiao Zhu
Thomas Griffiths
171
8
0
30 Jan 2024
Physics simulation capabilities of LLMs
M. Ali-Dib
Kristen Menou
ELM, AI4CE
48
0
0
04 Dec 2023
Large language models predict human sensory judgments across six modalities
Raja Marjieh
Ilia Sucholutsky
Pol van Rijn
Nori Jacoby
Thomas Griffiths
VLM
98
44
0
02 Feb 2023