Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.18654
Cited By
v1
v2
v3 (latest)
Faith and Fate: Limits of Transformers on Compositionality
Neural Information Processing Systems (NeurIPS), 2023
29 May 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
Bill Yuchen Lin
Peter West
Chandra Bhagavatula
Ronan Le Bras
Jena D. Hwang
Soumya Sanyal
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Faith and Fate: Limits of Transformers on Compositionality"
50 / 327 papers shown
Title
Position as Probability: Self-Supervised Transformers that Think Past Their Training for Length Extrapolation
Philip Heejun Lee
74
0
0
24 Dec 2025
AsymPuzl: An Asymmetric Puzzle for multi-agent cooperation
Xavier F. Cadet
Edward Koh
Peter Chin
LLMAG
44
0
0
03 Dec 2025
When Do Symbolic Solvers Enhance Reasoning in Large Language Models?
Zhiyuan He
Dingmin Wang
ReLM
LRM
99
0
0
02 Dec 2025
Orthographic Constraint Satisfaction and Human Difficulty Alignment in Large Language Models
Bryan Edward Tuck
Rakesh M. Verma
ALM
145
0
0
26 Nov 2025
Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium
Akbar Anbar Jafari
G. Anbarjafari
36
0
0
26 Nov 2025
In-Context Compositional Learning via Sparse Coding Transformer
Wei Chen
Jingxi Yu
Zichen Miao
Qiang Qiu
136
0
0
25 Nov 2025
Cognitive Foundations for Reasoning and Their Manifestation in LLMs
Priyanka Kargupta
Shuyue Stella Li
Haocheng Wang
Jinu Lee
Shan Chen
...
Thomas L. Griffiths
Max Kleiman-Weiner
Jiawei Han
Asli Celikyilmaz
Yulia Tsvetkov
LRM
182
2
0
20 Nov 2025
Cognitive Maps in Language Models: A Mechanistic Analysis of Spatial Planning
Caroline Baumgartner
Eleanor Spens
Neil Burgess
Petru Manescu
LM&Ro
137
0
0
17 Nov 2025
Next-Latent Prediction Transformers Learn Compact World Models
Jayden Teoh
Manan Tomar
Kwangjun Ahn
E. Hu
Pratyusha Sharma
Riashat Islam
Alex Lamb
John Langford
108
0
0
08 Nov 2025
DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning
Lachlan McPheat
Navdeep Kaur
Robert E Blackwell
Alessandra Russo
Anthony G Cohn
Pranava Madhyastha
ReLM
CoGe
LRM
244
0
0
04 Nov 2025
The Ouroboros of Benchmarking: Reasoning Evaluation in an Era of Saturation
İbrahim Ethem Deveci
Duygu Ataman
ReLM
ALM
ELM
LRM
199
0
0
03 Nov 2025
Training LLMs Beyond Next Token Prediction - Filling the Mutual Information Gap
Chun-Hao Yang
Bo-Han Feng
Tzu-Yuan Lai
Yan Yu Chen
Yin-Kai Dean Huang
Shou-De Lin
64
0
0
31 Oct 2025
The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?
Zihan Pengmei
Costas Mavromatis
Zhengyuan Shen
Yunyi Zhang
V. Ioannidis
Huzefa Rangwala
LRM
86
0
0
28 Oct 2025
Can Language Models Compose Skills In-Context?
Zidong Liu
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
ReLM
CoGe
LRM
259
0
0
27 Oct 2025
When No Paths Lead to Rome: Benchmarking Systematic Neural Relational Reasoning
Anirban Das
Irtaza Khalid
Rafael Peñaloza
Steven Schockaert
GNN
3DV
434
0
0
27 Oct 2025
Once Upon an Input: Reasoning via Per-Instance Program Synthesis
Adam Stein
Neelay Velingker
Mayur Naik
Eric Wong
ReLM
LRM
152
0
0
26 Oct 2025
Reasoning Models Reason Well, Until They Don't
Revanth Rameshkumar
Jimson Huang
Yunxin Sun
Fei Xia
Abulhair Saparov
ReLM
LRM
100
0
0
25 Oct 2025
Measuring Reasoning in LLMs: a New Dialectical Angle
Soheil Abbasloo
LRM
116
0
0
20 Oct 2025
DAG-Math: Graph-Guided Mathematical Reasoning in LLMs
Yuanhe Zhang
Ilja Kuzborskij
Jason D. Lee
Chenlei Leng
Fanghui Liu
LRM
134
1
0
19 Oct 2025
Self-Verifying Reflection Helps Transformers with CoT Reasoning
Zhongwei Yu
Wannian Xia
Xue Yan
Bo Xu
Haifeng Zhang
Yali Du
Ning Yang
ReLM
LRM
101
0
0
14 Oct 2025
Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages
Nadine El-Naggar
Tatsuki Kuribayashi
Ted Briscoe
84
0
0
14 Oct 2025
RegexPSPACE: A Benchmark for Evaluating LLM Reasoning on PSPACE-complete Regex Problems
Hyundong Jin
Joonghyuk Hahn
Yo-Sub Han
LRM
69
0
0
10 Oct 2025
Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens
Mai AlKhamissi
Yunze Xiao
Badr AlKhamissi
Mona T. Diab
155
0
0
07 Oct 2025
Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization
Mohammad Mahdi Samiei Paqaleh
Arash Marioriyad
Arman Tahmasebi-Zadeh
Mohamadreza Fereydooni
Mahdi Ghaznavai
Mahdieh Soleymani Baghshah
116
0
0
06 Oct 2025
Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge
Charlie Masters
Advaith Vellanki
J. Shangguan
Bart Kultys
Jonathan Gilmore
Alastair Moore
Stefano V. Albrecht
142
4
0
02 Oct 2025
How Do Language Models Compose Functions?
Apoorv Khandelwal
Ellie Pavlick
KELM
CoGe
LRM
204
1
0
02 Oct 2025
Boosting Process-Correct CoT Reasoning by Modeling Solvability of Multiple-Choice QA
Raphael Schumann
Stefan Riezler
LRM
110
0
0
30 Sep 2025
Identity Bridge: Enabling Implicit Reasoning via Shared Latent Memory
Pengxiao Lin
Zheng Chen
Zhi-Qin John Xu
LRM
86
1
0
29 Sep 2025
Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification
X. Xu
Xin Li
Xingwei Qu
Jie Fu
Hang Zhao
CoGe
LRM
114
1
0
27 Sep 2025
Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error
Panagiotis Giannoulis
Yorgos Pantis
Christos Tzamos
116
0
0
26 Sep 2025
Review of Hallucination Understanding in Large Language and Vision Models
Zhengyi Ho
Siyuan Liang
D. Tao
VLM
LRM
138
0
0
26 Sep 2025
Variation in Verification: Understanding Verification Dynamics in Large Language Models
Yefan Zhou
Austin Xu
Yilun Zhou
Janvijay Singh
Jiang Gui
Shafiq Joty
LRM
164
3
0
22 Sep 2025
PiERN: Token-Level Routing for Integrating High-Precision Computation and Reasoning
Hengbo Xiao
Jingyuan Fan
Xin Tong
Jingzhao Zhang
Chao Lu
Guannan He
MoE
164
0
0
17 Sep 2025
Large Language Models Imitate Logical Reasoning, but at what Cost?
Lachlan McGinness
Peter Baumgartner
ReLM
LRM
ELM
AI4CE
184
2
0
16 Sep 2025
Is In-Context Learning Learning?
Adrian de Wynter
137
1
0
12 Sep 2025
COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization
Yassine Taoudi-Benchekroun
Klim Troyan
Pascal Sager
Stefan Gerber
Lukas Tuggener
Benjamin Grewe
LRM
97
0
0
05 Sep 2025
When LLM Meets Time Series: Can LLMs Perform Multi-Step Time Series Reasoning and Inference
Wen Ye
Jinbo Liu
Defu Cao
Wei Yang
Yan Liu
AI4TS
81
1
0
01 Sep 2025
Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston
Ambroise Odonnat
Charles Arnal
Vivien A. Cabannes
RALM
144
1
0
28 Aug 2025
Understanding Subword Compositionality of Large Language Models
Qiwei Peng
Yekun Chai
Anders Søgaard
92
1
0
25 Aug 2025
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
Ivan Rodkin
Daniil Orel
Konstantin Smirnov
Arman Bolatov
Bilal Elbouardi
...
Aydar Bulatov
Preslav Nakov
Timothy Baldwin
Artem Shelmanov
Mikhail Burtsev
LRM
213
0
0
22 Aug 2025
Dream 7B: Diffusion Large Language Models
Jiacheng Ye
Zhihui Xie
Lin Zheng
Lei Li
Zirui Wu
Xin Jiang
Zhenguo Li
Lingpeng Kong
DiffM
VLM
720
106
0
21 Aug 2025
TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting
Jiaming Leng
Yunying Bi
Chuan Qin
Bing Yin
Yanyong Zhang
Chao Wang
AI4TS
80
0
0
20 Aug 2025
Reinforced Context Order Recovery for Adaptive Reasoning and Planning
Long Ma
Fangwei Zhong
Yizhou Wang
LRM
76
2
0
18 Aug 2025
Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants
Alessio Galatolo
Luca Alberto Rappuoli
Katie Winkle
Meriem Beloucif
ELM
122
1
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
...
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
Junhao Song
ELM
182
3
0
17 Aug 2025
The Missing Reward: Active Inference in the Era of Experience
Bo Wen
72
1
0
07 Aug 2025
Topos Theory for Generative AI and LLMs
Sridhar Mahadevan
60
0
0
05 Aug 2025
Diagnosing Memorization in Chain-of-Thought Reasoning, One Token at a Time
Huihan Li
You Chen
Siyuan Wang
Yixin He
Ninareh Mehrabi
Rahul Gupta
Xiang Ren
LRM
177
1
0
04 Aug 2025
CompoST: A Benchmark for Analyzing the Ability of LLMs To Compositionally Interpret Questions in a QALD Setting
David Maria Schmidt
Raoul Schubert
Philipp Cimiano
CoGe
272
0
0
28 Jul 2025
Towards Consistent Long-Term Pose Generation
Yayuan Li
Filippos Bellos
Jason J. Corso
151
0
0
24 Jul 2025
1
2
3
4
5
6
7
Next