Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.18654
Cited By
v1
v2
v3 (latest)
Faith and Fate: Limits of Transformers on Compositionality
Neural Information Processing Systems (NeurIPS), 2023
29 May 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
Bill Yuchen Lin
Peter West
Chandra Bhagavatula
Ronan Le Bras
Jena D. Hwang
Soumya Sanyal
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Faith and Fate: Limits of Transformers on Compositionality"
50 / 327 papers shown
Title
Visualizing Thought: Conceptual Diagrams Enable Robust Planning in LMMs
Nasim Borazjanizadeh
Roei Herzig
Eduard Oks
Trevor Darrell
Rogerio Feris
Leonid Karlinsky
LRM
306
2
0
14 Mar 2025
Are formal and functional linguistic mechanisms dissociated in language models?
Michael Hanna
Sandro Pezzelle
Yonatan Belinkov
437
4
0
14 Mar 2025
Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models
International Conference on Learning Representations (ICLR), 2025
Shaotian Yan
Chen Shen
Wenxiao Wang
Liang Xie
Junjie Liu
Jieping Ye
ReLM
LRM
321
4
0
14 Mar 2025
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Arvid Frydenlund
LRM
510
2
0
13 Mar 2025
Out-of-Context Reasoning in Large Language Models
Jonathan Shaki
Emanuele La Malfa
Michael Wooldridge
Sarit Kraus
LRM
ReLM
382
0
0
13 Mar 2025
A Representationalist, Functionalist and Naturalistic Conception of Intelligence as a Foundation for AGI
Rolf Pfister
254
0
0
10 Mar 2025
AI-driven control of bioelectric signalling for real-time topological reorganization of cells
Gonçalo Hora de Carvalho
AI4CE
324
2
0
10 Mar 2025
MastermindEval: A Simple But Scalable Reasoning Benchmark
Jonas Golde
Patrick Haller
Fabio Barth
Alan Akbik
LRM
ReLM
ELM
572
4
0
07 Mar 2025
From Infants to AI: Incorporating Infant-like Learning in Models Boosts Efficiency and Generalization in Learning Social Prediction Tasks
Shify Treger
Shimon Ullman
239
0
0
05 Mar 2025
Structural Deep Encoding for Table Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Raphael Mouravieff
Benjamin Piwowarski
Sylvain Lamprier
LMTD
253
2
0
03 Mar 2025
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models
Yukang Yang
Declan Campbell
Kaixuan Huang
Mengdi Wang
Jonathan D. Cohen
Taylor Webb
LRM
424
15
0
27 Feb 2025
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Guizhen Chen
Weiwen Xu
Hao Zhang
Hou Pong Chan
Chaoqun Liu
Lidong Bing
Deli Zhao
Anh Tuan Luu
Yu Rong
ReLM
LRM
281
8
0
27 Feb 2025
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization
Ru Wang
Wei Huang
Selena Song
Haoyu Zhang
Yusuke Iwasawa
Y. Matsuo
Jiaxian Guo
OODD
LRM
372
5
0
25 Feb 2025
The Role of Sparsity for Length Generalization in Transformers
Noah Golowich
Samy Jelassi
David Brandfonbrener
Sham Kakade
Eran Malach
221
6
0
24 Feb 2025
Reasoning about Affordances: Causal and Compositional Reasoning in LLMs
Magnus F. Gjerde
Vanessa Cheung
David Lagnado
ReLM
LRM
247
0
0
23 Feb 2025
Stepwise Informativeness Search for Efficient and Effective LLM Reasoning
Siyuan Wang
Enda Zhao
Zhongyu Wei
Xiang Ren
LRM
202
2
0
21 Feb 2025
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks
Eva Sánchez Salido
Julio Gonzalo
Guillermo Marco
ELM
559
12
0
18 Feb 2025
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
International Conference on Learning Representations (ICLR), 2024
Andreas Opedal
Haruki Shirakami
Bernhard Schölkopf
Abulhair Saparov
Mrinmaya Sachan
LRM
391
7
0
17 Feb 2025
Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring
Alex Heyman
Joel Zylberberg
LRM
290
2
0
10 Feb 2025
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee
Ziyang Cai
Avi Schwarzschild
Kangwook Lee
Dimitris Papailiopoulos
ReLM
VLM
LRM
AI4CE
382
19
0
03 Feb 2025
Strassen Attention, Split VC Dimension and Compositionality in Transformers
Chris Köcher
Felipe Urrutia
Hector Jimenez
Tomasz Steifer
Germán Pizarro
Matías Fuentes
Francisco Meza
Cristian Buc
Cristóbal Rojas
371
4
0
31 Jan 2025
Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma
Richard Willis
Yali Du
Joel Z Leibo
Michael Luck
292
12
0
28 Jan 2025
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
International Conference on Learning Representations (ICLR), 2025
Yutong Yin
Zhaoran Wang
LRM
ReLM
1.1K
2
0
27 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
320
7
0
22 Jan 2025
Infinite Time Turing Machines and their Applications
Rukmal Weerawarana
Maxwell Braun
AI4CE
72
0
0
22 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
567
19
0
03 Jan 2025
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Jiajun Zhu
Peihao Wang
Ruisi Cai
Jason D. Lee
Pan Li
Liang Luo
KELM
350
4
0
01 Jan 2025
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
312
19
0
31 Dec 2024
Evolutionary Pre-Prompt Optimization for Mathematical Reasoning
Mathurin Videau
Alessandro Leite
Marc Schoenauer
O. Teytaud
ReLM
LRM
204
1
0
05 Dec 2024
Theoretical limitations of multi-layer Transformer
Lijie Chen
Binghui Peng
Hongxun Wu
AI4CE
435
21
0
04 Dec 2024
Learning Elementary Cellular Automata with Transformers
Mikhail Burtsev
406
2
0
02 Dec 2024
Sneaking Syntax into Transformer Language Models with Tree Regularization
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Ananjan Nandi
Christopher D. Manning
Shikhar Murty
311
1
0
28 Nov 2024
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Sohee Yang
Nora Kassner
E. Gribovskaya
Sebastian Riedel
Mor Geva
LRM
KELM
ReLM
363
16
0
25 Nov 2024
Lessons from Studying Two-Hop Latent Reasoning
Mikita Balesni
Tomek Korbak
Owain Evans
ReLM
LRM
401
0
0
25 Nov 2024
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
Neural Information Processing Systems (NeurIPS), 2024
Shantanu Jaiswal
Debaditya Roy
Basura Fernando
Cheston Tan
ReLM
LRM
323
5
0
20 Nov 2024
SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models
Neural Information Processing Systems (NeurIPS), 2024
Bardiya Akhbari
Manish Gawali
Nicholas A. Dronen
AAML
225
0
0
11 Nov 2024
Quantifying artificial intelligence through algorithmic generalization
Nature Machine Intelligence (Nat. Mach. Intell.), 2024
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
408
0
0
08 Nov 2024
A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning
Guan Zhe Hong
Nishanth Dikkala
Enming Luo
Cyrus Rashtchian
Xin Wang
Rina Panigrahy
OffRL
LRM
NAI
364
0
0
06 Nov 2024
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology
Junior Cedric Tonga
Benjamin Clément
Pierre-Yves Oudeyer
LRM
155
7
0
05 Nov 2024
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models
Jonas Zausinger
Lars Pennig
Anamarija Kozina
Sean Sdahl
Julian Sikora
...
Anna Ketteler
Thorben Prein
Vishwa Mohan Singh
Michael Morris Danziger
Jannis Born
337
5
0
04 Nov 2024
Provable Length Generalization in Sequence Prediction via Spectral Filtering
Annie Marsden
Evan Dogariu
Naman Agarwal
Xinyi Chen
Daniel Suo
Elad Hazan
322
1
0
01 Nov 2024
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Arash Marioriyad
Parham Rezaei
M. Baghshah
M. Rohban
CoGe
956
2
0
30 Oct 2024
On Memorization of Large Language Models in Logical Reasoning
Chulin Xie
Yangsibo Huang
Chiyuan Zhang
Da Yu
Xinyun Chen
Bill Yuchen Lin
Bo Li
Badih Ghazi
Ravi Kumar
LRM
432
92
0
30 Oct 2024
Natural Language Inference Improves Compositionality in Vision-Language Models
International Conference on Learning Representations (ICLR), 2024
Paola Cascante-Bonilla
Yu Hou
Yang Trista Cao
Hal Daumé III
Rachel Rudinger
ReLM
CoGe
VLM
267
5
0
29 Oct 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Neural Information Processing Systems (NeurIPS), 2024
Zhengkai Lin
Z. Fu
Kai Liu
Liang Xie
Binbin Lin
Wenxiao Wang
Xiaofei He
Yue Wu
Jieping Ye
LRM
350
7
0
24 Oct 2024
A Comprehensive Evaluation of Cognitive Biases in LLMs
Simon Malberg
Roman Poletukhin
Carolin M. Schuster
Georg Groh
ELM
277
15
0
20 Oct 2024
Supervised Chain of Thought
Xiang Zhang
Dujian Ding
LRM
AI4CE
101
3
0
18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
International Conference on Learning Representations (ICLR), 2024
Jiacheng Ye
Lei Li
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
LRM
522
69
0
18 Oct 2024
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Minseok Choi
C. Park
Dohyun Lee
Jaegul Choo
KELM
MU
117
4
0
17 Oct 2024
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Guhao Feng
Kai-Bo Yang
Yuntian Gu
Xinyue Ai
Shengjie Luo
Jiacheng Sun
Di He
Hao Sun
Liwei Wang
LRM
286
13
0
17 Oct 2024
Previous
1
2
3
4
5
6
7
Next