v1v2v3 (latest)

Faith and Fate: Limits of Transformers on Compositionality

Neural Information Processing Systems (NeurIPS), 2023

29 May 2023

Xiang Lorraine Li

Xiang Ren

Yejin Choi

ArXiv (abs)PDF HTML HuggingFace (7 upvotes)

Papers citing "Faith and Fate: Limits of Transformers on Compositionality"

50 / 328 papers shown

The CLRS-Text Algorithmic Reasoning Language Benchmark

Charles Blundell

Petar Veličković

269

06 Jun 2024

On Limitation of Transformer for Learning HMMs

Jiachen Hu

Qinghua Liu

Chi Jin

246

06 Jun 2024

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

223

05 Jun 2024

Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks

Andrew Gambardella

Yusuke Iwasawa

Yutaka Matsuo

LRM

149

04 Jun 2024

Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks

280

04 Jun 2024

Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

813

04 Jun 2024

ACCORD: Closing the Commonsense Measurability Gap

François Roewer-Després

356

04 Jun 2024

Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task

...

Bruno Régaldo-Saint Blancard

Kyunghyun Cho

Shirley Ho

177

30 May 2024

Language Models Need Inductive Biases to Count Inductively

Yingshan Chang

Yonatan Bisk

LRM

264

30 May 2024

Transformers Can Do Arithmetic with the Right Embeddings

...

181

27 May 2024

THREAD: Thinking Deeper with Recursive Spawning

316

27 May 2024

Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

369

26 May 2024

From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

516

24 May 2024

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

376

23 May 2024

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

Yuntian Deng

Yejin Choi

Stuart M. Shieber

ReLM LRM

281

120

23 May 2024

Investigating Symbolic Capabilities of Large Language Models

167

21 May 2024

A General Theory for Compositional Generalization

241

20 May 2024

LLM-Generated Black-box Explanations Can Be Adversarially Helpful

R. Ajwani

Shashidhar Reddy Javaji

Frank Rudzicz

Zining Zhu

AAML

292

10 May 2024

Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or MemorizingNeural Information Processing Systems (NeurIPS), 2024

600

08 May 2024

Chain of Thoughtlessness? An Analysis of CoT in Planning

Kaya Stechly

Subbarao Kambhampati

LRM LM&Ro

548

08 May 2024

Exploring the Compositional Deficiency of Large Language Models in Mathematical ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Xuanjing Huang

255

05 May 2024

What makes Models Compositional? A Theoretical View: With SupplementInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

277

02 May 2024

Transcrib3D: 3D Referring Expression Resolution through Large Language Models

Jiading Fang

Xiangshan Tan

Shengjie Lin

Igor Vasiljevic

Vitor Campagnolo Guizilini

Hongyuan Mei

Rares Andrei Ambrus

Gregory Shakhnarovich

Matthew R. Walter

LM&Ro

188

30 Apr 2024

OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics

280

29 Apr 2024

Language Models Still Struggle to Zero-shot Reason about Time Series

273

17 Apr 2024

On the Empirical Complexity of Reasoning and Planning in LLMs

237

17 Apr 2024

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

294

09 Apr 2024

Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

147

09 Apr 2024

Counting Like Transformers: Compiling Temporal Counting Logic Into Softmax Transformers

Andy Yang

David Chiang

225

05 Apr 2024

Iterated Learning Improves Compositionality in Large Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2024

259

02 Apr 2024

Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey

Philipp Mondorf

Barbara Plank

ELM LRM LM&MA

361

02 Apr 2024

A Theory for Length Generalization in Learning to Reason

Changnan Xiao

Bing Liu

LRM

340

31 Mar 2024

Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning CorpusACM Transactions on Intelligent Systems and Technology (ACM TIST), 2024

177

18 Mar 2024

Elephants Never Forget: Testing Language Models for Memorization of Tabular Data

229

11 Mar 2024

The pitfalls of next-token predictionInternational Conference on Machine Learning (ICML), 2024

Gregor Bachmann

Vaishnavh Nagarajan

470

131

11 Mar 2024

Will GPT-4 Run DOOM?IEEE Transactions on Games (IEEE Trans. Games), 2024

Adrian de Wynter

LM&Ro MLLM

203

08 Mar 2024

Exploring Continual Learning of Compositional Generalization in NLI

Xiyan Fu

Anette Frank

CLL LRM

250

07 Mar 2024

Learning to Use Tools via Cooperative and Interactive Agents

383

05 Mar 2024

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Ting-En Lin

Rui Yan

245

04 Mar 2024

Formulation Comparison for Timeline Construction using LLMs

292

01 Mar 2024

A Neural Rewriting System to Solve Algorithmic Problems

244

27 Feb 2024

Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections

Lingjun Zhao

Khanh Nguyen

Hal Daumé

229

26 Feb 2024

Do Large Language Models Latently Perform Multi-Hop Reasoning?

398

160

26 Feb 2024

Data Science with LLMs and Interpretable Models

Sebastian Bordt

Benjamin J. Lengerich

Harsha Nori

Rich Caruana

SyDa AI4CE

208

22 Feb 2024

Understanding and Patching Compositional Reasoning in LLMs

Defu Lian

232

22 Feb 2024

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

Siyuan Wang

Zhongyu Wei

Yejin Choi

Xiang Ren

ReLM ELM LRM

206

18 Feb 2024

Transformers Can Achieve Length Generalization But Not Robustly

285

14 Feb 2024

Using Counterfactual Tasks to Evaluate the Generality of Analogical Reasoning in Large Language Models

Martha Lewis

Melanie Mitchell

ELM LRM

251

14 Feb 2024

On Limitations of the Transformer Architecture

Binghui Peng

Srini Narayanan

Christos H. Papadimitriou

255

13 Feb 2024

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

Kaya Stechly

Subbarao Kambhampati

ReLM LRM

178

12 Feb 2024