Faith and Fate: Limits of Transformers on Compositionality

Neural Information Processing Systems (NeurIPS), 2023

29 May 2023

Xiang Lorraine Li

Xiang Ren

Yejin Choi

Papers citing "Faith and Fate: Limits of Transformers on Compositionality"

50 / 328 papers shown

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts

350

17 Mar 2025

Are formal and functional linguistic mechanisms dissociated in language models?

Michael Hanna

Sandro Pezzelle

Yonatan Belinkov

538

14 Mar 2025

Visualizing Thought: Conceptual Diagrams Enable Robust Planning in LMMs

380

14 Mar 2025

Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models

International Conference on Learning Representations (ICLR), 2025

348

14 Mar 2025

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Arvid Frydenlund

LRM

560

13 Mar 2025

Out-of-Context Reasoning in Large Language Models

445

13 Mar 2025

A Representationalist, Functionalist and Naturalistic Conception of Intelligence as a Foundation for AGI

Rolf Pfister

301

10 Mar 2025

AI-driven control of bioelectric signalling for real-time topological reorganization of cells

Gonçalo Hora de Carvalho

AI4CE

387

10 Mar 2025

MastermindEval: A Simple But Scalable Reasoning Benchmark

646

07 Mar 2025

From Infants to AI: Incorporating Infant-like Learning in Models Boosts Efficiency and Generalization in Learning Social Prediction Tasks

Shify Treger

Shimon Ullman

255

05 Mar 2025

Structural Deep Encoding for Table Question Answering

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

281

03 Mar 2025

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

448

27 Feb 2025

FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

300

27 Feb 2025

Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization

411

25 Feb 2025

The Role of Sparsity for Length Generalization in Transformers

237

24 Feb 2025

Reasoning about Affordances: Causal and Compositional Reasoning in LLMs

302

23 Feb 2025

Stepwise Informativeness Search for Efficient and Effective LLM Reasoning

230

21 Feb 2025

None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks

627

18 Feb 2025

MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs

International Conference on Learning Representations (ICLR), 2024

434

17 Feb 2025

Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring

Alex Heyman

Joel Zylberberg

LRM

302

10 Feb 2025

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Dimitris Papailiopoulos

ReLM VLM LRM AI4CE

423

03 Feb 2025

Strassen Attention, Split VC Dimension and Compositionality in Transformers

423

31 Jan 2025

Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma

319

28 Jan 2025

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?

International Conference on Learning Representations (ICLR), 2025

Yutong Yin

Zhaoran Wang

LRM ReLM

1.2K

27 Jan 2025

Evolution and The Knightian Blindspot of Machine Learning

338

22 Jan 2025

Infinite Time Turing Machines and their Applications

Rukmal Weerawarana

Maxwell Braun

AI4CE

22 Jan 2025

Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web

608

03 Jan 2025

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

375

01 Jan 2025

Out-of-distribution generalization via composition: a lens through induction heads in Transformers

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024

Jiajun Song

Zhuoyan Xu

Yiqiao Zhong

361

31 Dec 2024

Evolutionary Pre-Prompt Optimization for Mathematical Reasoning

237

05 Dec 2024

Theoretical limitations of multi-layer Transformer

468

04 Dec 2024

Learning Elementary Cellular Automata with Transformers

Mikhail Burtsev

427

02 Dec 2024

Sneaking Syntax into Transformer Language Models with Tree Regularization

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Ananjan Nandi

Christopher D. Manning

Shikhar Murty

335

28 Nov 2024

Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

417

25 Nov 2024

Lessons from Studying Two-Hop Latent Reasoning

468

25 Nov 2024

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios

Neural Information Processing Systems (NeurIPS), 2024

363

20 Nov 2024

SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models

Neural Information Processing Systems (NeurIPS), 2024

250

11 Nov 2024

Quantifying artificial intelligence through algorithmic generalization

Nature Machine Intelligence (Nat. Mach. Intell.), 2024

452

08 Nov 2024

A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning

442

06 Nov 2024

Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology

203

05 Nov 2024

Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models

...

Michael Morris Danziger

Jannis Born

407

04 Nov 2024

Provable Length Generalization in Sequence Prediction via Spectral Filtering

349

01 Nov 2024

Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models

Arash Marioriyad

977

30 Oct 2024

On Memorization of Large Language Models in Logical Reasoning

Chulin Xie

Bo Li

459

30 Oct 2024

Natural Language Inference Improves Compositionality in Vision-Language Models

International Conference on Learning Representations (ICLR), 2024

Paola Cascante-Bonilla

334

29 Oct 2024

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Neural Information Processing Systems (NeurIPS), 2024

390

24 Oct 2024

A Comprehensive Evaluation of Cognitive Biases in LLMs

331

20 Oct 2024

Supervised Chain of Thought

Xiang Zhang

Dujian Ding

LRM AI4CE

129

18 Oct 2024

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

International Conference on Learning Representations (ICLR), 2024

582

18 Oct 2024

Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning

164

17 Oct 2024