ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.18654
  4. Cited By
Faith and Fate: Limits of Transformers on Compositionality
v1v2v3 (latest)

Faith and Fate: Limits of Transformers on Compositionality

Neural Information Processing Systems (NeurIPS), 2023
29 May 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
Bill Yuchen Lin
Peter West
Chandra Bhagavatula
Ronan Le Bras
Jena D. Hwang
Soumya Sanyal
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
    ReLMLRM
ArXiv (abs)PDFHTMLHuggingFace (7 upvotes)

Papers citing "Faith and Fate: Limits of Transformers on Compositionality"

50 / 325 papers shown
Title
Interpreting token compositionality in LLMs: A robustness analysis
Interpreting token compositionality in LLMs: A robustness analysis
Nura Aljaafari
Danilo S. Carvalho
André Freitas
409
3
0
16 Oct 2024
Fine-grained Attention I/O Complexity: Comprehensive Analysis for
  Backward Passes
Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
Yufa Zhou
223
19
0
12 Oct 2024
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
Automatic Curriculum Expert Iteration for Reliable LLM ReasoningInternational Conference on Learning Representations (ICLR), 2024
Zirui Zhao
Hanze Dong
Amrita Saha
Caiming Xiong
Doyen Sahoo
LRM
320
13
0
10 Oct 2024
The Mystery of Compositional Generalization in Graph-based Generative
  Commonsense Reasoning
The Mystery of Compositional Generalization in Graph-based Generative Commonsense ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiyan Fu
Anette Frank
LRM
410
1
0
08 Oct 2024
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
Lingfeng Zhang
Yuening Wang
Hongjian Gu
Atia Hamidizadeh
Zhanguang Zhang
...
Tongtong Cao
Yuzheng Zhuang
Yingxue Zhang
Jianye Hao
Jianye Hao
LM&Ro
243
8
0
02 Oct 2024
Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets
Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets
Yuandong Tian
374
4
0
02 Oct 2024
Can Models Learn Skill Composition from Examples?
Can Models Learn Skill Composition from Examples?Neural Information Processing Systems (NeurIPS), 2024
Haoyu Zhao
Simran Kaur
Dingli Yu
Anirudh Goyal
Sanjeev Arora
CoGeMoE
357
13
0
29 Sep 2024
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
E.T. Bench: Towards Open-Ended Event-Level Video-Language UnderstandingNeural Information Processing Systems (NeurIPS), 2024
Ye Liu
Zongyang Ma
Chen Ma
Yang Wu
Ying Shan
Chang Wen Chen
251
50
0
26 Sep 2024
Compositional Hardness of Code in Large Language Models -- A Probabilistic Perspective
Compositional Hardness of Code in Large Language Models -- A Probabilistic Perspective
Yotam Wolf
Binyamin Rothberg
Dorin Shteyman
Amnon Shashua
337
1
0
26 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoningInternational Conference on Learning Representations (ICLR), 2024
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLMLRM
576
226
0
18 Sep 2024
Fast Analysis of the OpenAI O1-Preview Model in Solving Random K-SAT
  Problem: Does the LLM Solve the Problem Itself or Call an External SAT
  Solver?
Fast Analysis of the OpenAI O1-Preview Model in Solving Random K-SAT Problem: Does the LLM Solve the Problem Itself or Call an External SAT Solver?
Raffaele Marino
ReLMLRM
257
4
0
17 Sep 2024
Semformer: Transformer Language Models with Semantic Planning
Semformer: Transformer Language Models with Semantic PlanningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yongjing Yin
Junran Ding
Kai Song
Yue Zhang
303
6
0
17 Sep 2024
Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Benchmarking VLMs' Reasoning About Persuasive Atypical ImagesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Sina Malakouti
Aysan Aghazadeh
Ashmit Khandelwal
Adriana Kovashka
VLM
341
3
0
16 Sep 2024
Causal Language Modeling Can Elicit Search and Reasoning Capabilities on
  Logic Puzzles
Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic PuzzlesNeural Information Processing Systems (NeurIPS), 2024
Kulin Shah
Nishanth Dikkala
Xin Wang
Rina Panigrahy
ELMReLMLRM
212
24
0
16 Sep 2024
Large Language Models in Drug Discovery and Development: From Disease
  Mechanisms to Clinical Trials
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
212
44
0
06 Sep 2024
Beyond Preferences in AI Alignment
Beyond Preferences in AI AlignmentPhilosophical Studies (Philos. Stud.), 2024
Tan Zhi-Xuan
Micah Carroll
Matija Franklin
Hal Ashton
315
37
0
30 Aug 2024
Logic Contrastive Reasoning with Lightweight Large Language Model for
  Math Word Problems
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems
Ding Kai
Ma Zhenguo
Yan Xiaoran
LRM
160
0
0
29 Aug 2024
Political Bias in LLMs: Unaligned Moral Values in Agent-centric Simulations
Political Bias in LLMs: Unaligned Moral Values in Agent-centric SimulationsJournal for Language Technology and Computational Linguistics (JLCL), 2024
Simon Münker
SyDa
212
0
0
21 Aug 2024
Inductive Learning of Logical Theories with LLMs: An Expressivity-Graded Analysis
Inductive Learning of Logical Theories with LLMs: An Expressivity-Graded Analysis
Joao Pedro Gandarela
Danilo S. Carvalho
André Freitas
214
0
0
15 Aug 2024
Can Large Language Models Reason? A Characterization via 3-SAT
Can Large Language Models Reason? A Characterization via 3-SAT
Rishi Hazra
Gabriele Venturato
Pedro Zuidberg Dos Martires
Luc de Raedt
ELMReLMLRM
217
15
0
13 Aug 2024
Response Wide Shut: Surprising Observations in Basic Vision Language
  Model Capabilities
Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities
Shivam Chandhok
Wan-Cyuan Fan
Leonid Sigal
VLMMLLM
149
7
0
13 Aug 2024
Your Context Is Not an Array: Unveiling Random Access Limitations in
  Transformers
Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers
MohammadReza Ebrahimi
Sunny Panchal
Roland Memisevic
254
10
0
10 Aug 2024
Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes
Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes
Dimitris Angelís
Prodromos Kolyvakis
Manos N. Kamarianakis
242
3
0
05 Aug 2024
Do Large Language Models Have Compositional Ability? An Investigation
  into Limitations and Scalability
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGeLRM
342
52
0
22 Jul 2024
From Words to Worlds: Compositionality for Cognitive Architectures
From Words to Worlds: Compositionality for Cognitive Architectures
Ruchira Dhar
Anders Sogaard
266
2
0
18 Jul 2024
Validating Mechanistic Interpretations: An Axiomatic Approach
Validating Mechanistic Interpretations: An Axiomatic Approach
Nils Palumbo
Ravi Mangal
Zifan Wang
Saranya Vijayakumar
Corina S. Pasareanu
Somesh Jha
281
1
0
18 Jul 2024
Leveraging Environment Interaction for Automated PDDL Generation and
  Planning with Large Language Models
Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models
Sadegh Mahdavi
Raquel Aoki
Keyi Tang
Yanshuai Cao
LLMAG
117
0
0
17 Jul 2024
Reliable Reasoning Beyond Natural Language
Reliable Reasoning Beyond Natural Language
Nasim Borazjanizadeh
Steven T Piantadosi
LRMReLM
257
9
0
16 Jul 2024
Transforming Agency. On the mode of existence of Large Language Models
Transforming Agency. On the mode of existence of Large Language Models
Xabier E. Barandiaran
Lola S. Almendros
LLMAGLM&Ro
188
8
0
15 Jul 2024
LLM-Collaboration on Automatic Science Journalism for the General
  Audience
LLM-Collaboration on Automatic Science Journalism for the General Audience
Gongyao Jiang
Xinran Shi
Qiong Luo
160
3
0
13 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
333
35
0
06 Jul 2024
Algorithmic Language Models with Neurally Compiled Libraries
Algorithmic Language Models with Neurally Compiled Libraries
Lucas Saldyt
Subbarao Kambhampati
LRM
294
0
0
06 Jul 2024
Universal Length Generalization with Turing Programs
Universal Length Generalization with Turing Programs
Kaiying Hou
David Brandfonbrener
Sham Kakade
Samy Jelassi
Eran Malach
208
18
0
03 Jul 2024
Predicting vs. Acting: A Trade-off Between World Modeling & Agent
  Modeling
Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling
Margaret Li
Weijia Shi
Artidoro Pagnoni
Peter West
Ari Holtzman
223
16
0
02 Jul 2024
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning
  Graph
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
204
23
0
25 Jun 2024
Less can be more for predicting properties with large language models
Less can be more for predicting properties with large language models
Nawaf Alampara
Santiago Miret
Kevin Maik Jablonka
371
10
0
25 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large
  Language Models
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
350
109
0
24 Jun 2024
Cognitive Map for Language Models: Optimal Planning via Verbally
  Representing the World Model
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model
Doyoung Kim
Jongwon Lee
Jinho Park
Minjoon Seo
LM&Ro
270
1
0
21 Jun 2024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math
  Reasoning by Eight-Fold
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
Amrith Rajagopal Setlur
Saurabh Garg
Xinyang Geng
Naman Garg
Virginia Smith
Aviral Kumar
453
91
0
20 Jun 2024
Data-Centric AI in the Age of Large Language Models
Data-Centric AI in the Age of Large Language Models
Xinyi Xu
Zhaoxuan Wu
Rui Qiao
Arun Verma
Yao Shu
...
Xiaoqiang Lin
Wenyang Hu
Zhongxiang Dai
Pang Wei Koh
Bryan Kian Hsiang Low
ALM
344
4
0
20 Jun 2024
Hopping Too Late: Exploring the Limitations of Large Language Models on
  Multi-Hop Queries
Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop QueriesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Eden Biran
Daniela Gottesman
Sohee Yang
Mor Geva
Amir Globerson
LRM
215
64
0
18 Jun 2024
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in
  Large Language Models
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Philipp Mondorf
Barbara Plank
HILMLRM
236
10
0
18 Jun 2024
Transformers meet Neural Algorithmic Reasoners
Transformers meet Neural Algorithmic Reasoners
Wilfried Bounsi
Borja Ibarz
Andrew Dudzik
Jessica B. Hamrick
Larisa Markeeva
Alex Vitvitskyi
Razvan Pascanu
Petar Veličković
NAIAI4CELRM
228
11
0
13 Jun 2024
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and
  Image-to-Video Generation
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation
Weixi Feng
Jiachen Li
Michael Stephen Saxon
Tsu-Jui Fu
Wenhu Chen
William Yang Wang
EGVMVGen
202
26
0
12 Jun 2024
Advancing Annotation of Stance in Social Media Posts: A Comparative
  Analysis of Large Language Models and Crowd Sourcing
Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing
Mao Li
Frederick Conrad
242
3
0
11 Jun 2024
Attention as a Hypernetwork
Attention as a HypernetworkInternational Conference on Learning Representations (ICLR), 2024
Simon Schug
Seijin Kobayashi
Yassir Akram
João Sacramento
Razvan Pascanu
GNN
228
8
0
09 Jun 2024
Compositional Generalization with Grounded Language Models
Compositional Generalization with Grounded Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Sondre Wold
Étienne Simon
Lucas Georges Gabriel Charpentier
Egor V. Kostylev
Erik Velldal
Lilja Øvrelid
KELM
222
2
0
07 Jun 2024
The CLRS-Text Algorithmic Reasoning Language Benchmark
The CLRS-Text Algorithmic Reasoning Language Benchmark
Larisa Markeeva
Sean McLeish
Borja Ibarz
Wilfried Bounsi
Olga Kozlova
Alex Vitvitskyi
Charles Blundell
Tom Goldstein
Avi Schwarzschild
Petar Veličković
LRM
249
20
0
06 Jun 2024
On Limitation of Transformer for Learning HMMs
On Limitation of Transformer for Learning HMMs
Jiachen Hu
Qinghua Liu
Chi Jin
206
7
0
06 Jun 2024
Exact Conversion of In-Context Learning to Model Weights in
  Linearized-Attention Transformers
Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers
Brian K Chen
Tianyang Hu
Hui Jin
Hwee Kuan Lee
Kenji Kawaguchi
183
5
0
05 Jun 2024
Previous
1234567
Next