Faith and Fate: Limits of Transformers on Compositionality

29 May 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jiang
Bill Yuchen Lin
Peter West
Chandra Bhagavatula
Ronan Le Bras
Jena D. Hwang
Soumya Sanyal
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
    ReLM
    LRM

Papers citing "Faith and Fate: Limits of Transformers on Compositionality"

50 / 244 papers shown
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology
Junior Cedric Tonga
Benjamin Clément
Pierre-Yves Oudeyer
LRM
28
2
0
05 Nov 2024
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models
Jonas Zausinger
Lars Pennig
Kacper Chlodny
Vincent Limbach
Anna Ketteler
Thorben Prein
Vishwa Mohan Singh
Michael Morris Danziger
Jannis Born
14
0
0
04 Nov 2024
Provable Length Generalization in Sequence Prediction via Spectral Filtering
Annie Marsden
Evan Dogariu
Naman Agarwal
Xinyi Chen
Daniel Suo
Elad Hazan
32
1
0
01 Nov 2024
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Arash Marioriyad
Parham Rezaei
M. Baghshah
M. Rohban
CoGe
51
0
0
30 Oct 2024
On Memorization of Large Language Models in Logical Reasoning
Chulin Xie
Yangsibo Huang
Chiyuan Zhang
Da Yu
Xinyun Chen
Bill Yuchen Lin
Bo Li
Badih Ghazi
Ravi Kumar
LRM
41
20
0
30 Oct 2024
Natural Language Inference Improves Compositionality in Vision-Language Models
Paola Cascante-Bonilla
Yu Hou
Yang Trista Cao
Hal Daumé III
Rachel Rudinger
ReLM
CoGe
VLM
33
3
0
29 Oct 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Zhengkai Lin
Z. Fu
Kai Liu
Liang Xie
Binbin Lin
Wenxiao Wang
D. Cai
Yue Wu
Jieping Ye
LRM
23
3
0
24 Oct 2024
A Comprehensive Evaluation of Cognitive Biases in LLMs
Simon Malberg
Roman Poletukhin
Carolin M. Schuster
Georg Groh
ELM
32
5
0
20 Oct 2024
Supervised Chain of Thought
Xiang Zhang
Dujian Ding
LRM
AI4CE
18
1
0
18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye
Jiahui Gao
Shansan Gong
Lin Zheng
Xin Jiang
Z. Li
Lingpeng Kong
DiffM
LRM
37
15
0
18 Oct 2024
Enhancing Generalization in Sparse Mixture of Experts Models: The Case for Increased Expert Activation in Compositional Tasks
Jinze Zhao
MoE
19
0
0
17 Oct 2024
How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs
Guhao Feng
Kai-Bo Yang
Yuntian Gu
Xinyue Ai
Shengjie Luo
Jiacheng Sun
Di He
Z. Li
Liwei Wang
LRM
25
1
0
17 Oct 2024
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Minseok Choi
C. Park
Dohyun Lee
Jaegul Choo
KELM
MU
18
0
0
17 Oct 2024
Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao-quan Song
Yufa Zhou
44
15
0
12 Oct 2024
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
Zirui Zhao
Hanze Dong
Amrita Saha
Caiming Xiong
Doyen Sahoo
LRM
27
3
0
10 Oct 2024
The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning
Xiyan Fu
Anette Frank
LRM
23
0
0
08 Oct 2024
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets
Yuandong Tian
41
0
0
02 Oct 2024
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
Lingfeng Zhang
Yuening Wang
Hongjian Gu
Atia Hamidizadeh
Zhanguang Zhang
...
Tongtong Cao
Yuzheng Zhuang
Yingxue Zhang
Jianye Hao
LM&Ro
31
0
0
02 Oct 2024
Can Models Learn Skill Composition from Examples?
Haoyu Zhao
Simran Kaur
Dingli Yu
Anirudh Goyal
Sanjeev Arora
CoGe
MoE
48
2
0
29 Sep 2024
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
Ye Liu
Zongyang Ma
Zhongang Qi
Yang Wu
Ying Shan
Chang Wen Chen
18
15
0
26 Sep 2024
Compositional Hardness of Code in Large Language Models -- A Probabilistic Perspective
Yotam Wolf
Binyamin Rothberg
Dorin Shteyman
Amnon Shashua
13
0
0
26 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
90
79
0
18 Sep 2024
Fast Analysis of the OpenAI O1-Preview Model in Solving Random K-SAT Problem: Does the LLM Solve the Problem Itself or Call an External SAT Solver?
Raffaele Marino
ReLM
LRM
16
2
0
17 Sep 2024
Semformer: Transformer Language Models with Semantic Planning
Yongjing Yin
Junran Ding
Kai Song
Yue Zhang
29
0
0
17 Sep 2024
Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Sina Malakouti
Aysan Aghazadeh
Ashmit Khandelwal
Adriana Kovashka
VLM
26
2
0
16 Sep 2024
Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Kulin Shah
Nishanth Dikkala
Xin Wang
Rina Panigrahy
ELM
ReLM
LRM
21
9
0
16 Sep 2024
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
42
9
0
06 Sep 2024
Beyond Preferences in AI Alignment
Tan Zhi-Xuan
Micah Carroll
Matija Franklin
Hal Ashton
23
16
0
30 Aug 2024
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems
Ding Kai
Ma Zhenguo
Yan Xiaoran
LRM
20
0
0
29 Aug 2024
Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory
Simon Münker
SyDa
19
0
0
21 Aug 2024
Inductive Learning of Logical Theories with LLMs: An Expressivity-Graded Analysis
Joao Pedro Gandarela
Danilo S. Carvalho
André Freitas
14
0
0
15 Aug 2024
Can Large Language Models Reason? A Characterization via 3-SAT
Rishi Hazra
Gabriele Venturato
Pedro Zuidberg Dos Martires
Luc de Raedt
ELM
ReLM
LRM
17
4
0
13 Aug 2024
Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities
Shivam Chandhok
Wan-Cyuan Fan
Leonid Sigal
VLM
MLLM
18
3
0
13 Aug 2024
Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers
MohammadReza Ebrahimi
Sunny Panchal
Roland Memisevic
25
2
0
10 Aug 2024
Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes
Dimitris Angelís
Prodromos Kolyvakis
Manos N. Kamarianakis
George Papagiannakis
27
1
0
05 Aug 2024
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGe
LRM
19
27
0
22 Jul 2024
Mechanistically Interpreting a Transformer-based 2-SAT Solver: An Axiomatic Approach
Nils Palumbo
Ravi Mangal
Zifan Wang
Saranya Vijayakumar
Corina S. Pasareanu
Somesh Jha
34
1
0
18 Jul 2024
From Words to Worlds: Compositionality for Cognitive Architectures
Ruchira Dhar
Anders Sogaard
24
0
0
18 Jul 2024
Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models
Sadegh Mahdavi
Raquel Aoki
Keyi Tang
Yanshuai Cao
LLMAG
25
0
0
17 Jul 2024
Reliable Reasoning Beyond Natural Language
Nasim Borazjanizadeh
Steven T Piantadosi
LRM
ReLM
32
5
0
16 Jul 2024
Transforming Agency. On the mode of existence of Large Language Models
Xabier E. Barandiaran
Lola S. Almendros
LLMAG
LM&Ro
32
4
0
15 Jul 2024
LLM-Collaboration on Automatic Science Journalism for the General Audience
Gongyao Jiang
Xinran Shi
Qiong Luo
18
3
0
13 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
29
12
0
06 Jul 2024
Algorithmic Language Models with Neurally Compiled Libraries
Lucas Saldyt
Subbarao Kambhampati
LRM
43
0
0
06 Jul 2024
Universal Length Generalization with Turing Programs
Kaiying Hou
David Brandfonbrener
Sham Kakade
Samy Jelassi
Eran Malach
27
1
0
03 Jul 2024
Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling
Margaret Li
Weijia Shi
Artidoro Pagnoni
Peter West
Ari Holtzman
35
6
0
02 Jul 2024
MatText: Do Language Models Need More than Text & Scale for Materials Modeling?
Nawaf Alampara
Santiago Miret
K. Jablonka
37
8
0
25 Jun 2024
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
32
7
0
25 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
33
45
0
24 Jun 2024
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model
Doyoung Kim
Jongwon Lee
Jinho Park
Minjoon Seo
LM&Ro
25
0
0
21 Jun 2024