Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2405.04776
Cited By
v1
v2
v3 (latest)
Chain of Thoughtlessness? An Analysis of CoT in Planning
8 May 2024
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
LRM
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Chain of Thoughtlessness? An Analysis of CoT in Planning"
50 / 102 papers shown
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
LLMAG
LRM
ELM
405
89
0
20 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
International Conference on Learning Representations (ICLR), 2024
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
637
232
0
18 Sep 2024
EVINCE: Optimizing Multi-LLM Dialogues Using Conditional Statistics and Information Theory
Edward Y. Chang
AAML
136
0
0
26 Aug 2024
Algorithmic Language Models with Neurally Compiled Libraries
Lucas Saldyt
Subbarao Kambhampati
LRM
323
0
0
06 Jul 2024
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Akshara Prabhakar
Thomas Griffiths
R. Thomas McCoy
LRM
265
30
0
01 Jul 2024
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model
Doyoung Kim
Jongwon Lee
Jinho Park
Minjoon Seo
LM&Ro
317
1
0
21 Jun 2024
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
Chaojie Wang
Yanchen Deng
Zhiyi Lyu
Liang Zeng
Jujie He
Shuicheng Yan
Bo An
LRM
ReLM
341
94
0
20 Jun 2024
Exploring and Benchmarking the Planning Capabilities of Large Language Models
Bernd Bohnet
Azade Nova
Aaron T Parisi
Kevin Swersky
Katayoon Goshvadi
Hanjun Dai
Dale Schuurmans
Noah Fiedel
Hanie Sedghi
187
17
0
18 Jun 2024
Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning
Atharva Gundawar
Mudit Verma
L. Guan
Kaya Stechly
Siddhant Bhambri
Subbarao Kambhampati
182
35
0
31 May 2024
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
AAAI Conference on Artificial Intelligence (AAAI), 2024
Dongwei Jiang
Jingyu Zhang
Orion Weller
Nathaniel Weir
Benjamin Van Durme
Daniel Khashabi
225
12
0
04 Apr 2024
Multi-Conditional Ranking with Large Language Models
Pouya Pezeshkpour
Estevam R. Hruschka
LRM
179
1
0
30 Mar 2024
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
LRM
AI4CE
ReLM
163
19
0
21 Mar 2024
Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies
Flavio Petruzzellis
Alberto Testolin
A. Sperduti
ELM
313
15
0
27 Feb 2024
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Yeo Wei Jie
Frank Xing
Rick Mong
Xiaoshi Zhong
ReLM
LRM
328
37
0
19 Feb 2024
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
ReLM
LRM
178
98
0
12 Feb 2024
Efficient Tool Use with Chain-of-Abstraction Reasoning
Silin Gao
Jane Dwivedi-Yu
Ping Yu
X. Tan
Ramakanth Pasunuru
O. Yu. Golovneva
Koustuv Sinha
Asli Celikyilmaz
Antoine Bosselut
Tianlu Wang
LRM
356
35
0
30 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
1.0K
52
0
25 Jan 2024
A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Ruixin Hong
Hongming Zhang
Xinyu Pang
Dong Yu
Changshui Zhang
LRM
225
43
0
14 Nov 2023
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
International Conference on Learning Representations (ICLR), 2023
Marah Abdin
Suriya Gunasekar
Varun Chandrasekaran
Jerry Li
Mert Yuksekgonul
Rahee Peshawaria
Ranjita Naik
Besmira Nushi
177
14
0
24 Oct 2023
Large Language Models Cannot Self-Correct Reasoning Yet
International Conference on Learning Representations (ICLR), 2023
Jie Huang
Xinyun Chen
Swaroop Mishra
Huaixiu Steven Zheng
Adams Wei Yu
Xinying Song
Denny Zhou
ReLM
LRM
519
696
0
03 Oct 2023
Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting
Rylan Schaeffer
Kateryna Pistunova
Samarth Khanna
Sarthak Consul
Oluwasanmi Koyejo
ReLM
LRM
125
13
0
20 Jul 2023
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jiadong Wang
Qiushi Sun
Xiang Li
Ming Gao
ReLM
LRM
311
105
0
10 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLM
LRM
496
194
0
06 Jun 2023
Faith and Fate: Limits of Transformers on Compositionality
Neural Information Processing Systems (NeurIPS), 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
519
497
0
29 May 2023
On the Planning Abilities of Large Language Models : A Critical Investigation
Neural Information Processing Systems (NeurIPS), 2023
Kaya Stechly
Matthew Marquez
S. Sreedharan
Subbarao Kambhampati
LLMAG
LRM
270
340
0
25 May 2023
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective
Neural Information Processing Systems (NeurIPS), 2023
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
LRM
649
354
0
24 May 2023
Improving Factuality and Reasoning in Language Models through Multiagent Debate
International Conference on Machine Learning (ICML), 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
351
1,182
0
23 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
392
584
0
19 May 2023
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting
Neural Information Processing Systems (NeurIPS), 2023
Miles Turpin
Julian Michael
Ethan Perez
Sam Bowman
ReLM
LRM
533
725
0
07 May 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
4.6K
20,902
0
15 Mar 2023
Faithful Chain-of-Thought Reasoning
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Qing Lyu
Shreya Havaldar
Adam Stein
Li Zhang
D. Rao
Eric Wong
Marianna Apidianaki
Chris Callison-Burch
ReLM
LRM
456
317
0
31 Jan 2023
Reasoning with Language Model Prompting: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Shuofei Qiao
Yixin Ou
Ningyu Zhang
Xiang Chen
Yunzhi Yao
Shumin Deng
Chuanqi Tan
Fei Huang
Huajun Chen
ReLM
ELM
LRM
707
395
0
19 Dec 2022
Teaching Algorithmic Reasoning via In-context Learning
Hattie Zhou
Azade Nova
Hugo Larochelle
Rameswar Panda
Behnam Neyshabur
Hanie Sedghi
LRM
ReLM
254
130
0
15 Nov 2022
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
...
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
ALM
ELM
LRM
ReLM
518
1,558
0
17 Oct 2022
Automatic Chain of Thought Prompting in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLM
LRM
496
852
0
07 Oct 2022
Language Models are Multilingual Chain-of-Thought Reasoners
International Conference on Learning Representations (ICLR), 2022
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLM
LRM
587
492
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
2.5K
5,256
0
06 Oct 2022
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought
International Conference on Learning Representations (ICLR), 2022
Abulhair Saparov
He He
ELM
LRM
ReLM
850
422
0
03 Oct 2022
Faithful Reasoning Using Large Language Models
Antonia Creswell
Murray Shanahan
ReLM
LRM
193
139
0
30 Aug 2022
Limitations of Language Models in Arithmetic and Symbolic Induction
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jingu Qian
Hong Wang
Zekun Li
Shiyang Li
Xifeng Yan
ReLM
LRM
318
85
0
09 Aug 2022
Exploring Length Generalization in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
348
211
0
11 Jul 2022
PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change
Neural Information Processing Systems (NeurIPS), 2022
Kaya Stechly
Matthew Marquez
Alberto Olmo
S. Sreedharan
Subbarao Kambhampati
ReLM
LRM
343
329
0
21 Jun 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aarohi Srivastava
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
...
Zhuoye Zhao
Zijian Wang
Zijie J. Wang
Zirui Wang
Ziyi Wu
ELM
697
2,150
0
09 Jun 2022
Large Language Models are Zero-Shot Reasoners
Neural Information Processing Systems (NeurIPS), 2022
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
1.4K
6,087
0
24 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALM
LRM
AI4CE
659
1,483
0
21 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
2.7K
5,537
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.3K
14,449
0
28 Jan 2022
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
...
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
ReLM
LRM
544
920
0
30 Nov 2021
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
1.1K
6,810
0
27 Oct 2021
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Shen-Yun Miao
Chao-Chun Liang
Keh-Yih Su
275
418
0
30 Jun 2021
Previous
1
2
3
Next