ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.04776
  4. Cited By
Chain of Thoughtlessness? An Analysis of CoT in Planning
v1v2v3 (latest)

Chain of Thoughtlessness? An Analysis of CoT in Planning

8 May 2024
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
    LRMLM&Ro
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Chain of Thoughtlessness? An Analysis of CoT in Planning"

50 / 102 papers shown
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1
  on PlanBench
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
LLMAGLRMELM
405
89
0
20 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoningInternational Conference on Learning Representations (ICLR), 2024
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLMLRM
637
232
0
18 Sep 2024
EVINCE: Optimizing Multi-LLM Dialogues Using Conditional Statistics and Information Theory
EVINCE: Optimizing Multi-LLM Dialogues Using Conditional Statistics and Information Theory
Edward Y. Chang
AAML
136
0
0
26 Aug 2024
Algorithmic Language Models with Neurally Compiled Libraries
Algorithmic Language Models with Neurally Compiled Libraries
Lucas Saldyt
Subbarao Kambhampati
LRM
323
0
0
06 Jul 2024
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought:
  Probability, Memorization, and Noisy Reasoning
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Akshara Prabhakar
Thomas Griffiths
R. Thomas McCoy
LRM
265
30
0
01 Jul 2024
Cognitive Map for Language Models: Optimal Planning via Verbally
  Representing the World Model
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model
Doyoung Kim
Jongwon Lee
Jinho Park
Minjoon Seo
LM&Ro
317
1
0
21 Jun 2024
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
Chaojie Wang
Yanchen Deng
Zhiyi Lyu
Liang Zeng
Jujie He
Shuicheng Yan
Bo An
LRMReLM
341
94
0
20 Jun 2024
Exploring and Benchmarking the Planning Capabilities of Large Language
  Models
Exploring and Benchmarking the Planning Capabilities of Large Language Models
Bernd Bohnet
Azade Nova
Aaron T Parisi
Kevin Swersky
Katayoon Goshvadi
Hanjun Dai
Dale Schuurmans
Noah Fiedel
Hanie Sedghi
187
17
0
18 Jun 2024
Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning
Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning
Atharva Gundawar
Mudit Verma
L. Guan
Kaya Stechly
Siddhant Bhambri
Subbarao Kambhampati
182
35
0
31 May 2024
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated ResponsesAAAI Conference on Artificial Intelligence (AAAI), 2024
Dongwei Jiang
Jingyu Zhang
Orion Weller
Nathaniel Weir
Benjamin Van Durme
Daniel Khashabi
225
12
0
04 Apr 2024
Multi-Conditional Ranking with Large Language Models
Multi-Conditional Ranking with Large Language Models
Pouya Pezeshkpour
Estevam R. Hruschka
LRM
179
1
0
30 Mar 2024
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought
  Prompting
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
LRMAI4CEReLM
163
19
0
21 Mar 2024
Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of
  Prompting Strategies
Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies
Flavio Petruzzellis
Alberto Testolin
A. Sperduti
ELM
313
15
0
27 Feb 2024
How Interpretable are Reasoning Explanations from Prompting Large
  Language Models?
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Yeo Wei Jie
Frank Xing
Rick Mong
Xiaoshi Zhong
ReLMLRM
328
37
0
19 Feb 2024
On the Self-Verification Limitations of Large Language Models on
  Reasoning and Planning Tasks
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Kaya Stechly
Kaya Stechly
Subbarao Kambhampati
ReLMLRM
178
98
0
12 Feb 2024
Efficient Tool Use with Chain-of-Abstraction Reasoning
Efficient Tool Use with Chain-of-Abstraction Reasoning
Silin Gao
Jane Dwivedi-Yu
Ping Yu
X. Tan
Ramakanth Pasunuru
O. Yu. Golovneva
Koustuv Sinha
Asli Celikyilmaz
Antoine Bosselut
Tianlu Wang
LRM
356
35
0
30 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Demystifying Chains, Trees, and Graphs of ThoughtsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CELRM
1.0K
52
0
25 Jan 2024
A Closer Look at the Self-Verification Abilities of Large Language
  Models in Logical Reasoning
A Closer Look at the Self-Verification Abilities of Large Language Models in Logical ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Ruixin Hong
Hongming Zhang
Xinyu Pang
Dong Yu
Changshui Zhang
LRM
225
43
0
14 Nov 2023
KITAB: Evaluating LLMs on Constraint Satisfaction for Information
  Retrieval
KITAB: Evaluating LLMs on Constraint Satisfaction for Information RetrievalInternational Conference on Learning Representations (ICLR), 2023
Marah Abdin
Suriya Gunasekar
Varun Chandrasekaran
Jerry Li
Mert Yuksekgonul
Rahee Peshawaria
Ranjita Naik
Besmira Nushi
177
14
0
24 Oct 2023
Large Language Models Cannot Self-Correct Reasoning Yet
Large Language Models Cannot Self-Correct Reasoning YetInternational Conference on Learning Representations (ICLR), 2023
Jie Huang
Xinyun Chen
Swaroop Mishra
Huaixiu Steven Zheng
Adams Wei Yu
Xinying Song
Denny Zhou
ReLMLRM
519
696
0
03 Oct 2023
Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in
  Language Model Prompting
Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting
Rylan Schaeffer
Kateryna Pistunova
Samarth Khanna
Sarthak Consul
Oluwasanmi Koyejo
ReLMLRM
125
13
0
20 Jul 2023
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Boosting Language Models Reasoning with Chain-of-Knowledge PromptingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Jiadong Wang
Qiushi Sun
Xiang Li
Ming Gao
ReLMLRM
311
105
0
10 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Deductive Verification of Chain-of-Thought ReasoningNeural Information Processing Systems (NeurIPS), 2023
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLMLRM
496
194
0
06 Jun 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on CompositionalityNeural Information Processing Systems (NeurIPS), 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLMLRM
519
497
0
29 May 2023
On the Planning Abilities of Large Language Models : A Critical
  Investigation
On the Planning Abilities of Large Language Models : A Critical InvestigationNeural Information Processing Systems (NeurIPS), 2023
Kaya Stechly
Matthew Marquez
S. Sreedharan
Subbarao Kambhampati
LLMAGLRM
270
340
0
25 May 2023
Towards Revealing the Mystery behind Chain of Thought: A Theoretical
  Perspective
Towards Revealing the Mystery behind Chain of Thought: A Theoretical PerspectiveNeural Information Processing Systems (NeurIPS), 2023
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
LRM
649
354
0
24 May 2023
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate
Improving Factuality and Reasoning in Language Models through Multiagent DebateInternational Conference on Machine Learning (ICML), 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAGLRM
351
1,182
0
23 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive
  Critiquing
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive CritiquingInternational Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELMLRM
392
584
0
19 May 2023
Language Models Don't Always Say What They Think: Unfaithful
  Explanations in Chain-of-Thought Prompting
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought PromptingNeural Information Processing Systems (NeurIPS), 2023
Miles Turpin
Julian Michael
Ethan Perez
Sam Bowman
ReLMLRM
533
725
0
07 May 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
4.6K
20,902
0
15 Mar 2023
Faithful Chain-of-Thought Reasoning
Faithful Chain-of-Thought ReasoningInternational Joint Conference on Natural Language Processing (IJCNLP), 2023
Qing Lyu
Shreya Havaldar
Adam Stein
Li Zhang
D. Rao
Eric Wong
Marianna Apidianaki
Chris Callison-Burch
ReLMLRM
456
317
0
31 Jan 2023
Reasoning with Language Model Prompting: A Survey
Reasoning with Language Model Prompting: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Shuofei Qiao
Yixin Ou
Ningyu Zhang
Xiang Chen
Yunzhi Yao
Shumin Deng
Chuanqi Tan
Fei Huang
Huajun Chen
ReLMELMLRM
707
395
0
19 Dec 2022
Teaching Algorithmic Reasoning via In-context Learning
Teaching Algorithmic Reasoning via In-context Learning
Hattie Zhou
Azade Nova
Hugo Larochelle
Rameswar Panda
Behnam Neyshabur
Hanie Sedghi
LRMReLM
254
130
0
15 Nov 2022
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve ThemAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
...
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
ALMELMLRMReLM
518
1,558
0
17 Oct 2022
Automatic Chain of Thought Prompting in Large Language Models
Automatic Chain of Thought Prompting in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLMLRM
496
852
0
07 Oct 2022
Language Models are Multilingual Chain-of-Thought Reasoners
Language Models are Multilingual Chain-of-Thought ReasonersInternational Conference on Learning Representations (ICLR), 2022
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLMLRM
587
492
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAGReLMLRM
2.5K
5,256
0
06 Oct 2022
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of
  Chain-of-Thought
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-ThoughtInternational Conference on Learning Representations (ICLR), 2022
Abulhair Saparov
He He
ELMLRMReLM
850
422
0
03 Oct 2022
Faithful Reasoning Using Large Language Models
Faithful Reasoning Using Large Language Models
Antonia Creswell
Murray Shanahan
ReLMLRM
193
139
0
30 Aug 2022
Limitations of Language Models in Arithmetic and Symbolic Induction
Limitations of Language Models in Arithmetic and Symbolic InductionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jingu Qian
Hong Wang
Zekun Li
Shiyang Li
Xifeng Yan
ReLMLRM
318
85
0
09 Aug 2022
Exploring Length Generalization in Large Language Models
Exploring Length Generalization in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLMLRM
348
211
0
11 Jul 2022
PlanBench: An Extensible Benchmark for Evaluating Large Language Models
  on Planning and Reasoning about Change
PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about ChangeNeural Information Processing Systems (NeurIPS), 2022
Kaya Stechly
Matthew Marquez
Alberto Olmo
S. Sreedharan
Subbarao Kambhampati
ReLMLRM
343
329
0
21 Jun 2022
Beyond the Imitation Game: Quantifying and extrapolating the
  capabilities of language models
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aarohi Srivastava
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
...
Zhuoye Zhao
Zijian Wang
Zijie J. Wang
Zirui Wang
Ziyi Wu
ELM
697
2,150
0
09 Jun 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot ReasonersNeural Information Processing Systems (NeurIPS), 2022
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLMLRM
1.4K
6,087
0
24 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language
  Models
Least-to-Most Prompting Enables Complex Reasoning in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALMLRMAI4CE
659
1,483
0
21 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLMBDLLRMAI4CE
2.7K
5,537
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&RoLRMAI4CEReLM
2.3K
14,449
0
28 Jan 2022
Show Your Work: Scratchpads for Intermediate Computation with Language
  Models
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
...
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
ReLMLRM
544
920
0
30 Nov 2021
Training Verifiers to Solve Math Word Problems
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLMOffRLLRM
1.1K
6,810
0
27 Oct 2021
A Diverse Corpus for Evaluating and Developing English Math Word Problem
  Solvers
A Diverse Corpus for Evaluating and Developing English Math Word Problem SolversAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Shen-Yun Miao
Chao-Chun Liang
Keh-Yih Su
275
418
0
30 Jun 2021
Previous
123
Next