ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.04091
  4. Cited By
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning
  by Large Language Models

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

6 May 2023
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-Wei Lee
Ee-Peng Lim
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models"

50 / 215 papers shown
Title
Software Performance Engineering for Foundation Model-Powered Software
  (FMware)
Software Performance Engineering for Foundation Model-Powered Software (FMware)
Haoxiang Zhang
Shi Chang
Arthur Leung
Kishanthan Thangarajah
Boyuan Chen
Hanan Lutfiyya
Ahmed E. Hassan
54
0
0
14 Nov 2024
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Fangyu Lei
Jixuan Chen
Yuxiao Ye
Ruisheng Cao
Dongchan Shin
...
Caiming Xiong
Ruoxi Sun
Qian Liu
Sida I. Wang
Tao Yu
LMTD
77
21
0
12 Nov 2024
Enhancing Security Control Production With Generative AI
Enhancing Security Control Production With Generative AI
Chen Ling
Mina Ghashami
Vianne Gao
Ali Torkamani
Ruslan Vaulin
...
Farhan Diwan
Malini SS
Mingrui Cheng
Shreya Tarur Kumar
Felix Candelario
23
0
0
06 Nov 2024
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning
Kiran Purohit
Venktesh V
Raghuram Devalla
Krishna Mohan Yerragorla
Sourangshu Bhattacharya
Avishek Anand
LRM
27
1
0
06 Nov 2024
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
Fu-Chieh Chang
Yu-Ting Lee
Hui-Ying Shih
Pei-Yuan Wu
Pei-Yuan Wu
OffRL
LRM
76
0
0
31 Oct 2024
OSCAR: Operating System Control via State-Aware Reasoning and
  Re-Planning
OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning
Xiaoqiang Wang
Bang Liu
LLMAG
LM&Ro
LRM
31
6
0
24 Oct 2024
Learning Mathematical Rules with Large Language Models
Learning Mathematical Rules with Large Language Models
Antoine Gorceix
Bastien Le Chenadec
Ahmad Rammal
N. Vadori
Manuela Veloso
18
1
0
22 Oct 2024
Towards Safer Heuristics With XPlain
Towards Safer Heuristics With XPlain
Pantea Karimi
Solal Pirelli
Siva Kesava Reddy Kakarla
Ryan Beckett
Santiago Segarra
Beibin Li
Pooria Namyar
Behnaz Arzani
34
0
0
19 Oct 2024
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Yujun Zhou
Jingdong Yang
Kehan Guo
Pin-Yu Chen
Tian Gao
...
Tian Gao
Werner Geyer
Nuno Moniz
Nitesh V Chawla
Xiangliang Zhang
33
4
0
18 Oct 2024
Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Zhenyu Wu
Qingkai Zeng
Z. Zhang
Zhaoxuan Tan
Chao Shen
Meng-Long Jiang
KELM
LRM
34
4
0
16 Oct 2024
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of
  Language Models for Math Reasoning
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Vernon Y.H. Toh
Deepanway Ghosal
Soujanya Poria
LRM
41
2
0
16 Oct 2024
FLARE: Faithful Logic-Aided Reasoning and Exploration
FLARE: Faithful Logic-Aided Reasoning and Exploration
Erik Arakelyan
Pasquale Minervini
Pat Verga
Patrick Lewis
Isabelle Augenstein
ReLM
LRM
59
2
0
14 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE
LRM
58
2
0
07 Oct 2024
Training Nonlinear Transformers for Chain-of-Thought Inference: A
  Theoretical Generalization Analysis
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
Hongkang Li
Meng Wang
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
LRM
19
5
0
03 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
23
4
0
02 Oct 2024
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Xiaosong Yuan
Chen Shen
Shaotian Yan
Xiaofeng Zhang
Liang Xie
Wenxiao Wang
Renchu Guan
Ying Wang
Jieping Ye
ReLM
LRM
44
4
0
30 Sep 2024
System-Level Defense against Indirect Prompt Injection Attacks: An
  Information Flow Control Perspective
System-Level Defense against Indirect Prompt Injection Attacks: An Information Flow Control Perspective
Fangzhou Wu
Ethan Cecchetti
Chaowei Xiao
29
12
0
27 Sep 2024
A Survey on the Honesty of Large Language Models
A Survey on the Honesty of Large Language Models
Siheng Li
Cheng Yang
Taiqiang Wu
Chufan Shi
Yuji Zhang
...
Jie Zhou
Yujiu Yang
Ngai Wong
Xixin Wu
Wai Lam
HILM
27
4
0
27 Sep 2024
A Survey on Complex Tasks for Goal-Directed Interactive Agents
A Survey on Complex Tasks for Goal-Directed Interactive Agents
Mareike Hartmann
Alexander Koller
LM&Ro
LLMAG
32
0
0
27 Sep 2024
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and
  Adaptive Disambiguate based Efficient Tree Search
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
Linzhuang Sun
Hao Liang
Jingxuan Wei
Bihui Yu
Conghui He
Zenan Zhou
Wentao Zhang
16
4
0
26 Sep 2024
InterMind: A Doctor-Patient-Family Interactive Depression Assessment
  System Empowered by Large Language Models
InterMind: A Doctor-Patient-Family Interactive Depression Assessment System Empowered by Large Language Models
Zhiyuan Zhou
Jilong Liu
Sanwang Wang
Shijie Hao
Yanrong Guo
Richang Hong
AI4MH
34
0
0
23 Sep 2024
ChainBuddy: An AI Agent System for Generating LLM Pipelines
ChainBuddy: An AI Agent System for Generating LLM Pipelines
Jingyue Zhang
Ian Arawjo
LLMAG
22
3
0
20 Sep 2024
Unlocking Reasoning Potential in Large Langauge Models by Scaling
  Code-form Planning
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
Jiaxin Wen
Jian Guan
Hongning Wang
Wei Wu
Minlie Huang
ReLM
OffRL
LRM
26
7
0
19 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
111
79
0
18 Sep 2024
LLM-as-BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning
LLM-as-BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning
Jicong Ao
Fan Wu
Yansong Wu
Abdalla Swikir
Sami Haddadin
26
5
0
16 Sep 2024
Confidence Estimation for LLM-Based Dialogue State Tracking
Confidence Estimation for LLM-Based Dialogue State Tracking
Yi-Jyun Sun
Suvodip Dey
Dilek Z. Hakkani-Tür
Gökhan Tür
30
1
0
15 Sep 2024
Behavior Tree Generation using Large Language Models for Sequential
  Manipulation Planning with Human Instructions and Feedback
Behavior Tree Generation using Large Language Models for Sequential Manipulation Planning with Human Instructions and Feedback
Jicong Ao
Yansong Wu
Fan Wu
Sami Haddadin
LM&Ro
13
1
0
14 Sep 2024
DiPT: Enhancing LLM reasoning through diversified perspective-taking
DiPT: Enhancing LLM reasoning through diversified perspective-taking
H. Just
Mahavir Dabas
Lifu Huang
Ming Jin
Ruoxi Jia
LRM
32
1
0
10 Sep 2024
MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene
  Experiences With Ambient Awareness And Personalization
MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene Experiences With Ambient Awareness And Personalization
Haoxuan Liu
Zihao Wang
HaoRong Hong
Youwei Feng
Jiaxin Yu
Han Diao
Yunfei Xu
K. Zhang
24
0
0
05 Sep 2024
Agentic Society: Merging skeleton from real world and texture from Large
  Language Model
Agentic Society: Merging skeleton from real world and texture from Large Language Model
Yuqi Bai
Kun Sun
Huishi Yin
26
1
0
02 Sep 2024
The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned
  Language Models
The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models
Karime Maamari
Fadhil Abubaker
Daniel Jaroslawicz
Amine Mhedhbi
LRM
52
25
0
14 Aug 2024
Document-Level Event Extraction with Definition-Driven ICL
Document-Level Event Extraction with Definition-Driven ICL
Zhuoyuan Liu
Yilin Luo
74
1
0
10 Aug 2024
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered
  Applications are Vulnerable to PromptWares
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares
Stav Cohen
Ron Bitton
Ben Nassi
SILM
33
5
0
09 Aug 2024
Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM
  Auto-Prompting
Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting
Xiangyu Zhao
Chengqian Ma
22
2
0
02 Aug 2024
Coalitions of Large Language Models Increase the Robustness of AI Agents
Coalitions of Large Language Models Increase the Robustness of AI Agents
Prattyush Mangal
Carol Mak
Theo Kanakis
Timothy Donovan
Dave Braines
Edward Pyzer-Knapp
28
1
0
02 Aug 2024
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
Mengkang Hu
Yixiao Wang
Can Xu
Lingfeng Sun
Chensheng Peng
T. Hannagan
Nicola Poerio
Saravan Rajmohan
LM&Ro
LLMAG
60
15
0
01 Aug 2024
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music
  Understanding and Generation
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Ziya Zhou
Yuhang Wu
Zhiyue Wu
Xinyue Zhang
Ruibin Yuan
Yi Ma
Lu Wang
Emmanouil Benetos
Wei Xue
Yi-Ting Guo
LRM
35
2
0
31 Jul 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship
  Verification
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
Yujia Hu
Zhiqiang Hu
C. Seah
Roy Ka-wei Lee
19
0
0
16 Jul 2024
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical
  Reasoning with Checklist
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou
Shudong Liu
Maizhen Ning
Wei Liu
Jindong Wang
Derek F. Wong
Xiaowei Huang
Qiufeng Wang
Kaizhu Huang
ELM
LRM
61
23
0
11 Jul 2024
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Miao Zheng
H. Liang
Fan Yang
Haoze Sun
Tianpeng Li
...
Kun Fang
Weipeng Chen
Bin Cui
Wentao Zhang
Zenan Zhou
RALM
37
3
0
08 Jul 2024
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Gaurav Sahu
Abhay Puri
Juan A. Rodriguez
Alexandre Drouin
Perouz Taslakian
...
Christopher Pal
Nicolas Chapados
I. Laradji
Sai Rajeswar Mudumba
Issam Hadj Laradji
ELM
37
4
0
08 Jul 2024
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang
Chancharik Mitra
Assaf Arbelle
Leonid Karlinsky
Trevor Darrell
Roei Herzig
37
12
0
21 Jun 2024
End-to-end Text-to-SQL Generation within an Analytics Insight Engine
End-to-end Text-to-SQL Generation within an Analytics Insight Engine
Karime Maamari
Amine Mhedhbi
SyDa
43
4
0
17 Jun 2024
CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken
  Language Understanding
CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding
Libo Qin
Fuxuan Wei
Qiguang Chen
Jingxuan Zhou
Shijue Huang
Jiasheng Si
Wenpeng Lu
Wanxiang Che
LRM
VLM
40
0
0
15 Jun 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CE
LLMAG
LM&Ro
54
44
0
11 Jun 2024
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating
  Automated Scientific Discovery Agents
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
Peter Alexander Jansen
Marc-Alexandre Côté
Tushar Khot
Erin Bransom
Bhavana Dalvi Mishra
Bodhisattwa Prasad Majumder
Oyvind Tafjord
Peter Clark
LLMAG
30
21
0
10 Jun 2024
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning
  Strategies
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Junlin Wang
Siddhartha Jain
Dejiao Zhang
Baishakhi Ray
Varun Kumar
Ben Athiwaratkun
30
19
0
10 Jun 2024
SelfGoal: Your Language Agents Already Know How to Achieve High-level
  Goals
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Ruihan Yang
Jiangjie Chen
Yikai Zhang
Siyu Yuan
Aili Chen
Kyle Richardson
Yanghua Xiao
Deqing Yang
AI4CE
LM&Ro
41
8
0
07 Jun 2024
Mixture-of-Agents Enhances Large Language Model Capabilities
Mixture-of-Agents Enhances Large Language Model Capabilities
Junlin Wang
Jue Wang
Ben Athiwaratkun
Ce Zhang
James Zou
LLMAG
AIFin
36
94
0
07 Jun 2024
Towards Learning Foundation Models for Heuristic Functions to Solve
  Pathfinding Problems
Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems
Vedant Khandelwal
Amit Sheth
Forest Agostinelli
26
2
0
01 Jun 2024
Previous
12345
Next