ResearchTrend.AI
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

27 October 2023
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
    LRM

Papers citing "Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models"

9 / 9 papers shown
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
W. Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
17
0
0
13 May 2025
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
Ziyu Wan
Yunxiang Li
Y. Song
Hanjing Wang
Linyi Yang
Mark W. Schmidt
J. Wang
Weinan Zhang
Shuyue Hu
Ying Wen
LLMAG
KELM
LRM
AI4CE
84
6
0
12 Mar 2025
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Mohamed Salim Aissi
Clément Romac
Thomas Carta
Sylvain Lamprier
Pierre-Yves Oudeyer
Olivier Sigaud
Laure Soulier
Nicolas Thome
19
2
0
25 Oct 2024
Efficient Reinforcement Learning with Large Language Model Priors
Xue Yan
Yan Song
Xidong Feng
Mengyue Yang
Haifeng Zhang
Haitham Bou Ammar
Jun Wang
OffRL
31
3
0
10 Oct 2024
Generative Agents: Interactive Simulacra of Human Behavior
J. Park
Joseph C. O'Brien
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
LM&Ro
AI4CE
215
1,727
0
07 Apr 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
233
2,470
0
06 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,217
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
Hierarchical Reinforcement Learning By Discovering Intrinsic Options
Jesse Zhang
Haonan Yu
W. Xu
BDL
120
82
0
16 Jan 2021