Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

20 March 2023

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,270 papers shown

LLM Agents can Autonomously Hack Websites

282

06 Feb 2024

Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models

The Web Conference (WWW), 2024

351

06 Feb 2024

Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

234

06 Feb 2024

RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

Yang You

212

06 Feb 2024

Toward Human-AI Alignment in Large-Scale Multi-Player Games

186

05 Feb 2024

Unified Hallucination Detection for Multimodal Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Ningyu Zhang

Lei Liang

Huajun Chen

450

05 Feb 2024

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

377

05 Feb 2024

Understanding the planning of LLM agents: A survey

Xu Huang

Defu Lian

Ruiming Tang

Enhong Chen

LLMAG LM&Ro

314

353

05 Feb 2024

Enhance Reasoning for Large Language Models in the Game Werewolf

307

04 Feb 2024

More Agents Is All You Need

371

117

03 Feb 2024

Calibration and Correctness of Language Models for Code

Md Rafiqul Islam Rabin

339

03 Feb 2024

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Yanghua Xiao

326

300

02 Feb 2024

Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning

215

02 Feb 2024

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

200

02 Feb 2024

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

575

173

02 Feb 2024

Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

195

02 Feb 2024

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing

Nancy F. Chen

254

01 Feb 2024

WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts

Pardis Sadat Zahraei

Ali Emami

164

31 Jan 2024

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution

Yankai Lin

Zhiyuan Liu

Maosong Sun

LLMAG

221

25 Jan 2024

Demystifying Chains, Trees, and Graphs of Thoughts

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

...

1.0K

25 Jan 2024

Multi-granularity Knowledge Transfer for Continual Reinforcement Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2024

564

25 Jan 2024

TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance

Lingkai Kong

259

24 Jan 2024

AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents

Neural Information Processing Systems (NeurIPS), 2024

Yujiu Yang

Lingpeng Kong

236

132

24 Jan 2024

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Michael Ahn

Debidatta Dwibedi

Chelsea Finn

Montse Gonzalez Arenas

...

244

23 Jan 2024

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

International Joint Conference on Artificial Intelligence (IJCAI), 2024

LLMAG LM&Ro AI4CE LRM

513

629

21 Jan 2024

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

...

Rui Wang

411

142

18 Jan 2024

A Study on Training and Developing Large Language Models for Behavior Tree Generation

259

16 Jan 2024

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)

International Conference on Machine Learning (ICML), 2024

Guikun Chen

513

16 Jan 2024

Small Language Model Can Self-correct

AAAI Conference on Artificial Intelligence (AAAI), 2024

Yanghua Xiao

269

14 Jan 2024

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ran Xu

203

13 Jan 2024

Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Planning Case Study

Shangding Gu

LLMAG

357

12 Jan 2024

AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Ningyu Zhang

Huajun Chen

351

10 Jan 2024

Agent Alignment in Evolving Social Norms

Shimin Li

Tianxiang Sun

Qinyuan Cheng

Xipeng Qiu

LLMAG

301

09 Jan 2024

A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates

Raphael Milliere

Cameron Buckner

LRM ELM

191

08 Jan 2024

CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models

214

06 Jan 2024

AFSPP: Agent Framework for Shaping Preference and Personality with Large Language Models

Zihong He

Changwang Zhang

LLMAG

185

05 Jan 2024

Using LLM to select the right SQL Query from candidates

Zhenwen Li

Tao Xie

LLMAG

164

04 Jan 2024

Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers

322

03 Jan 2024

State Machine of Thoughts: Leveraging Past Reasoning Trajectories for Enhancing Problem Solving

189

29 Dec 2023

Experiential Co-Learning of Software-Developing Agents

Wei Liu

...

Zhiyuan Liu

Maosong Sun

LLMAG

369

28 Dec 2023

AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI

Chun Yu

166

26 Dec 2023

LARP: Language-Agent Role Play for Open-World Games

247

24 Dec 2023

NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes

Lizhou Fan

357

22 Dec 2023

Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities

340

22 Dec 2023

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation

...

323

20 Dec 2023

AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation

Heming Cui

237

20 Dec 2023

Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives

Jingtao Ding

Yong Li

287

291

19 Dec 2023

Evaluating Language-Model Agents on Realistic Autonomous Tasks

...

323

18 Dec 2023

Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs

167

18 Dec 2023

CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update

Yuntao Du

Xiaojian Ma

395

18 Dec 2023