v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

20 March 2023

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,271 papers shown

Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning

192

16 Sep 2025

EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer

209

16 Sep 2025

^2

R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents

132

16 Sep 2025

AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving

16 Sep 2025

Enhancing Computational Cognitive Architectures with LLMs: A Case Study

Ron Sun

129

13 Sep 2025

ZapGPT: Free-form Language Prompting for Simulated Cellular Control

131

12 Sep 2025

SEDM: Scalable Self-Evolving Distributed Memory for Agents

212

11 Sep 2025

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

135

11 Sep 2025

Latency and Token-Aware Test-Time Compute

Jenny Y. Huang

Mehul Damani

Yousef El-Kurdi

Ramón Fernandez Astudillo

Wei Sun

100

11 Sep 2025

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

...

120

10 Sep 2025

Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing

Lukas Toral

Teddy Lazebnik

183

10 Sep 2025

Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals

Cheng Chen

Haiyan Yin

Ivor Tsang

155

10 Sep 2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

...

155

10 Sep 2025

K2-Think: A Parameter-Efficient Reasoning System

...

307

09 Sep 2025

RAFFLES: Reasoning-based Attribution of Faults for LLM Systems

157

08 Sep 2025

Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting

Shashidhar Reddy Javaji

Bhavul Gauri

Zining Zhu

LRM

222

08 Sep 2025

MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining

455

08 Sep 2025

PaVeRL-SQL: Text-to-SQL via Partial-Match Rewards and Verbal Reinforcement Learning

Heng Hao

Wenjun Hu

Oxana Verkholyak

Davoud Ataee Tarzanagh

135

08 Sep 2025

Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning

Hong Su

LRM

149

06 Sep 2025

Orchestrator: Active Inference for Multi-Agent Systems in Long-Horizon Tasks

140

06 Sep 2025

DRF: LLM-AGENT Dynamic Reputation Filtering Framework

131

06 Sep 2025

AI Agents for Web Testing: A Case Study in the Wild

140

05 Sep 2025

Bootstrapping Task Spaces for Self-Improvement

176

04 Sep 2025

Long-Horizon Visual Imitation Learning via Plan and Code Reflection

175

04 Sep 2025

Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent

128

04 Sep 2025

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

195

04 Sep 2025

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents

242

03 Sep 2025

ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation

172

02 Sep 2025

Plan Verification for LLM-Based Embodied Task Completion Agents

388

02 Sep 2025

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

...

288

02 Sep 2025

On Verifiable Legal Reasoning: A Multi-Agent Framework with Formalized Knowledge Representations

Albert Sadowski

Jarosław A. Chudziak

ELM

152

31 Aug 2025

Analysis of Error Sources in LLM-based Hypothesis Search for Few-Shot Rule Induction

133

31 Aug 2025

SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction

Saumya Chaturvedi

Aman Chadha

Laurent Bindschaedler

LRM

151

30 Aug 2025

LLM-Assisted Iterative Evolution with Swarm Intelligence Toward SuperBrain

Li Weigang

Pedro Brom

Lucas Ramson Siefert

169

30 Aug 2025

SHERPA: A Model-Driven Framework for Large Language Model Execution

Boqi Chen

Kua Chen

José Antonio Hernández López

122

29 Aug 2025

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

262

29 Aug 2025

Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning

157

27 Aug 2025

Adaptive Originality Filtering: Rejection Based Prompting and RiddleScore for Culturally Grounded Multilingual Riddle Generation

217

26 Aug 2025

Trustworthy Agents for Electronic Health Records through Confidence Estimation

102

26 Aug 2025

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

176

26 Aug 2025

Entropy-Guided Loop: Achieving Reasoning through Uncertainty-Aware Generation

Andrew G. A. Correa

Ana C. H de Matos

LRM

171

26 Aug 2025

Reflection-Enhanced Meta-Optimization Integrating TextGrad-style Prompt Optimization with Memory-Driven Self-Evolution

Chunlong Wu

Zhibo Qu

160

26 Aug 2025

UniC-RAG: Universal Knowledge Corruption Attacks to Retrieval-Augmented Generation

143

26 Aug 2025

RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System

...

143

25 Aug 2025

From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users

335

24 Aug 2025

WebSight: A Vision-First Architecture for Robust Web Agents

Tanvir Bhathal

Asanshay Gupta

LRM

134

23 Aug 2025

Unveiling the Latent Directions of Reflection in Large Language Models

256

23 Aug 2025

The next question after Turing's question: Introducing the Grow-AI test

Alexandru Tugui

ELM

128

22 Aug 2025

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

...

Youssef Attia El Hili

Linyi Yang

Jun Wang

LLMAG

443

22 Aug 2025

LLM Agents for Generating Microservice-based Applications: how complex is your specification?

Daniel M. Yellin

120

22 Aug 2025