Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

20 March 2023

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,269 papers shown

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

175

09 Oct 2025

xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning

...

172

09 Oct 2025

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline

268

08 Oct 2025

CompassLLM: A Multi-Agent Approach toward Geo-Spatial Reasoning for Popular Path Query

Md. Nazmul Islam Ananto

115

08 Oct 2025

SanDRA: Safe Large-Language-Model-Based Decision Making for Automated Vehicles Using Reachability Analysis

Yuanfei Lin

Sebastian Illing

Matthias Althoff

181

08 Oct 2025

Cross-Modal Attention Guided Unlearning in Vision-Language Models

185

08 Oct 2025

BG-FlipIn: A Bayesian game framework for FlipIt-insider models in advanced persistent threats

103

08 Oct 2025

Adaptive Tool Generation with Models as Tools and Reinforcement Learning

121

08 Oct 2025

Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation

...

226

08 Oct 2025

ProSEA: Problem Solving via Exploration Agents

144

08 Oct 2025

Non-Stationary Online Structured Prediction with Surrogate Losses

Shinsaku Sakaue

Han Bao

Yuzhou Cao

131

08 Oct 2025

Mission Impossible: Feedback-Guided Dynamic Interactive Planning for Improving Reasoning on LLMs

119

07 Oct 2025

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

150

07 Oct 2025

Limited-Angle Tomography Reconstruction via Projector Guided 3D Diffusion

114

07 Oct 2025

A Survey on Agentic Security: Applications, Threats and Defenses

146

07 Oct 2025

Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion

Vaughn Gzenda

Robin Chhabra

120

07 Oct 2025

Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context

150

07 Oct 2025

RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases

172

07 Oct 2025

ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems

Bohan Yao

Shiva Krishna Reddy Malay

Vikas Yadav

LM&Ro LRM

152

07 Oct 2025

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

134

06 Oct 2025

TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use

...

183

06 Oct 2025

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

589

06 Oct 2025

AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems

06 Oct 2025

ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts

...

142

06 Oct 2025

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

...

222

06 Oct 2025

Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization

Mohammad Mahdi Samiei Paqaleh

Arash Marioriyad

Arman Tahmasebi-Zadeh

Mohamadreza Fereydooni

Mahdi Ghaznavai

Mahdieh Soleymani Baghshah

120

06 Oct 2025

A global log for medical AI

Ayush Noori

Adam Rodman

Alan Karthikesalingam

Bilal A. Mateen

Christopher A. Longhurst

...

172

05 Oct 2025

NegotiationGym: Self-Optimizing Agents in a Multi-Agent Social Simulation Environment

05 Oct 2025

Zephyrus: An Agentic Framework for Weather Science

...

Taylor Berg-Kirkpatrick

120

05 Oct 2025

Large Language Models Hallucination: A Comprehensive Survey

Aisha Alansari

Hamzah Luqman

HILM LRM

461

05 Oct 2025

Utility-Learning Tension in Self-Modifying Agents

Charles L. Wang

Keir Dorchen

Peter Jin

129

05 Oct 2025

Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation

Orlando Marquez Ayala

128

05 Oct 2025

SPOGW: a Score-based Preference Optimization method via Group-Wise comparison for workflows

159

05 Oct 2025

Adversarial Agent Collaboration for C to Rust Translation

154

04 Oct 2025

Towards Policy-Compliant Agents: Learning Efficient Guardrails For Policy Violation Detection

156

03 Oct 2025

Self-Reflective Generation at Test Time

144

03 Oct 2025

FOR-Prompting: From Objection to Revision via an Asymmetric Prompting Protocol

02 Oct 2025

An Algorithmic Information-Theoretic Perspective on the Symbol Grounding Problem

Zhangchi Liu

106

02 Oct 2025

Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey

278

02 Oct 2025

ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards

176

01 Oct 2025

A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining

146

01 Oct 2025

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

01 Oct 2025

Fine-tuning with RAG for Improving LLM Learning of New Skills

100

01 Oct 2025

Rethinking Thinking Tokens: LLMs as Improvement Operators

191

01 Oct 2025

Typed Chain-of-Thought: A Curry-Howard Framework for Verifying LLM Reasoning

Elija Perrier

LRM

108

01 Oct 2025

MAVUL: Multi-Agent Vulnerability Detection via Contextual Reasoning and Interactive Refinement

125

30 Sep 2025

GRPO-λ: Credit Assignment improves LLM Reasoning

Prasanna Parthasarathi

175

30 Sep 2025

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs

137

30 Sep 2025

LLM-based Multi-Agent Blackboard System for Information Discovery in Data Science

136

30 Sep 2025

Interactive Learning for LLM Reasoning

280

30 Sep 2025