v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

20 March 2023

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,269 papers shown

Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs

Paula Cordero-Encinar

Andrew Duncan

LRM

196

20 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

158

20 Oct 2025

OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning

134

20 Oct 2025

DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data

Aymane Hassini

20 Oct 2025

Reasoning Distillation and Structural Alignment for Improved Code Generation

Amir Jalilifard

Anderson de Rezende Rocha

Marcos Medeiros Raimundo

OffRL LRM

131

20 Oct 2025

Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics

168

20 Oct 2025

STARK: Strategic Team of Agents for Refining Kernels

19 Oct 2025

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

177

19 Oct 2025

What Limits Agentic Systems Efficiency?

Shivaram Venkataraman

LLMAG LRM

141

18 Oct 2025

SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning

...

159

18 Oct 2025

Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis

138

18 Oct 2025

LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs

176

18 Oct 2025

EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle

...

112

17 Oct 2025

CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning

104

17 Oct 2025

Experience-Driven Exploration for Efficient API-Free AI Agents

203

17 Oct 2025

Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions

...

134

17 Oct 2025

The Gatekeeper Knows Enough

Fikresilase Wondmeneh Abebayew

LLMAG

16 Oct 2025

LLM Agents Beyond Utility: An Open-Ended Perspective

209

16 Oct 2025

Stop-RAG: Value-Based Retrieval Control for Iterative RAG

Jaewan Park

Solbee Cho

Jay-Yoon Lee

114

16 Oct 2025

Mapping Smarter, Not Harder: A Test-Time Reinforcement Learning Agent That Improves Without Labels or Model Updates

Wen-Kwang Tsao

Yao-Ching Yu

Chien-Ming Huang

16 Oct 2025

Stable but Miscalibrated: A Kantian View on Overconfidence from Filters to Large Language Models

Akira Okutomi

LRM

200

16 Oct 2025

Training LLM Agents to Empower Humans

184

15 Oct 2025

EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems

301

15 Oct 2025

Static Sandboxes Are Inadequate: Modeling Societal Complexity Requires Open-Ended Co-Evolution in LLM-Based Multi-Agent Simulations

262

15 Oct 2025

Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

187

15 Oct 2025

LLMs Can Get "Brain Rot"!

155

15 Oct 2025

Toward Reasoning-Centric Time-Series Analysis

143

14 Oct 2025

Deep Research Brings Deeper Harm

172

13 Oct 2025

ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding

106

13 Oct 2025

PaperArena: An Evaluation Benchmark for Tool-Augmented Agentic Reasoning on Scientific Literature

264

13 Oct 2025

RAG-Pull: Imperceptible Attacks on RAG Systems for Code Generation

219

13 Oct 2025

A Survey on Agentic Multimodal Large Language Models

...

LM&Ro AIFin AI4TS LRM AI4CE

250

13 Oct 2025

D3MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems

124

12 Oct 2025

Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting

106

11 Oct 2025

Failure-Driven Workflow Refinement

115

11 Oct 2025

Audit-of-Understanding: Posterior-Constrained Inference for Mathematical Reasoning in Language Models

187

11 Oct 2025

Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges

...

285

10 Oct 2025

GRETEL: A Goal-driven Retrieval and Execution-based Trial Framework for LLM Tool Selection Enhancing

10 Oct 2025

Fundamentals of Building Autonomous LLM Agents

Victor de Lamo Castrillo

204

10 Oct 2025

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

116

10 Oct 2025

How can we assess human-agent interactions? Case studies in software agent design

178

10 Oct 2025

Gold Panning: Turning Positional Bias into Signal for Multi-Document LLM Reasoning

Adam Byerly

Daniel Khashabi

116

10 Oct 2025

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation

118

09 Oct 2025

Agent Learning via Early Experience

...

198

09 Oct 2025

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation

09 Oct 2025

FlowSearch: Advancing deep research with dynamic structured knowledge flow

...

Wenlong Zhang

Lei Bai

Bo Zhang

AI4CE

150

09 Oct 2025

MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding

Siddeshwar Raghavan

Tanwi Mallick

AI4CE

136

09 Oct 2025

Training-Free Group Relative Policy Optimization

...

230

09 Oct 2025

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

175

09 Oct 2025

RA-Gen: A Controllable Code Generation Framework Using ReAct for Multi-Agent Task Execution

09 Oct 2025