Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.11366
Cited By
v1
v2
v3
v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,260 papers shown
Title
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLM
LRM
244
0
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
141
2
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLL
LLMAG
LRM
258
3
0
09 Nov 2025
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Hiroaki Hayashi
Bo Pang
Wenting Zhao
Ye Liu
Akash Gokul
Srijan Bansal
Caiming Xiong
Semih Yavuz
Yingbo Zhou
LLMAG
LM&Ro
LRM
292
0
0
08 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
344
0
0
08 Nov 2025
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
David Acuna
Chao-Han Huck Yang
Yuntian Deng
Jaehun Jung
Ximing Lu
Prithviraj Ammanabrolu
Hyunwoo J. Kim
Yuan-Hong Liao
Yejin Choi
ReLM
OffRL
LRM
327
1
0
07 Nov 2025
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
S. Kim
S. Hong
Hojung Jung
Youngrok Park
Se-Young Yun
DiffM
VLM
116
0
0
07 Nov 2025
Collaborative Agents for Automated Program Repair in Ruby
Nikta Akbarpour
Mahdieh Sadat Benis
Fatemeh H. Fard
Ali Ouni
Mohamed Aymen Saied
KELM
273
0
0
06 Nov 2025
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
Khouloud Oueslati
Maxime Lamothe
Foutse Khomh
LLMAG
144
0
0
05 Nov 2025
Secure Code Generation at Scale with Reflexion
Arup Datta
Ahmed Aljohani
Hyunsook Do
ELM
116
0
0
05 Nov 2025
Leveraging LLM-based agents for social science research: insights from citation network simulations
Jiarui Ji
Runlin Lei
X. Pan
Zhewei Wei
Hao Sun
...
X. Chen
Yongzheng Yang
Yaliang Li
Bolin Ding
Ji-Rong Wen
LLMAG
266
0
0
05 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
91
0
0
03 Nov 2025
Continual Learning, Not Training: Online Adaptation For Agents
Aman Jaglan
Jarrod Barnes
CLL
168
0
0
02 Nov 2025
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
Ruofan Liu
Yun Lin
Zhiyong Huang
Jin Song Dong
AAML
SILM
342
0
0
01 Nov 2025
A CPU-Centric Perspective on Agentic AI
Ritik Raj
Hong Wang
Tushar Krishna
217
0
0
01 Nov 2025
CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions
Lingyue Fu
Xin Ding
Yaoming Zhu
Shao Zhang
Lin Qiu
...
W. Zhang
Xuezhi Cao
Xunliang Cai
Jiaxin Ding
Yong Yu
LLMAG
ELM
203
0
0
30 Oct 2025
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun ouyang
Haoyu Wang
Dong Fang
LLMAG
AI4CE
174
0
0
29 Oct 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Durga Prasad Maram
Dhruvin Gandhi
Z. Yao
Gayathri Akkinapalli
Franck Dernoncourt
Yu Wang
Ryan Rossi
Nesreen K. Ahmed
124
0
0
28 Oct 2025
Evidence-Bound Autonomous Research (EviBound): A Governance Framework for Eliminating False Claims
Ruiying Chen
72
0
0
28 Oct 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
...
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
166
0
0
28 Oct 2025
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffM
MLLM
EGVM
VLM
246
0
0
28 Oct 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
Xuanzhong Chen
Zile Qiao
Guoxin Chen
L. Su
Zhen Zhang
Xinyu Wang
Pengjun Xie
Fei Huang
Jingren Zhou
Yong Jiang
LLMAG
ELM
141
2
0
28 Oct 2025
From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems
Yu Luo
Jiamin Jiang
J. Feng
Lei Tao
Qingliang Zhang
Xidao Wen
Yongqian Sun
Shenglin Zhang
Jielong Huang
157
0
0
28 Oct 2025
Language Server CLI Empowers Language Agents with Process Rewards
Yifan Zhang
Lanser Contributors
44
0
0
27 Oct 2025
COOPERA: Continual Open-Ended Human-Robot Assistance
Chenyang Ma
Kai Lu
Ruta Desai
Xavier Puig
Andrew Markham
Niki Trigoni
128
1
0
27 Oct 2025
SwiftSolve: A Self-Iterative, Complexity-Aware Multi-Agent Framework for Competitive Programming
Adhyayan Veer Singh
Aaron Shen
Brian Law
Ahmed Ismail
Jonas Rohweder
Sean O'Brien
Kevin Zhu
LRM
81
0
0
26 Oct 2025
Agentic Meta-Orchestrator for Multi-task Copilots
Xiaofeng Zhu
Yunshen Zhou
LLMAG
257
0
0
26 Oct 2025
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen
Xinni Zhang
Yifei Zhang
Yangning Li
Henry Peng Zou
Chunyu Miao
Weizhi Zhang
Xue Liu
Philip S. Yu
129
1
0
25 Oct 2025
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
Hongwei Zhang
Ji Lu
Shiqing Jiang
Chenxiang Zhu
Li Xie
...
Baoyu Tang
Lingjun Huang
Baoli Wang
Fang Tan
Peng Zou
LRM
170
1
0
24 Oct 2025
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li
Josep Lopez Camunas
Jake Thomas Touchet
Jacob Andreas
Àgata Lapedriza
Antonio Torralba
Tamar Rott Shaham
183
0
0
24 Oct 2025
LLM-AR: LLM-powered Automated Reasoning Framework
Rick Chen
Joseph Ternasky
Aaron Ontoyin Yin
Xianling Mu
Fuat Alican
Yigit Ihlamur
LRM
52
0
0
24 Oct 2025
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models
Xueyuan Lin
Cehao Yang
Ye Ma
Ming Li
Rongjunchen Zhang
Yang Ni
Xiaojun Wu
Chengjin Xu
Jian Guo
Hui Xiong
AIFin
LRM
170
0
0
24 Oct 2025
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
Ravindra Aribowo Tarunokusumo
Rafael Fernandes Cunha
OffRL
ReLM
LRM
128
0
0
24 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&Ro
AI4CE
148
0
0
23 Oct 2025
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
Zhenning Yang
Hui Guan
Victor Nicolet
Brandon Paulsen
Joey Dodds
Daniel Kroening
Ang Chen
96
0
0
23 Oct 2025
Code-enabled language models can outperform reasoning models on diverse tasks
Cedegao E. Zhang
Cédric Colas
Gabriel Poesia
Joshua B. Tenenbaum
Jacob Andreas
ReLM
ALM
LRM
AI4CE
164
0
0
23 Oct 2025
NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning
Wonje Choi
Jooyoung Kim
Honguk Woo
LRM
124
0
0
22 Oct 2025
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
Yimeng Zhang
Jiri Gesi
Ran Xue
Tian Wang
Ziyi Wang
...
Qingjun Cui
Yufan Guo
Jing Huang
Mubarak Shah
Dakuo Wang
OffRL
156
0
0
22 Oct 2025
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Gil Pasternak
Dheeraj Rajagopal
Julia White
Dhruv Atreja
Matthew Thomas
George Hurn-Maloney
Ash Lewis
LLMAG
171
0
0
22 Oct 2025
Teaming LLMs to Detect and Mitigate Hallucinations
Demian Till
John Smeaton
Peter Haubrick
Gouse Saheb
Florian Graef
David Berman
HILM
314
0
0
22 Oct 2025
StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking
Haoran Zhang
C. Zhu
Sicong Guo
Hanzhe Guo
Haiming Li
Donglin Yu
122
0
0
21 Oct 2025
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng
Mian Deng
Chenjing Liang
Zeming Gao
Chennan Ma
Chenxing Lin
Haipeng Zhang
Songzhu Mei
Cheng-Yu Wang
Siqi Shen
141
0
0
21 Oct 2025
Illusions of reflection: open-ended task reveals systematic failures in Large Language Models' reflective reasoning
Sion Weatherhead
Flora D. Salim
Aaron Belbasis
ReLM
LRM
ELM
186
0
0
21 Oct 2025
AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library
Minwei Kong
Ao Qu
Xiaotong Guo
Wenbin Ouyang
Chonghe Jiang
...
Hai Wang
Cathy Wu
Jinhua Zhao
Cathy Wu
Jinhua Zhao
96
0
0
21 Oct 2025
Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs
Paula Cordero-Encinar
Andrew Duncan
LRM
193
1
0
20 Oct 2025
OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning
Zhenyu Bi
Meng Lu
Yang Li
Swastik Roy
Weijie Guan
Morteza Ziyadi
Xuan Wang
LLMAG
LRM
130
1
0
20 Oct 2025
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Akshara Prabhakar
Roshan Ram
Zixiang Chen
Silvio Savarese
Frank Wang
Caiming Xiong
Huan Wang
Weiran Yao
154
0
0
20 Oct 2025
Reasoning Distillation and Structural Alignment for Improved Code Generation
Amir Jalilifard
Anderson de Rezende Rocha
Marcos Medeiros Raimundo
OffRL
LRM
108
0
0
20 Oct 2025
DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data
Aymane Hassini
80
0
0
20 Oct 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
142
0
0
20 Oct 2025
Previous
1
2
3
4
5
...
24
25
26
Next