Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2303.11366
Cited By
v1
v2
v3
v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,260 papers shown
Title
MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning
Huihao Jing
Wenbin Hu
Hongyu Luo
Jianhui Yang
Wei Fan
Haoran Li
Yangqiu Song
LLMAG
AILaw
ELM
228
0
0
29 Sep 2025
Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents
Kuntai Cai
Juncheng Liu
Xianglin Yang
Zhaojie Niu
X. Xiao
Xing Chen
LLMAG
204
0
0
29 Sep 2025
Agentic Specification Generator for Move Programs
Yu-Fu Fu
Meng Xu
Taesoo Kim
52
1
0
29 Sep 2025
Dual-Scale World Models for LLM Agents Towards Hard-Exploration Problems
Minsoo Kim
Seung-won Hwang
204
0
0
28 Sep 2025
How LLMs Learn to Reason: A Complex Network Perspective
Sihan Hu
X-D Cai
Yuan Huang
Zhiyuan Yao
Linfeng Zhang
Pan Zhang
Youjin Deng
Kun Chen
LRM
213
1
0
28 Sep 2025
RADAR: A Risk-Aware Dynamic Multi-Agent Framework for LLM Safety Evaluation via Role-Specialized Collaboration
X. Chen
Jian Zhao
Yuchen Yuan
T. Zhang
Huilin Zhou
...
Ping Hu
Linghe Kong
Chi Zhang
Weiran Huang
Xuelong Li
311
3
0
28 Sep 2025
Optimization Modeling via Semantic Anchored Alignment
Yansen Zhang
Qingcan Kang
Yujie Chen
Yufei Wang
Xiongwei Han
Tao Zhong
Mingxuan Yuan
Chen Ma
88
0
0
28 Sep 2025
PARL-MT: Learning to Call Functions in Multi-Turn Conversation with Progress Awareness
Huacan Chai
Zijie Cao
M. R
Y. Yang
Jianghao Lin
...
Muning Wen
Weiwen Liu
Weinan Zhang
Fei Huang
Y. Wen
OffRL
AIFin
LRM
186
0
0
27 Sep 2025
Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts
Guancheng Wan
Leixin Sun
Longxu Dou
Zitong Shi
Fang Wu
...
Hejia Geng
Xiangru Tang
Z. Yin
Yizhou Sun
Wei Wang
142
1
0
27 Sep 2025
GUI-PRA: Process Reward Agent for GUI Tasks
Tao Xiong
Xavier Hu
Yurun Chen
Yuhang Liu
Changqiao Wu
Pengzhi Gao
Wei Liu
Jian Luan
Shengyu Zhang
LLMAG
237
0
0
27 Sep 2025
Cognition-of-Thought Elicits Social-Aligned Reasoning in Large Language Models
Xuanming Zhang
Yuxuan Chen
Min-Hsuan Yeh
Yixuan Li
LRM
200
2
0
27 Sep 2025
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
Abdul Monaf Chowdhury
Akm Moshiur Rahman Mazumder
Rabeya Akter
S. Arib
LM&Ro
100
0
0
27 Sep 2025
Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning
Yajie Qi
Wei Wei
Lin Li
Lijun Zhang
Zhidong Gao
Da Wang
Huizhong Song
120
0
0
26 Sep 2025
A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Z. Wang
Boye Niu
Ruoyao Xiao
Linghui Meng
Jing Liu
Zhi Zheng
Tong Xu
H. Wu
Haifeng Wang
Enhong Chen
LRM
96
1
0
26 Sep 2025
Benchmarking and Mitigating Sycophancy in Medical Vision Language Models
Zikun Guo
Xinyue Xu
Pei Xiang
Shu Yang
Xin Han
Di Wang
Lijie Hu
128
0
0
26 Sep 2025
Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
Heyang Gao
Guoqing Liu
Erxue Min
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Xu Chen
144
0
0
26 Sep 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
381
1
0
26 Sep 2025
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
Yangtian Zi
Zixuan Wu
Aleksander Boruch-Gruszecki
Jonathan Bell
Arjun Guha
148
0
0
26 Sep 2025
Creative Adversarial Testing (CAT): A Novel Framework for Evaluating Goal-Oriented Agentic AI Systems
Hassen Dhrif
60
0
0
26 Sep 2025
Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning
Xiangru Tang
Wanghan Xu
Yujie Wang
Qiantai Feng
Daniel Shao
...
Lei Bai
Z. Yin
Philip Torr
Hanrui Wang
Di Jin
LLMAG
LRM
183
1
0
25 Sep 2025
CLAUSE: Agentic Neuro-Symbolic Knowledge Graph Reasoning via Dynamic Learnable Context Engineering
Yang Zhao
Chengxiao Dai
Wei Zhuo
Yue Xiu
Dusit Niyato
140
0
0
25 Sep 2025
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Sasha Cui
Zhongren Chen
LLMSV
220
1
0
25 Sep 2025
Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval
Vivek Bhavsar
Joseph Ereifej
Aravanan Gurusami
RALM
96
0
0
25 Sep 2025
What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns
Stefan Szeider
LLMAG
LM&Ro
62
0
0
25 Sep 2025
Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning
Hanjiang Hu
Changliu Liu
Na Li
Yebin Wang
OffRL
LRM
111
0
0
24 Sep 2025
ToolBrain: A Flexible Reinforcement Learning Framework for Agentic Tools
Quy Minh Le
Minh Sao Khue Luu
Khanh-Tung Tran
Duc-Hai Nguyen
Hoang-Quoc-Viet Pham
Quan Le
Hoang Thanh Lam
Hoang D. Nguyen
56
1
0
24 Sep 2025
DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs
WenZhuo Zhu
Zheng Cui
Wenhan Lu
Sheng Liu
Yue Zhao
AI4CE
32
0
0
24 Sep 2025
Embodied AI: From LLMs to World Models
Tongtong Feng
Xin Wang
Yu Jiang
Wenwu Zhu
LM&Ro
325
8
0
24 Sep 2025
SIM-CoT: Supervised Implicit Chain-of-Thought
Xilin Wei
Xiaoran Liu
Yuhang Zang
Xiaoyi Dong
Yuhang Cao
Jiaqi Wang
Xipeng Qiu
Dahua Lin
LRM
198
3
0
24 Sep 2025
MARS: toward more efficient multi-agent collaboration for LLM reasoning
X. Wang
Jia Wang
Y Samuel Wang
Pengtao Dang
Sha Cao
Chi Zhang
LLMAG
LRM
101
1
0
24 Sep 2025
Federation of Agents: A Semantics-Aware Communication Fabric for Large-Scale Agentic AI
Lorenzo Giusti
Ole Anton Werner
Riccardo Taiello
Matilde Carvalho Costa
Emre Tosun
...
Marc Molina
Rodrigo Lopes de Almeida
Paolo Cacace
Diogo Reis Santos
Luigi Serio
172
0
0
24 Sep 2025
The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving
Jay Patrikar
Apoorva Sharma
Sushant Veer
Boyi Li
Sebastian A. Scherer
Marco Pavone
124
0
0
23 Sep 2025
Experience Scaling: Post-Deployment Evolution For Large Language Models
Xingkun Yin
Kaibin Huang
Dong In Kim
Hongyang Du
111
0
0
23 Sep 2025
Code Driven Planning with Domain-Adaptive Critic
Zikang Tian
Shaohui Peng
Du Huang
Jiaming Guo
Ruizhi Chen
...
Qi Guo
Ling Li
Yewen Pu
Xing Hu
Yunji Chen
173
0
0
23 Sep 2025
MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service
Y. Huang
Yang Liu
Ruiyu Zhao
Xiaolong Zhong
Xingming Yue
Ling Jiang
LLMAG
KELM
100
1
0
23 Sep 2025
Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference
Ben Finkelshtein
Silviu Cucerzan
S. Jauhar
Ryen W. White
160
0
0
23 Sep 2025
Check Field Detection Agent (CFD-Agent) using Multimodal Large Language and Vision Language Models
Sourav Halder
Jinjun Tong
Xinyu Wu
81
0
0
22 Sep 2025
Variation in Verification: Understanding Verification Dynamics in Large Language Models
Yefan Zhou
Austin Xu
Yilun Zhou
Janvijay Singh
Jiang Gui
Shafiq Joty
LRM
176
3
0
22 Sep 2025
MCTS-EP: Empowering Embodied Planning with Online Preference Optimization
Hang Xu
Zang Yu
Yehui Tang
Pengbo Hu
Yuhao Tang
Hao Dong
124
0
0
21 Sep 2025
Generating High-Quality Datasets for Code Editing via Open-Source Language Models
Zekai Zhang
Xin Peng
Z. Chen
Linxi Liang
Yuxuan Chen
Guangsheng Ou
Yanlin Wang
Dan Li
Xin Peng
Zibin Zheng
SyDa
186
0
0
19 Sep 2025
(P)rior(D)yna(F)low: A Priori Dynamic Workflow Construction via Multi-Agent Collaboration
Yi Lin
Lujin Zhao
Yijie Shi
AI4CE
112
0
0
18 Sep 2025
CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human
Nan Sun
Yongchang Li
Chenxu Wang
Huiying Li
Huaping Liu
LM&Ro
VLM
114
0
0
18 Sep 2025
Orion: Fuzzing Workflow Automation
Max Bazalii
Marius Fleischer
112
0
0
18 Sep 2025
An Empirical Study on Failures in Automated Issue Solving
Simiao Liu
Fang Liu
Liehao Li
Xin Tan
Yinghao Zhu
Xiaoli Lian
Li Zhang
100
4
0
17 Sep 2025
H
2
^2
2
R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
Shicheng Ye
Chao Yu
Kaiqiang Ke
C. Xu
Yinqi Wei
108
1
0
16 Sep 2025
AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving
Harper Reed
Michael Sugimura
Angelo Zangari
LLMAG
56
0
0
16 Sep 2025
Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning
Sijia Cui
Shuai Xu
Aiyao He
Yanna Wang
Bo Xu
LLMAG
165
2
0
16 Sep 2025
EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer
Pukun Zhao
Longxiang Wang
Miaowei Wang
Chen Chen
Fanqing Zhou
Haojian Huang
184
0
0
16 Sep 2025
Enhancing Computational Cognitive Architectures with LLMs: A Case Study
Ron Sun
117
0
0
13 Sep 2025
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
Bohao Tang
Yan Ma
Fei Zhang
Jiadi Su
Ethan Chern
Zhulin Hu
Zhixin Wang
Pengfei Liu
Ya Zhang
LRM
118
0
0
11 Sep 2025
Previous
1
2
3
4
5
6
...
24
25
26
Next