Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.11366
Cited By
v1
v2
v3
v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,271 papers shown
Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning
Sijia Cui
Shuai Xu
Aiyao He
Yanna Wang
Bo Xu
LLMAG
192
2
0
16 Sep 2025
EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer
Pukun Zhao
Longxiang Wang
Miaowei Wang
Chen Chen
Fanqing Zhou
Haojian Huang
209
0
0
16 Sep 2025
H
2
^2
2
R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
Shicheng Ye
Chao Yu
Kaiqiang Ke
C. Xu
Yinqi Wei
132
2
0
16 Sep 2025
AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving
Harper Reed
Michael Sugimura
Angelo Zangari
LLMAG
65
0
0
16 Sep 2025
Enhancing Computational Cognitive Architectures with LLMs: A Case Study
Ron Sun
129
1
0
13 Sep 2025
ZapGPT: Free-form Language Prompting for Simulated Cellular Control
Nam H. Le
Patrick Erickson
Yanbo Zhang
Michael Levin
Josh Bongard
LM&Ro
131
0
0
12 Sep 2025
SEDM: Scalable Self-Evolving Distributed Memory for Agents
Haoran Xu
Jiacong Hu
Ke Zhang
Lei Yu
Yuxin Tang
Xinyuan Song
Yiqun Duan
Lynn Ai
Bill Shi
212
1
0
11 Sep 2025
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
Bohao Tang
Yan Ma
Fei Zhang
Jiadi Su
Ethan Chern
Zhulin Hu
Zhixin Wang
Pengfei Liu
Ya Zhang
LRM
135
0
0
11 Sep 2025
Latency and Token-Aware Test-Time Compute
Jenny Y. Huang
Mehul Damani
Yousef El-Kurdi
Ramón Fernandez Astudillo
Wei Sun
100
2
0
11 Sep 2025
TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Kechen Jiao
Zhirui Fang
Jiahao Liu
Bei Li
Qifan Wang
...
Zhongjian Qiao
Yifan Zhu
Yaxin Xu
Jingang Wang
Xiu Li
120
0
0
10 Sep 2025
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Lukas Toral
Teddy Lazebnik
183
0
0
10 Sep 2025
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
Cheng Chen
Haiyan Yin
Ivor Tsang
155
1
0
10 Sep 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi
J. Huang
Chenyang Liao
Baodai Huang
Honglin Guo
...
Tao Gui
Zuxuan Wu
Qi Zhang
Xuanjing Huang
Yu-Gang Jiang
155
20
0
10 Sep 2025
K2-Think: A Parameter-Efficient Reasoning System
Zhoujun Cheng
Richard Fan
Shibo Hao
Taylor W. Killian
Haonan Li
...
Xuezhe Ma
Guowei He
Zhiting Hu
Zhengzhong Liu
Eric P. Xing
ReLM
OffRL
ALM
LRM
307
5
0
09 Sep 2025
RAFFLES: Reasoning-based Attribution of Faults for LLM Systems
Chenyang Zhu
Spencer Hong
Jingyu Wu
Kushal Chawla
Charlotte Tang
Youbing Yin
Nathan Wolfe
Erin Babinsky
Daben Liu
157
0
0
08 Sep 2025
Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting
Shashidhar Reddy Javaji
Bhavul Gauri
Zining Zhu
LRM
222
1
0
08 Sep 2025
MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining
Haoyu Dong
Pengkun Zhang
Mingzhe Lu
Yanzhen Shen
Guolin Ke
ReLM
LRM
455
3
0
08 Sep 2025
PaVeRL-SQL: Text-to-SQL via Partial-Match Rewards and Verbal Reinforcement Learning
Heng Hao
Wenjun Hu
Oxana Verkholyak
Davoud Ataee Tarzanagh
Baruch Gutow
Sima Didari
Masoud Faraki
H. Moon
Seungjai Min
135
0
0
08 Sep 2025
Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning
Hong Su
LRM
149
2
0
06 Sep 2025
Orchestrator: Active Inference for Multi-Agent Systems in Long-Horizon Tasks
Lukas Beckenbauer
Johannes-Lucas Loewe
Ge Zheng
Alexandra Brintrup
AI4CE
140
1
0
06 Sep 2025
DRF: LLM-AGENT Dynamic Reputation Filtering Framework
Yuwei Lou
Hao Hu
Shaocong Ma
Zongfei Zhang
Liang Wang
Jidong Ge
Xianping Tao
131
5
0
06 Sep 2025
AI Agents for Web Testing: A Case Study in the Wild
Naimeng Ye
Xiao Yu
Ruize Xu
Tianyi Peng
Zhou Yu
LLMAG
140
0
0
05 Sep 2025
Bootstrapping Task Spaces for Self-Improvement
Minqi Jiang
Andrei Lupu
Yoram Bachrach
LRM
176
3
0
04 Sep 2025
Long-Horizon Visual Imitation Learning via Plan and Code Reflection
Quan Chen
Chenrui Shi
Qi Chen
Yuwei Wu
Zhi Gao
Xintong Zhang
Rui Gao
Kun Wu
Yunde Jia
175
1
0
04 Sep 2025
Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent
Chunlong Wu
Ye Luo
Zhibo Qu
Min Wang
128
0
0
04 Sep 2025
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Matthew Ho
Chen Si
Zhaoxiang Feng
Fangxu Yu
Yichi Yang
Zhijian Liu
Zhiting Hu
Lianhui Qin
LRM
195
7
0
04 Sep 2025
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
Davide Paglieri
Bartłomiej Cupiał
Jonathan Cook
Ulyana Piterbarg
Jens Tuyls
Edward Grefenstette
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
242
2
0
03 Sep 2025
ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation
Yicong Zhao
Shisong Chen
Jiacheng Zhang
Zhixu Li
172
1
0
02 Sep 2025
Plan Verification for LLM-Based Embodied Task Completion Agents
Ananth Hariharan
Vardhan Dongre
Dilek Hakkani-Tur
Gokhan Tur
LLMAG
LM&Ro
388
1
0
02 Sep 2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Haoming Wang
Haoyang Zou
Huatong Song
J. Feng
Junjie Fang
...
Xianzheng Ma
Xiaojun Xiao
X. Y. Huang
Xinjie Chen
Yidi Du
LLMAG
288
54
0
02 Sep 2025
On Verifiable Legal Reasoning: A Multi-Agent Framework with Formalized Knowledge Representations
Albert Sadowski
Jarosław A. Chudziak
ELM
152
3
0
31 Aug 2025
Analysis of Error Sources in LLM-based Hypothesis Search for Few-Shot Rule Induction
Aishni Parab
Hongjing Lu
Ying Nian Wu
Sumit Gulwani
133
0
0
31 Aug 2025
SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction
Saumya Chaturvedi
Aman Chadha
Laurent Bindschaedler
LRM
151
2
0
30 Aug 2025
LLM-Assisted Iterative Evolution with Swarm Intelligence Toward SuperBrain
Li Weigang
Pedro Brom
Lucas Ramson Siefert
169
0
0
30 Aug 2025
SHERPA: A Model-Driven Framework for Large Language Model Execution
Boqi Chen
Kua Chen
José Antonio Hernández López
Gunter Mussbacher
Dániel Varró
Amir Feizpour
LRM
122
1
0
29 Aug 2025
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management
Tobias Lindenbauer
Igor Slinko
Ludwig Felder
Egor Bogomolov
Yaroslav Zharov
LLMAG
262
2
0
29 Aug 2025
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Yanbo Dai
Zhenlan Ji
Zongjie Li
Kuan Li
Shuai Wang
SILM
AAML
KELM
157
1
0
27 Aug 2025
Adaptive Originality Filtering: Rejection Based Prompting and RiddleScore for Culturally Grounded Multilingual Riddle Generation
Duy Le
Kent Ziti
Evan Girard-Sun
Bakr Bouhaya
Sean O'Brien
Sean O Brien
Kevin Zhu
217
0
0
26 Aug 2025
Trustworthy Agents for Electronic Health Records through Confidence Estimation
Yongwoo Song
Minbyul Jeong
Mujeen Sung
HILM
102
0
0
26 Aug 2025
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Taishi Nakamura
Satoki Ishikawa
Masaki Kawamura
Takumi Okamoto
Daisuke Nohara
Jun Suzuki
Rio Yokota
MoE
LRM
176
0
0
26 Aug 2025
Entropy-Guided Loop: Achieving Reasoning through Uncertainty-Aware Generation
Andrew G. A. Correa
Ana C. H de Matos
LRM
171
1
0
26 Aug 2025
Reflection-Enhanced Meta-Optimization Integrating TextGrad-style Prompt Optimization with Memory-Driven Self-Evolution
Chunlong Wu
Zhibo Qu
160
0
0
26 Aug 2025
UniC-RAG: Universal Knowledge Corruption Attacks to Retrieval-Augmented Generation
Runpeng Geng
Yanting Wang
Ying Chen
Jinyuan Jia
AAML
143
1
0
26 Aug 2025
RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Z. Chen
Han Li
Xinhao Zhang
Xiaoyu Chen
Chunyin Dong
...
Jinxu Li
S. Wang
Dousheng Zhao
Sanhai Gao
Guangyi Liu
143
0
0
25 Aug 2025
From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users
Sadia Sultana Chowa
Riasad Alvi
Subhey Sadi Rahman
M. R
M. R
M. Islam
Mukhtar Hussain
Sami Azam
LLMAG
LM&Ro
ELM
335
9
0
24 Aug 2025
WebSight: A Vision-First Architecture for Robust Web Agents
Tanvir Bhathal
Asanshay Gupta
LRM
134
2
0
23 Aug 2025
Unveiling the Latent Directions of Reflection in Large Language Models
Fu-Chieh Chang
Yu-Ting Lee
Pei-Yuan Wu
LLMSV
LRM
256
0
0
23 Aug 2025
The next question after Turing's question: Introducing the Grow-AI test
Alexandru Tugui
ELM
128
0
0
22 Aug 2025
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Huichi Zhou
Yihang Chen
Siyuan Guo
Xue Yan
Kin-Hei Lee
...
Ka Yiu Lee
Guchun Zhang
Youssef Attia El Hili
Linyi Yang
Jun Wang
LLMAG
443
13
0
22 Aug 2025
LLM Agents for Generating Microservice-based Applications: how complex is your specification?
Daniel M. Yellin
120
0
0
22 Aug 2025
Previous
1
2
3
...
5
6
7
...
24
25
26
Next
Page 6 of 26
Page
of 26
Go