arXiv:2303.11366, v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,269 papers shown
Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs
Paula Cordero-Encinar
Andrew Duncan
LRM
196
1
0
20 Oct 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
158
0
0
20 Oct 2025
OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning
Zhenyu Bi
Meng Lu
Yang Li
Swastik Roy
Weijie Guan
Morteza Ziyadi
Xuan Wang
LLMAG
LRM
134
2
0
20 Oct 2025
DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data
Aymane Hassini
86
0
0
20 Oct 2025
Reasoning Distillation and Structural Alignment for Improved Code Generation
Amir Jalilifard
Anderson de Rezende Rocha
Marcos Medeiros Raimundo
OffRL
LRM
131
0
0
20 Oct 2025
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Akshara Prabhakar
Roshan Ram
Zixiang Chen
Silvio Savarese
Frank Wang
Caiming Xiong
Huan Wang
Weiran Yao
168
0
0
20 Oct 2025
STARK: Strategic Team of Agents for Refining Kernels
Juncheng Dong
Yang Yang
Tao Liu
Y. Wang
Feng Qi
Vahid Tarokh
Kaushik Rangadurai
Shuang Yang
LLMAG
94
1
0
19 Oct 2025
SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search
Dong Li
Xujiang Zhao
Linlin Yu
Yanchi Liu
Wei Cheng
Zhengzhang Chen
Zhong Chen
Feng Chen
Chen Zhao
H. Chen
LRM
177
1
0
19 Oct 2025
What Limits Agentic Systems Efficiency?
S. Bian
Minghao Yan
Anand Jayarajan
Gennady Pekhimenko
Shivaram Venkataraman
LLMAG
LRM
141
1
0
18 Oct 2025
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
Xiaojun Guo
Runyu Zhou
Yifei Wang
Qi Zhang
Chenheng Zhang
...
Xiaohan Wang
Jiajun Chai
Guojun Yin
Wei Lin
Y. Wang
LRM
VLM
159
2
0
18 Oct 2025
Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis
Wonduk Seo
Juhyeon Lee
Junseo Koh
Hyunjin An
Jian Park
Seunghyun Lee
Haihua Chen
Yi Bu
LLMAG
LRM
138
0
0
18 Oct 2025
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
Ang Li
Yifei Wang
Zhihang Yuan
Stefanie Jegelka
Y. X. R. Wang
ALM
KELM
176
0
0
18 Oct 2025
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
Rong Wu
Xiaoman Wang
Jianbiao Mei
Pinlong Cai
Daocheng Fu
...
Licheng Wen
Xuemeng Yang
Yufan Shen
Yuxin Wang
Botian Shi
112
3
0
17 Oct 2025
CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning
Yung-Chen Tang
Pin-Yu Chen
Andrea Cavallaro
LRM
104
0
0
17 Oct 2025
Experience-Driven Exploration for Efficient API-Free AI Agents
Chenwei Tang
Jingyu Xing
Xinyu Liu
Zizhou Wang
Jiawei Du
Liangli Zhen
Jiancheng Lv
203
0
0
17 Oct 2025
Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions
Xi Wang
Xianyao Ling
Kun Li
Gang Yin
Liang Zhang
...
Jun Xu
Fu Zhang
Wenbo Lei
Annie Wang
Peng Gong
134
0
0
17 Oct 2025
The Gatekeeper Knows Enough
Fikresilase Wondmeneh Abebayew
LLMAG
99
0
0
16 Oct 2025
LLM Agents Beyond Utility: An Open-Ended Perspective
Asen Nachkov
Xi Wang
Luc Van Gool
LLMAG
AIFin
ELM
LRM
209
0
0
16 Oct 2025
Stop-RAG: Value-Based Retrieval Control for Iterative RAG
Jaewan Park
Solbee Cho
Jay-Yoon Lee
114
1
0
16 Oct 2025
Mapping Smarter, Not Harder: A Test-Time Reinforcement Learning Agent That Improves Without Labels or Model Updates
Wen-Kwang Tsao
Yao-Ching Yu
Chien-Ming Huang
76
0
0
16 Oct 2025
Stable but Miscalibrated: A Kantian View on Overconfidence from Filters to Large Language Models
Akira Okutomi
LRM
200
0
0
16 Oct 2025
Training LLM Agents to Empower Humans
Evan Ellis
Vivek Myers
Jens Tuyls
Sergey Levine
Anca Dragan
Benjamin Eysenbach
184
0
0
15 Oct 2025
EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems
Yufei He
Juncheng Liu
Yue Liu
Yibo Li
Tri Cao
Zhiyuan Hu
X. Xu
Bryan Hooi
TTA
VLM
301
1
0
15 Oct 2025
Static Sandboxes Are Inadequate: Modeling Societal Complexity Requires Open-Ended Co-Evolution in LLM-Based Multi-Agent Simulations
Jinkun Chen
Sher Badshah
Xuemin Yu
Sijia Han
262
0
0
15 Oct 2025
Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval
Yingchen Zhang
Ruqing Zhang
Jiafeng Guo
W. Peng
Sen Li
Fuyu Lv
LRM
187
0
0
15 Oct 2025
LLMs Can Get "Brain Rot"!
Shuo Xing
Junyuan Hong
Yifan Wang
Runjin Chen
Zhenyu Zhang
A. Grama
Zhengzhong Tu
Z. Wang
155
0
0
15 Oct 2025
Toward Reasoning-Centric Time-Series Analysis
Xinlei Wang
Mingtian Tan
Jing Qiu
Junhua Zhao
Jinjin Gu
AI4TS
143
0
0
14 Oct 2025
Deep Research Brings Deeper Harm
Shuo Chen
Zonggen Li
Zhen Han
Bailan He
Tong Liu
Haokun Chen
Georg Groh
Philip Torr
Volker Tresp
Jindong Gu
172
0
0
13 Oct 2025
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
Yuhang Li
Chenchen Zhang
Ruilin Lv
Ao Liu
K. Deng
Yuanxing Zhang
Jiaheng Liu
Wiggin Zhou
B. Zhou
LRM
106
3
0
13 Oct 2025
PaperArena: An Evaluation Benchmark for Tool-Augmented Agentic Reasoning on Scientific Literature
Daoyu Wang
Mingyue Cheng
Qi Liu
Shuo Yu
Zirui Liu
Ze Guo
LRM
264
3
0
13 Oct 2025
RAG-Pull: Imperceptible Attacks on RAG Systems for Code Generation
Vasilije Stambolic
Aritra Dhar
Lukas Cavigelli
AAML
SILM
219
0
0
13 Oct 2025
A Survey on Agentic Multimodal Large Language Models
Huanjin Yao
Ruifei Zhang
Jiaxing Huang
Jingyi Zhang
Yibo Wang
...
Ruolin Zhu
Yongcheng Jing
Shunyu Liu
Guanbin Li
Dacheng Tao
LM&Ro
AIFin
AI4TS
LRM
AI4CE
250
5
0
13 Oct 2025
D3MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems
Heng Zhang
Yuling Shi
Xiaodong Gu
Haochen You
Zijian Zhang
Lubin Gan
Yilei Yuan
Jin Huang
124
0
0
12 Oct 2025
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu
Benjamin Van Durme
Jacob Andreas
Harsh Jhamtani
LLMAG
106
0
0
11 Oct 2025
Failure-Driven Workflow Refinement
Jusheng Zhang
Kaitong Cai
Qinglin Zeng
Ningyuan Liu
Stephen Fan
Ziliang Chen
Keze Wang
115
12
0
11 Oct 2025
Audit-of-Understanding: Posterior-Constrained Inference for Mathematical Reasoning in Language Models
Samir Abdaljalil
Erchin Serpedin
K. Qaraqe
Hasan Kurban
RALM
LRM
187
0
0
11 Oct 2025
Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges
Christian Bluethgen
Dave Van Veen
Daniel Truhn
Jakob Nikolas Kather
Michael Moor
...
Akshay S. Chaudhari
Thomas Frauenfelder
C. Langlotz
Michael Krauthammer
Farhad Nooralahzadeh
LM&MA
AI4CE
285
0
0
10 Oct 2025
GRETEL: A Goal-driven Retrieval and Execution-based Trial Framework for LLM Tool Selection Enhancing
Zongze Wu
Yani Guo
Churong Liang
Runnan Li
76
0
0
10 Oct 2025
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo
Habtom Kahsay Gidey
Alexander Lenz
Alois Knoll
LLMAG
LM&Ro
204
2
0
10 Oct 2025
Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Xiao Yu
Baolin Peng
Michel Galley
Hao Cheng
Qianhui Wu
Janardhan Kulkarni
Suman Nath
Zhou Yu
Jianfeng Gao
LRM
AI4CE
116
0
0
10 Oct 2025
How can we assess human-agent interactions? Case studies in software agent design
Valerie Chen
Rohit Malhotra
Xingyao Wang
Juan Michelini
Xuhui Zhou
Aditya Bharat Soni
Hoang H. Tran
Calvin Smith
Ameet Talwalkar
Graham Neubig
178
0
0
10 Oct 2025
Gold Panning: Turning Positional Bias into Signal for Multi-Document LLM Reasoning
Adam Byerly
Daniel Khashabi
116
0
0
10 Oct 2025
Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Shiyuan Yin
Chenjia Bai
Z. Zhang
Junwei Jin
Xinxin Zhang
Chi Zhang
Xuelong Li
118
0
0
09 Oct 2025
Agent Learning via Early Experience
Kai Zhang
Xiangchao Chen
Bo Liu
Tianci Xue
Zeyi Liao
...
J. Zhu
Huan Sun
Jason Weston
Eric Fosler-Lussier
Y. Wu
OffRL
198
8
0
09 Oct 2025
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
Yunzhe Xu
Yiyuan Pan
Zhe Liu
LM&Ro
88
0
0
09 Oct 2025
FlowSearch: Advancing deep research with dynamic structured knowledge flow
Yusong Hu
Runmin Ma
Yue Fan
Jinxin Shi
Zongsheng Cao
...
Lei Bai
Bo Zhang
Wenlong Zhang
Lei Bai
Bo Zhang
AI4CE
150
1
0
09 Oct 2025
MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding
Siddeshwar Raghavan
Tanwi Mallick
AI4CE
136
0
0
09 Oct 2025
Training-Free Group Relative Policy Optimization
Yuzheng Cai
Siqi Cai
Yuchen Shi
Zihan Xu
Lichao Chen
...
Zongyi Li
Haojia Lin
Yong Mao
Ke Li
Xing Sun
OffRL
230
6
0
09 Oct 2025
Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning
Jialu Du
Guiyang Hou
Yihui Fu
Chen Wu
Wenqi Zhang
Yongliang Shen
Weiming Lu
LLMAG
LRM
175
0
0
09 Oct 2025
RA-Gen: A Controllable Code Generation Framework Using ReAct for Multi-Agent Task Execution
Aofan Liu
Haoxuan Li
Bin Wang
Ao Yang
Hui Li
LLMAG
95
1
0
09 Oct 2025