Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG, KELM

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,260 papers shown
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLM, LRM
244
0
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
141
2
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLL, LLMAG, LRM
258
3
0
09 Nov 2025
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Hiroaki Hayashi
Bo Pang
Wenting Zhao
Ye Liu
Akash Gokul
Srijan Bansal
Caiming Xiong
Semih Yavuz
Yingbo Zhou
LLMAG, LM&Ro, LRM
292
0
0
08 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
344
0
0
08 Nov 2025
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
David Acuna
Chao-Han Huck Yang
Yuntian Deng
Jaehun Jung
Ximing Lu
Prithviraj Ammanabrolu
Hyunwoo J. Kim
Yuan-Hong Liao
Yejin Choi
ReLM, OffRL, LRM
327
1
0
07 Nov 2025
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
S. Kim
S. Hong
Hojung Jung
Youngrok Park
Se-Young Yun
DiffM, VLM
116
0
0
07 Nov 2025
Collaborative Agents for Automated Program Repair in Ruby
Nikta Akbarpour
Mahdieh Sadat Benis
Fatemeh H. Fard
Ali Ouni
Mohamed Aymen Saied
KELM
273
0
0
06 Nov 2025
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
Khouloud Oueslati
Maxime Lamothe
Foutse Khomh
LLMAG
144
0
0
05 Nov 2025
Secure Code Generation at Scale with Reflexion
Arup Datta
Ahmed Aljohani
Hyunsook Do
ELM
116
0
0
05 Nov 2025
Leveraging LLM-based agents for social science research: insights from citation network simulations
Jiarui Ji
Runlin Lei
X. Pan
Zhewei Wei
Hao Sun
...
X. Chen
Yongzheng Yang
Yaliang Li
Bolin Ding
Ji-Rong Wen
LLMAG
266
0
0
05 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
91
0
0
03 Nov 2025
Continual Learning, Not Training: Online Adaptation For Agents
Aman Jaglan
Jarrod Barnes
CLL
168
0
0
02 Nov 2025
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
Ruofan Liu
Yun Lin
Zhiyong Huang
Jin Song Dong
AAML, SILM
342
0
0
01 Nov 2025
A CPU-Centric Perspective on Agentic AI
Ritik Raj
Hong Wang
Tushar Krishna
217
0
0
01 Nov 2025
CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions
Lingyue Fu
Xin Ding
Yaoming Zhu
Shao Zhang
Lin Qiu
...
W. Zhang
Xuezhi Cao
Xunliang Cai
Jiaxin Ding
Yong Yu
LLMAG, ELM
203
0
0
30 Oct 2025
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun Ouyang
Haoyu Wang
Dong Fang
LLMAG, AI4CE
174
0
0
29 Oct 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Durga Prasad Maram
Dhruvin Gandhi
Z. Yao
Gayathri Akkinapalli
Franck Dernoncourt
Yu Wang
Ryan Rossi
Nesreen K. Ahmed
124
0
0
28 Oct 2025
Evidence-Bound Autonomous Research (EviBound): A Governance Framework for Eliminating False Claims
Ruiying Chen
72
0
0
28 Oct 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
...
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL, LRM
166
0
0
28 Oct 2025
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffM, MLLM, EGVM, VLM
246
0
0
28 Oct 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
Xuanzhong Chen
Zile Qiao
Guoxin Chen
L. Su
Zhen Zhang
Xinyu Wang
Pengjun Xie
Fei Huang
Jingren Zhou
Yong Jiang
LLMAG, ELM
141
2
0
28 Oct 2025
From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems
Yu Luo
Jiamin Jiang
J. Feng
Lei Tao
Qingliang Zhang
Xidao Wen
Yongqian Sun
Shenglin Zhang
Jielong Huang
157
0
0
28 Oct 2025
Language Server CLI Empowers Language Agents with Process Rewards
Yifan Zhang
Lanser Contributors
44
0
0
27 Oct 2025
COOPERA: Continual Open-Ended Human-Robot Assistance
Chenyang Ma
Kai Lu
Ruta Desai
Xavier Puig
Andrew Markham
Niki Trigoni
128
1
0
27 Oct 2025
SwiftSolve: A Self-Iterative, Complexity-Aware Multi-Agent Framework for Competitive Programming
Adhyayan Veer Singh
Aaron Shen
Brian Law
Ahmed Ismail
Jonas Rohweder
Sean O'Brien
Kevin Zhu
LRM
81
0
0
26 Oct 2025
Agentic Meta-Orchestrator for Multi-task Copilots
Xiaofeng Zhu
Yunshen Zhou
LLMAG
257
0
0
26 Oct 2025
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen
Xinni Zhang
Yifei Zhang
Yangning Li
Henry Peng Zou
Chunyu Miao
Weizhi Zhang
Xue Liu
Philip S. Yu
129
1
0
25 Oct 2025
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
Hongwei Zhang
Ji Lu
Shiqing Jiang
Chenxiang Zhu
Li Xie
...
Baoyu Tang
Lingjun Huang
Baoli Wang
Fang Tan
Peng Zou
LRM
170
1
0
24 Oct 2025
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li
Josep Lopez Camunas
Jake Thomas Touchet
Jacob Andreas
Àgata Lapedriza
Antonio Torralba
Tamar Rott Shaham
183
0
0
24 Oct 2025
LLM-AR: LLM-powered Automated Reasoning Framework
Rick Chen
Joseph Ternasky
Aaron Ontoyin Yin
Xianling Mu
Fuat Alican
Yigit Ihlamur
LRM
52
0
0
24 Oct 2025
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models
Xueyuan Lin
Cehao Yang
Ye Ma
Ming Li
Rongjunchen Zhang
Yang Ni
Xiaojun Wu
Chengjin Xu
Jian Guo
Hui Xiong
AIFin, LRM
170
0
0
24 Oct 2025
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
Ravindra Aribowo Tarunokusumo
Rafael Fernandes Cunha
OffRL, ReLM, LRM
128
0
0
24 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&Ro, AI4CE
148
0
0
23 Oct 2025
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
Zhenning Yang
Hui Guan
Victor Nicolet
Brandon Paulsen
Joey Dodds
Daniel Kroening
Ang Chen
96
0
0
23 Oct 2025
Code-enabled language models can outperform reasoning models on diverse tasks
Cedegao E. Zhang
Cédric Colas
Gabriel Poesia
Joshua B. Tenenbaum
Jacob Andreas
ReLM, ALM, LRM, AI4CE
164
0
0
23 Oct 2025
NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning
Wonje Choi
Jooyoung Kim
Honguk Woo
LRM
124
0
0
22 Oct 2025
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
Yimeng Zhang
Jiri Gesi
Ran Xue
Tian Wang
Ziyi Wang
...
Qingjun Cui
Yufan Guo
Jing Huang
Mubarak Shah
Dakuo Wang
OffRL
156
0
0
22 Oct 2025
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Gil Pasternak
Dheeraj Rajagopal
Julia White
Dhruv Atreja
Matthew Thomas
George Hurn-Maloney
Ash Lewis
LLMAG
171
0
0
22 Oct 2025
Teaming LLMs to Detect and Mitigate Hallucinations
Demian Till
John Smeaton
Peter Haubrick
Gouse Saheb
Florian Graef
David Berman
HILM
314
0
0
22 Oct 2025
StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking
Haoran Zhang
C. Zhu
Sicong Guo
Hanzhe Guo
Haiming Li
Donglin Yu
122
0
0
21 Oct 2025
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng
Mian Deng
Chenjing Liang
Zeming Gao
Chennan Ma
Chenxing Lin
Haipeng Zhang
Songzhu Mei
Cheng-Yu Wang
Siqi Shen
141
0
0
21 Oct 2025
Illusions of reflection: open-ended task reveals systematic failures in Large Language Models' reflective reasoning
Sion Weatherhead
Flora D. Salim
Aaron Belbasis
ReLM, LRM, ELM
186
0
0
21 Oct 2025
AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library
Minwei Kong
Ao Qu
Xiaotong Guo
Wenbin Ouyang
Chonghe Jiang
...
Hai Wang
Cathy Wu
Jinhua Zhao
96
0
0
21 Oct 2025
Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs
Paula Cordero-Encinar
Andrew Duncan
LRM
193
1
0
20 Oct 2025
OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning
Zhenyu Bi
Meng Lu
Yang Li
Swastik Roy
Weijie Guan
Morteza Ziyadi
Xuan Wang
LLMAG, LRM
130
1
0
20 Oct 2025
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Akshara Prabhakar
Roshan Ram
Zixiang Chen
Silvio Savarese
Frank Wang
Caiming Xiong
Huan Wang
Weiran Yao
154
0
0
20 Oct 2025
Reasoning Distillation and Structural Alignment for Improved Code Generation
Amir Jalilifard
Anderson de Rezende Rocha
Marcos Medeiros Raimundo
OffRL, LRM
108
0
0
20 Oct 2025
DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data
Aymane Hassini
80
0
0
20 Oct 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
142
0
0
20 Oct 2025