ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.11366
  4. Cited By
Reflexion: Language Agents with Verbal Reinforcement Learning
v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
    LLMAGKELM
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,278 papers shown
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
Przemyslaw Chojecki
ELM
118
3
0
17 Nov 2025
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
Genglin Liu
Shijie Geng
Sha Li
Hejie Cui
Sarah Zhang
Xin Liu
Tianyi Liu
CLL
729
1
0
17 Nov 2025
Generative Caching for Structurally Similar Prompts and Responses
Generative Caching for Structurally Similar Prompts and Responses
Sarthak Chakraborty
Suman Nath
Xuchao Zhang
Chetan Bansal
Indranil Gupta
184
1
0
14 Nov 2025
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Yunzhe Xu
Zhuosheng Zhang
Zhe Liu
183
0
0
13 Nov 2025
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
Zhiheng Xi
Chenyang Liao
Guanyu Li
Y. Yang
Wenxiang Chen
...
Wei Wu
Tao Ji
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
166
6
0
11 Nov 2025
Analyzing Political Text at Scale with Online Tensor LDA
Analyzing Political Text at Scale with Online Tensor LDA
Sara Kangaslahti
Danny Ebanks
Jean Kossaifi
Anqi Liu
R. Alvarez
A. Anandkumar
113
2
0
11 Nov 2025
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Songze Li
Zhiqiang Liu
Zhaoyan Gong
Xiaoke Guo
Zhengke Gui
H. Chen
Wen Zhang
LRM
247
1
0
11 Nov 2025
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Si-Hyun Kim
Heon Kwak
Byoung-Hee Kwon
Seong-Whan Lee
179
0
0
11 Nov 2025
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLMLRM
255
1
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
191
5
0
10 Nov 2025
Procedural Knowledge Improves Agentic LLM Workflows
Procedural Knowledge Improves Agentic LLM Workflows
Vincent Hsiao
Mark Roberts
Leslie Smith
AIFin
465
0
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLLLLMAGLRM
320
15
0
09 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
416
0
0
08 Nov 2025
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Hiroaki Hayashi
Bo Pang
Wenting Zhao
Ye Liu
Akash Gokul
Srijan Bansal
Caiming Xiong
Semih Yavuz
Yingbo Zhou
LLMAGLM&RoLRM
338
1
0
08 Nov 2025
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Addison J. Wu
Ryan Liu
Xuechunzi Bai
Thomas Griffiths
220
0
0
08 Nov 2025
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
S. Kim
S. Hong
Hojung Jung
Youngrok Park
Se-Young Yun
DiffMVLM
150
8
0
07 Nov 2025
Long Grounded Thoughts: Synthesizing Visual Problems and Reasoning Chains at Scale
Long Grounded Thoughts: Synthesizing Visual Problems and Reasoning Chains at ScaleAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
David Acuna
Chao-Han Huck Yang
Yuntian Deng
Jaehun Jung
Ximing Lu
Prithviraj Ammanabrolu
Hyunwoo J. Kim
Yuan-Hong Liao
Yejin Choi
ReLMOffRLLRM
375
2
0
07 Nov 2025
Collaborative Agents for Automated Program Repair in Ruby
Collaborative Agents for Automated Program Repair in Ruby
Nikta Akbarpour
Mahdieh Sadat Benis
Fatemeh H. Fard
Ali Ouni
Mohamed Aymen Saied
KELM
289
0
0
06 Nov 2025
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
Khouloud Oueslati
Maxime Lamothe
Foutse Khomh
LLMAG
170
1
0
05 Nov 2025
Secure Code Generation at Scale with Reflexion
Secure Code Generation at Scale with Reflexion
Arup Datta
Ahmed Aljohani
Hyunsook Do
ELM
145
0
0
05 Nov 2025
Leveraging LLM-based agents for social science research: insights from citation network simulations
Leveraging LLM-based agents for social science research: insights from citation network simulations
Jiarui Ji
Runlin Lei
X. Pan
Zhewei Wei
Hao Sun
...
X. Chen
Yongzheng Yang
Yaliang Li
Bolin Ding
Ji-Rong Wen
LLMAG
305
0
0
05 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
122
0
0
03 Nov 2025
Continual Learning, Not Training: Online Adaptation For Agents
Continual Learning, Not Training: Online Adaptation For Agents
Aman Jaglan
Jarrod Barnes
CLL
212
1
0
02 Nov 2025
A CPU-Centric Perspective on Agentic AI
A CPU-Centric Perspective on Agentic AI
Ritik Raj
Hong Wang
Tushar Krishna
412
0
0
01 Nov 2025
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
Ruofan Liu
Yun Lin
Zhiyong Huang
Jin Song Dong
AAMLSILM
425
0
0
01 Nov 2025
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
Lingyue Fu
Xin Ding
Yaoming Zhu
Shao Zhang
Lin Qiu
...
Xuezhi Cao
Xunliang Cai
Jiaxin Ding
Yong Yu
Yong Yu
LLMAGELM
256
0
0
30 Oct 2025
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun ouyang
Haoyu Wang
Dong Fang
LLMAGAI4CE
210
0
0
29 Oct 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Durga Prasad Maram
Dhruvin Gandhi
Z. Yao
Gayathri Akkinapalli
Franck Dernoncourt
Yu Wang
Ryan Rossi
Nesreen K. Ahmed
169
0
0
28 Oct 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
...
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRLLRM
195
1
0
28 Oct 2025
Verifying Large Language Models' Reasoning Paths via Correlation Matrix Rank
Verifying Large Language Models' Reasoning Paths via Correlation Matrix Rank
Jiayu Liu
Wei Dai
Zhenya Huang
Ning Miao
Enhong Chen
LRM
97
2
0
28 Oct 2025
From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems
From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems
Yu Luo
Jiamin Jiang
J. Feng
Lei Tao
Qingliang Zhang
Xidao Wen
Yongqian Sun
Shenglin Zhang
Jielong Huang
210
0
0
28 Oct 2025
Compositional Image Synthesis with Inference-Time Scaling
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffMMLLMEGVMVLM
286
0
0
28 Oct 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
Xuanzhong Chen
Zile Qiao
Guoxin Chen
L. Su
Zhen Zhang
Xinyu Wang
Pengjun Xie
Fei Huang
Jingren Zhou
Yong Jiang
LLMAGELM
183
6
0
28 Oct 2025
Evidence-Bound Autonomous Research (EviBound): A Governance Framework for Eliminating False Claims
Evidence-Bound Autonomous Research (EviBound): A Governance Framework for Eliminating False Claims
Ruiying Chen
110
0
0
28 Oct 2025
Language Server CLI Empowers Language Agents with Process Rewards
Language Server CLI Empowers Language Agents with Process Rewards
Yifan Zhang
Lanser Contributors
75
0
0
27 Oct 2025
COOPERA: Continual Open-Ended Human-Robot Assistance
COOPERA: Continual Open-Ended Human-Robot Assistance
Chenyang Ma
Kai Lu
Ruta Desai
Xavier Puig
Andrew Markham
Niki Trigoni
164
3
0
27 Oct 2025
Agentic Meta-Orchestrator for Multi-task Copilots
Agentic Meta-Orchestrator for Multi-task Copilots
Xiaofeng Zhu
Yunshen Zhou
LLMAG
301
0
0
26 Oct 2025
SwiftSolve: A Self-Iterative, Complexity-Aware Multi-Agent Framework for Competitive Programming
SwiftSolve: A Self-Iterative, Complexity-Aware Multi-Agent Framework for Competitive Programming
Adhyayan Veer Singh
Aaron Shen
Brian Law
Ahmed Ismail
Jonas Rohweder
Sean O'Brien
Kevin Zhu
LRM
96
0
0
26 Oct 2025
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen
Xinni Zhang
Yifei Zhang
Yangning Li
Henry Peng Zou
Chunyu Miao
Weizhi Zhang
Xue Liu
Philip S. Yu
173
1
0
25 Oct 2025
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models
Xueyuan Lin
Cehao Yang
Ye Ma
Ming Li
Rongjunchen Zhang
Yang Ni
Xiaojun Wu
Chengjin Xu
Jian Guo
Hui Xiong
AIFinLRM
194
0
0
24 Oct 2025
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
Hongwei Zhang
Ji Lu
Shiqing Jiang
Chenxiang Zhu
Li Xie
...
Baoyu Tang
Lingjun Huang
Baoli Wang
Fang Tan
Peng Zou
LRM
180
6
0
24 Oct 2025
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li
Josep Lopez Camunas
Jake Thomas Touchet
Jacob Andreas
Àgata Lapedriza
Antonio Torralba
Tamar Rott Shaham
212
1
0
24 Oct 2025
LLM-AR: LLM-powered Automated Reasoning Framework
LLM-AR: LLM-powered Automated Reasoning Framework
Rick Chen
Joseph Ternasky
Aaron Ontoyin Yin
Xianling Mu
Fuat Alican
Yigit Ihlamur
LRM
85
0
0
24 Oct 2025
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
Ravindra Aribowo Tarunokusumo
Rafael Fernandes Cunha
OffRLReLMLRM
162
0
0
24 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&RoAI4CE
181
0
0
23 Oct 2025
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
Zhenning Yang
Hui Guan
Victor Nicolet
Brandon Paulsen
Joey Dodds
Daniel Kroening
Ang Chen
132
0
0
23 Oct 2025
Code-enabled language models can outperform reasoning models on diverse tasks
Code-enabled language models can outperform reasoning models on diverse tasks
Cedegao E. Zhang
Cédric Colas
Gabriel Poesia
Joshua B. Tenenbaum
Jacob Andreas
ReLMALMLRMAI4CE
217
0
0
23 Oct 2025
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Gil Pasternak
Dheeraj Rajagopal
Julia White
Dhruv Atreja
Matthew Thomas
George Hurn-Maloney
Ash Lewis
LLMAG
180
1
0
22 Oct 2025
NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning
NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning
Wonje Choi
Jooyoung Kim
Honguk Woo
LRM
150
0
0
22 Oct 2025
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
Yimeng Zhang
Jiri Gesi
Ran Xue
Tian Wang
Ziyi Wang
...
Qingjun Cui
Yufan Guo
Jing Huang
Mubarak Shah
Dakuo Wang
OffRL
171
1
0
22 Oct 2025
Previous
12345...242526
Next
Page 2 of 26
Pageof 26