arXiv:2303.11366
Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG, KELM

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,271 papers shown
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Songze Li
Zhiqiang Liu
Zhaoyan Gong
Xiaoke Guo
Zhengke Gui
H. Chen
Wen Zhang
LRM
241
1
0
11 Nov 2025
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
Zhiheng Xi
Chenyang Liao
Guanyu Li
Y. Yang
Wenxiang Chen
...
Wei Wu
Tao Ji
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
140
1
0
11 Nov 2025
Analyzing Political Text at Scale with Online Tensor LDA
Sara Kangaslahti
Danny Ebanks
Jean Kossaifi
Anqi Liu
R. Alvarez
A. Anandkumar
107
0
0
11 Nov 2025
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Si-Hyun Kim
Heon Kwak
Byoung-Hee Kwon
Seong-Whan Lee
173
0
0
11 Nov 2025
Procedural Knowledge Improves Agentic LLM Workflows
Vincent Hsiao
Mark Roberts
Leslie Smith
AIFin
445
0
0
10 Nov 2025
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLM, LRM
245
1
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
171
4
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
CLL, LLMAG, LRM
301
7
0
09 Nov 2025
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Addison J. Wu
Ryan Liu
Xuechunzi Bai
Thomas Griffiths
196
0
0
08 Nov 2025
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Hiroaki Hayashi
Bo Pang
Wenting Zhao
Ye Liu
Akash Gokul
Srijan Bansal
Caiming Xiong
Semih Yavuz
Yingbo Zhou
LLMAG, LM&Ro, LRM
327
0
0
08 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
391
0
0
08 Nov 2025
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
David Acuna
Chao-Han Huck Yang
Yuntian Deng
Jaehun Jung
Ximing Lu
Prithviraj Ammanabrolu
Hyunwoo J. Kim
Yuan-Hong Liao
Yejin Choi
ReLM, OffRL, LRM
344
1
0
07 Nov 2025
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
S. Kim
S. Hong
Hojung Jung
Youngrok Park
Se-Young Yun
DiffM, VLM
139
0
0
07 Nov 2025
Collaborative Agents for Automated Program Repair in Ruby
Nikta Akbarpour
Mahdieh Sadat Benis
Fatemeh H. Fard
Ali Ouni
Mohamed Aymen Saied
KELM
286
0
0
06 Nov 2025
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
Khouloud Oueslati
Maxime Lamothe
Foutse Khomh
LLMAG
155
0
0
05 Nov 2025
Leveraging LLM-based agents for social science research: insights from citation network simulations
Jiarui Ji
Runlin Lei
X. Pan
Zhewei Wei
Hao Sun
...
X. Chen
Yongzheng Yang
Yaliang Li
Bolin Ding
Ji-Rong Wen
LLMAG
298
0
0
05 Nov 2025
Secure Code Generation at Scale with Reflexion
Arup Datta
Ahmed Aljohani
Hyunsook Do
ELM
132
0
0
05 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
119
0
0
03 Nov 2025
Continual Learning, Not Training: Online Adaptation For Agents
Aman Jaglan
Jarrod Barnes
CLL
194
0
0
02 Nov 2025
A CPU-Centric Perspective on Agentic AI
Ritik Raj
Hong Wang
Tushar Krishna
326
0
0
01 Nov 2025
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
Ruofan Liu
Yun Lin
Zhiyong Huang
Jin Song Dong
AAML, SILM
396
0
0
01 Nov 2025
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
Lingyue Fu
Xin Ding
Yaoming Zhu
Shao Zhang
Lin Qiu
...
Xuezhi Cao
Xunliang Cai
Jiaxin Ding
Yong Yu
LLMAG, ELM
228
0
0
30 Oct 2025
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun Ouyang
Haoyu Wang
Dong Fang
LLMAG, AI4CE
200
0
0
29 Oct 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
...
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL, LRM
172
0
0
28 Oct 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
Xuanzhong Chen
Zile Qiao
Guoxin Chen
L. Su
Zhen Zhang
Xinyu Wang
Pengjun Xie
Fei Huang
Jingren Zhou
Yong Jiang
LLMAG, ELM
174
5
0
28 Oct 2025
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
DiffM, MLLM, EGVM, VLM
265
0
0
28 Oct 2025
From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems
Yu Luo
Jiamin Jiang
J. Feng
Lei Tao
Qingliang Zhang
Xidao Wen
Yongqian Sun
Shenglin Zhang
Jielong Huang
185
0
0
28 Oct 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Durga Prasad Maram
Dhruvin Gandhi
Z. Yao
Gayathri Akkinapalli
Franck Dernoncourt
Yu Wang
Ryan Rossi
Nesreen K. Ahmed
151
0
0
28 Oct 2025
Evidence-Bound Autonomous Research (EviBound): A Governance Framework for Eliminating False Claims
Ruiying Chen
99
0
0
28 Oct 2025
Language Server CLI Empowers Language Agents with Process Rewards
Yifan Zhang
Lanser Contributors
73
0
0
27 Oct 2025
COOPERA: Continual Open-Ended Human-Robot Assistance
Chenyang Ma
Kai Lu
Ruta Desai
Xavier Puig
Andrew Markham
Niki Trigoni
151
2
0
27 Oct 2025
Agentic Meta-Orchestrator for Multi-task Copilots
Xiaofeng Zhu
Yunshen Zhou
LLMAG
280
0
0
26 Oct 2025
SwiftSolve: A Self-Iterative, Complexity-Aware Multi-Agent Framework for Competitive Programming
Adhyayan Veer Singh
Aaron Shen
Brian Law
Ahmed Ismail
Jonas Rohweder
Sean O'Brien
Kevin Zhu
LRM
90
0
0
26 Oct 2025
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen
Xinni Zhang
Yifei Zhang
Yangning Li
Henry Peng Zou
Chunyu Miao
Weizhi Zhang
Xue Liu
Philip S. Yu
161
1
0
25 Oct 2025
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
Ravindra Aribowo Tarunokusumo
Rafael Fernandes Cunha
OffRL, ReLM, LRM
142
0
0
24 Oct 2025
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li
Josep Lopez Camunas
Jake Thomas Touchet
Jacob Andreas
Àgata Lapedriza
Antonio Torralba
Tamar Rott Shaham
197
0
0
24 Oct 2025
LLM-AR: LLM-powered Automated Reasoning Framework
Rick Chen
Joseph Ternasky
Aaron Ontoyin Yin
Xianling Mu
Fuat Alican
Yigit Ihlamur
LRM
65
0
0
24 Oct 2025
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
Hongwei Zhang
Ji Lu
Shiqing Jiang
Chenxiang Zhu
Li Xie
...
Baoyu Tang
Lingjun Huang
Baoli Wang
Fang Tan
Peng Zou
LRM
175
1
0
24 Oct 2025
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models
Xueyuan Lin
Cehao Yang
Ye Ma
Ming Li
Rongjunchen Zhang
Yang Ni
Xiaojun Wu
Chengjin Xu
Jian Guo
Hui Xiong
AIFin, LRM
184
0
0
24 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&Ro, AI4CE
171
0
0
23 Oct 2025
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
Zhenning Yang
Hui Guan
Victor Nicolet
Brandon Paulsen
Joey Dodds
Daniel Kroening
Ang Chen
120
0
0
23 Oct 2025
Code-enabled language models can outperform reasoning models on diverse tasks
Cedegao E. Zhang
Cédric Colas
Gabriel Poesia
Joshua B. Tenenbaum
Jacob Andreas
ReLM, ALM, LRM, AI4CE
198
0
0
23 Oct 2025
Teaming LLMs to Detect and Mitigate Hallucinations
Demian Till
John Smeaton
Peter Haubrick
Gouse Saheb
Florian Graef
David Berman
HILM
340
0
0
22 Oct 2025
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Gil Pasternak
Dheeraj Rajagopal
Julia White
Dhruv Atreja
Matthew Thomas
George Hurn-Maloney
Ash Lewis
LLMAG
176
0
0
22 Oct 2025
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
Yimeng Zhang
Jiri Gesi
Ran Xue
Tian Wang
Ziyi Wang
...
Qingjun Cui
Yufan Guo
Jing Huang
Mubarak Shah
Dakuo Wang
OffRL
168
0
0
22 Oct 2025
NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning
Wonje Choi
Jooyoung Kim
Honguk Woo
LRM
130
0
0
22 Oct 2025
AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library
Minwei Kong
Ao Qu
Xiaotong Guo
Wenbin Ouyang
Chonghe Jiang
...
Hai Wang
Cathy Wu
Jinhua Zhao
142
0
0
21 Oct 2025
StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking
Haoran Zhang
C. Zhu
Sicong Guo
Hanzhe Guo
Haiming Li
Donglin Yu
135
0
0
21 Oct 2025
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng
Mian Deng
Chenjing Liang
Zeming Gao
Chennan Ma
Chenxing Lin
Haipeng Zhang
Songzhu Mei
Cheng-Yu Wang
Siqi Shen
161
0
0
21 Oct 2025
Illusions of reflection: open-ended task reveals systematic failures in Large Language Models' reflective reasoning
Sion Weatherhead
Flora D. Salim
Aaron Belbasis
ReLM, LRM, ELM
200
0
0
21 Oct 2025