ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.11366
  4. Cited By
Reflexion: Language Agents with Verbal Reinforcement Learning
v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
    LLMAGKELM
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,255 papers shown
Title
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Addison J. Wu
Ryan Liu
Xuechunzi Bai
Thomas Griffiths
140
0
0
24 Dec 2025
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
Vishnu Sarukkai
Asanshay Gupta
James Hong
Michael Gharbi
Kayvon Fatahalian
55
0
0
02 Dec 2025
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
Jack Lu
Ryan Teehan
Jinran Jin
Mengye Ren
LRM
108
0
0
02 Dec 2025
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Zhongyu Yang
Yingfang Yuan
Xuanming Jiang
Baoyi An
Wei Pang
LLMAGHILMLRM
96
0
0
02 Dec 2025
Process-Centric Analysis of Agentic Software Systems
Process-Centric Analysis of Agentic Software Systems
Shuyang Liu
Yang Chen
Rahul Krishna
Saurabh Sinha
Jatin Ganhotra
Reyhan Jabbarvand
28
0
0
02 Dec 2025
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAGMoE
80
0
0
02 Dec 2025
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
Shubhi Asthana
Bing Zhang
Chad DeLuca
Ruchi Mahindru
Hima Patel
20
0
0
01 Dec 2025
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Jifeng Li
Arnav Grover
Abraham Alpuerto
Yupeng Cao
Xiao-Yang Liu
AIFin
140
0
0
01 Dec 2025
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRMVLM
109
0
0
28 Nov 2025
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Joongwon Chae
Runming Wang
Chen Xiong
Gong Yunhan
Lian Zhang
Ji Jiansong
Dongmei Yu
Peiwu Qin
52
0
0
28 Nov 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
132
0
0
28 Nov 2025
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
Aaron Steiner
Ralph Peeters
Christian Bizer
LLMAG
113
0
0
28 Nov 2025
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
Mohd Ariful Haque
Fahad Rahman
Kishor Datta Gupta
Khalil Shujaee
Roy George
LLMAG
110
0
0
27 Nov 2025
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
Hongda Liu
Yunfan Liu
Changlu Wang
Yunlong Wang
Zhenan Sun
LLMAG
140
0
0
27 Nov 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo
Shan Zhang
Yanpeng Sun
Jingjing Wu
Qunyi Xie
...
Wei He
Xiaofan Li
Na Zhao
Jingdong Wang
Z. Li
LRM
186
0
0
26 Nov 2025
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
Junjian Wang
Lidan Zhao
Xi Sheryl Zhang
153
0
0
26 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Verification
BRIDGE: Building Representations In Domain Guided Program Verification
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALMLRM
373
0
0
26 Nov 2025
Improving Language Agents through BREW
Improving Language Agents through BREW
Shashank Kirtania
Param Biyani
Priyanshu Gupta
Yasharth Bajpai
Roshni Iyer
Sumit Gulwani
Gustavo Soares
LLMAGOffRL
242
0
0
25 Nov 2025
ST-PPO: Stabilized Off-Policy Proximal Policy Optimization for Multi-Turn Agents Training
ST-PPO: Stabilized Off-Policy Proximal Policy Optimization for Multi-Turn Agents Training
Chenliang Li
Adel Elmahdy
Alex Boyd
Zhongruo Wang
Alfredo García
Parminder Bhatia
Taha A. Kass-Hout
Cao Xiao
Mingyi Hong
OffRL
143
0
0
25 Nov 2025
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Tianxin Wei
Noveen Sachdeva
Benjamin Coleman
Zhankui He
Yuanchen Bei
...
C. Wang
Shuo Chen
Fernando Pereira
Wang-Cheng Kang
D. Cheng
LLMAG
207
1
0
25 Nov 2025
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
Hyeonjae Kim
Chenyue Li
Wen Deng
Mengxi Jin
Wen Huang
Mengqian Lu
Binhang Yuan
AI4CE
283
0
0
25 Nov 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
165
0
0
24 Nov 2025
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
Nuo Xu
Zhaoting Gong
Ran Ran
Jinwei Tang
Wujie Wen
Caiwen Ding
96
0
0
23 Nov 2025
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Svitlana Volkova
Peter Bautista
Avinash Hiriyanna
Gabriel Ganberg
Isabel Erickson
Zachary Klinefelter
Nick Abele
Hsien-Te Kao
Grant Engberson
92
0
0
23 Nov 2025
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
XiangRui Zhang
Zeyu Chen
Haining Wang
Qiang Li
96
0
0
23 Nov 2025
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Yunsheng Bai
Haoxing Ren
92
0
0
21 Nov 2025
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Yinjie Zhao
Heng Zhao
Bihan Wen
Joey Tianyi Zhou
LRM
72
0
0
21 Nov 2025
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
183
0
0
20 Nov 2025
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
Huseein Jawad
Nicolas Brunel
AAML
132
0
0
20 Nov 2025
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
Shanlin Zhou
Xinpeng Wang
Jianxun Lian
Zhenghao Liu
L. Lakshmanan
Xiaoyuan Yi
Yongtao Hao
LLMAG
326
0
0
19 Nov 2025
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
204
0
0
19 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
116
3
0
18 Nov 2025
Beyond Accuracy: A Multi-Dimensional Framework for Evaluating Enterprise Agentic AI Systems
Beyond Accuracy: A Multi-Dimensional Framework for Evaluating Enterprise Agentic AI Systems
Sushant Mehta
ELM
136
0
0
18 Nov 2025
AutoTool: Efficient Tool Selection for Large Language Model Agents
AutoTool: Efficient Tool Selection for Large Language Model Agents
Jingyi Jia
Qinbin Li
LLMAG
132
0
0
18 Nov 2025
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
Chao Yu
Qixin Tan
Jiaxuan Gao
Shi Yu
Hong Lu
Xinting Yang
Zelai Xu
Yu Wang
Yi Wu
Eugene Vinitsky
LRM
116
0
0
18 Nov 2025
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
Przemyslaw Chojecki
ELM
80
3
0
17 Nov 2025
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Haoyang Hong
Jiajun Yin
Y. Wang
Jingnan Liu
Zhe Chen
...
M. Yang
Chunxiao Guo
Junwei Liu
Peng Wei
Jinjie Gu
103
0
0
17 Nov 2025
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
Wenxin Zhu
Andong Chen
Yuchen Song
Kehai Chen
Conghui Zhu
Ziyan Chen
Tiejun Zhao
LRM
433
0
0
17 Nov 2025
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
Genglin Liu
Shijie Geng
Sha Li
Hejie Cui
Sarah Zhang
Xin Liu
Tianyi Liu
CLL
538
0
0
17 Nov 2025
Generative Caching for Structurally Similar Prompts and Responses
Generative Caching for Structurally Similar Prompts and Responses
Sarthak Chakraborty
Suman Nath
Xuchao Zhang
Chetan Bansal
Indranil Gupta
142
1
0
14 Nov 2025
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Yunzhe Xu
Zhuosheng Zhang
Zhe Liu
157
0
0
13 Nov 2025
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
Zhiheng Xi
Chenyang Liao
Guanyu Li
Y. Yang
Wenxiang Chen
...
Wei Wu
Tao Ji
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
108
0
0
11 Nov 2025
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Si-Hyun Kim
Heon Kwak
Byoung-Hee Kwon
Seong-Whan Lee
156
0
0
11 Nov 2025
Analyzing Political Text at Scale with Online Tensor LDA
Analyzing Political Text at Scale with Online Tensor LDA
Sara Kangaslahti
Danny Ebanks
Jean Kossaifi
Anqi Liu
R. Alvarez
A. Anandkumar
96
0
0
11 Nov 2025
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Songze Li
Zhiqiang Liu
Zhaoyan Gong
Xiaoke Guo
Zhengke Gui
H. Chen
Wen Zhang
LRM
222
0
0
11 Nov 2025
Procedural Knowledge Improves Agentic LLM Workflows
Procedural Knowledge Improves Agentic LLM Workflows
Vincent Hsiao
Mark Roberts
Leslie Smith
AIFin
387
0
0
10 Nov 2025
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLMLRM
232
0
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
137
2
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLLLLMAGLRM
254
3
0
09 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
336
0
0
08 Nov 2025
1234...242526
Next