ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.11366
  4. Cited By
Reflexion: Language Agents with Verbal Reinforcement Learning
v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
    LLMAGKELM
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)Github

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,280 papers shown
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
Yangtian Zi
Zixuan Wu
Aleksander Boruch-Gruszecki
Jonathan Bell
Arjun Guha
204
2
0
30 Mar 2026
ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
Haoyu Wang
Chris M. Poskitt
Jun Sun
Jiali Wei
418
8
0
30 Mar 2026
Defending Against Knowledge Poisoning Attacks During Retrieval-Augmented Generation
Defending Against Knowledge Poisoning Attacks During Retrieval-Augmented Generation
Kennedy Edemacu
Vinay M. Shashidhar
Micheal Tuape
Dan Abudu
Beakcheol Jang
Jong Wook Kim
SILMAAMLKELM
216
5
0
30 Mar 2026
Compositional Image Synthesis with Inference-Time Scaling
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
MLLMEGVM
312
0
0
30 Mar 2026
RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design
RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design
Gyusam Chang
Jeongyoon Yoon
Shin han yi
JaeHyeok Lee
Sujin Jang
S. Kim
152
0
0
05 Dec 2025
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
Nex-AGI Team
Yuxuan Cai
Lu Chen
Qiaoling Chen
Yuyang Ding
...
Qin Chen
Liang He
Qi Zhang
Xuanjing Huang
Xipeng Qiu
LLMAG
208
8
0
04 Dec 2025
Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
Joey Hong
Kang Liu
Zhan Ling
Jiecao Chen
Sergey Levine
LLMAGOffRL
264
4
0
04 Dec 2025
Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Gianni Molinari
Fabio Ciravegna
90
0
0
03 Dec 2025
Evaluating Long-Context Reasoning in LLM-Based WebAgents
Evaluating Long-Context Reasoning in LLM-Based WebAgents
Andy Chung
Yichi Zhang
Kaixiang Lin
Aditya Rawal
Qiaozi Gao
Joyce Chai
LLMAGLRM
178
3
0
03 Dec 2025
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Zhongyu Yang
Yingfang Yuan
Xuanming Jiang
Baoyi An
Wei Pang
LLMAGHILMLRM
200
4
0
02 Dec 2025
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
Yuanhe Zhang
Weiliu Wang
Zhenhong Zhou
Kun Wang
Jie Zhang
Li Sun
Yang Liu
Sen Su
165
3
0
02 Dec 2025
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
Vishnu Sarukkai
Asanshay Gupta
James Hong
Michael Gharbi
Kayvon Fatahalian
108
0
0
02 Dec 2025
Process-Centric Analysis of Agentic Software Systems
Process-Centric Analysis of Agentic Software Systems
Shuyang Liu
Yang Chen
Rahul Krishna
Saurabh Sinha
Jatin Ganhotra
Reyhan Jabbarvand
101
4
0
02 Dec 2025
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAGMoE
142
0
0
02 Dec 2025
Self-Improving VLM Judges Without Human Annotations
Self-Improving VLM Judges Without Human Annotations
Inna Wanyin Lin
Yushi Hu
Shuyue Stella Li
Scott Geng
Pang Wei Koh
Luke Zettlemoyer
Tim Althoff
Marjan Ghazvininejad
VLMLRM
51
3
0
02 Dec 2025
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
Jack Lu
Ryan Teehan
Jinran Jin
Mengye Ren
LRM
186
4
0
02 Dec 2025
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
Shubhi Asthana
Bing Zhang
Chad DeLuca
Ruchi Mahindru
Hima Patel
87
1
0
01 Dec 2025
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Jifeng Li
Arnav Grover
Abraham Alpuerto
Yupeng Cao
Xiao-Yang Liu
AIFin
268
0
0
01 Dec 2025
The Art of Scaling Test-Time Compute for Large Language Models
The Art of Scaling Test-Time Compute for Large Language Models
Aradhye Agarwal
Ayan Sengupta
Tanmoy Chakraborty
LRM
400
6
0
01 Dec 2025
Transforming Monolithic Foundation Models into Embodied Multi-Agent Architectures for Human-Robot Collaboration
Transforming Monolithic Foundation Models into Embodied Multi-Agent Architectures for Human-Robot Collaboration
Nan Sun
Bo Mao
Yongchang Li
Chenxu Wang
Di Guo
Huaping Liu
LM&Ro
139
0
0
30 Nov 2025
Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
Jianzhe Lin
Zeyu Pan
Yun Zhu
Ruiqi Song
Jining Yang
LRM
164
0
0
28 Nov 2025
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Joongwon Chae
Runming Wang
Chen Xiong
Gong Yunhan
Lian Zhang
Ji Jiansong
Dongmei Yu
Peiwu Qin
135
0
0
28 Nov 2025
ThetaEvolve: Test-time Learning on Open Problems
ThetaEvolve: Test-time Learning on Open Problems
Y. Wang
Shao-Rong Su
Zhiyuan Zeng
Eva Xu
Liliang Ren
...
Pengcheng He
Weizhu Chen
Shuohang Wang
S. Du
Yelong Shen
368
13
0
28 Nov 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
255
0
0
28 Nov 2025
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Aayush Garg
Zanis Ali Khan
Renzo Degiovanni
Qiang Tang
AAML
177
0
0
28 Nov 2025
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
Aaron Steiner
Ralph Peeters
Christian Bizer
LLMAG
198
1
0
28 Nov 2025
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRMVLM
215
1
0
28 Nov 2025
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
Hongda Liu
Yunfan Liu
Changlu Wang
Yunlong Wang
Zhenan Sun
LLMAG
305
1
0
27 Nov 2025
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
Mohd Ariful Haque
Fahad Rahman
Kishor Datta Gupta
Khalil Shujaee
Roy George
LLMAG
204
1
0
27 Nov 2025
Real-Time Procedural Learning From Experience for AI Agents
Real-Time Procedural Learning From Experience for AI Agents
Dasheng Bi
Yubin Hu
Mohammed N. Nasir
107
0
0
27 Nov 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo
Shan Zhang
Yanpeng Sun
Jingjing Wu
Qunyi Xie
...
Wei He
Xiaofan Li
Na Zhao
Jingdong Wang
Z. Li
LRM
269
3
0
26 Nov 2025
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
Junjian Wang
Lidan Zhao
Xi Sheryl Zhang
252
0
0
26 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Synthesis
BRIDGE: Building Representations In Domain Guided Program Synthesis
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALMLRM
489
0
0
26 Nov 2025
Improving Language Agents through BREW
Improving Language Agents through BREW
Shashank Kirtania
Param Biyani
Priyanshu Gupta
Yasharth Bajpai
Roshni Iyer
Sumit Gulwani
Gustavo Soares
LLMAGOffRL
297
1
0
25 Nov 2025
Stabilizing Off-Policy Training for Long-Horizon LLM Agent via Turn-Level Importance Sampling and Clipping-Triggered Normalization
Stabilizing Off-Policy Training for Long-Horizon LLM Agent via Turn-Level Importance Sampling and Clipping-Triggered Normalization
Chenliang Li
Adel Elmahdy
Alex Boyd
Zhongruo Wang
Alfredo García
Parminder Bhatia
Taha A. Kass-Hout
Cao Xiao
Mingyi Hong
Mingyi Hong
OffRL
262
1
0
25 Nov 2025
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Tianxin Wei
Noveen Sachdeva
Benjamin Coleman
Zhankui He
Yuanchen Bei
...
C. Wang
Shuo Chen
Fernando Pereira
Wang-Cheng Kang
D. Cheng
LLMAG
251
35
0
25 Nov 2025
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
Hyeonjae Kim
Chenyue Li
Wen Deng
Mengxi Jin
Wen Huang
Mengqian Lu
Binhang Yuan
AI4CE
353
2
0
25 Nov 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
239
1
0
24 Nov 2025
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
Nuo Xu
Zhaoting Gong
Ran Ran
Jinwei Tang
Wujie Wen
Caiwen Ding
171
0
0
23 Nov 2025
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Svitlana Volkova
Peter Bautista
Avinash Hiriyanna
Gabriel Ganberg
Isabel Erickson
Zachary Klinefelter
Nick Abele
Hsien-Te Kao
Grant Engberson
159
1
0
23 Nov 2025
Reasoning With a Star: A Heliophysics Dataset and Benchmark for Agentic Scientific Reasoning
Reasoning With a Star: A Heliophysics Dataset and Benchmark for Agentic Scientific Reasoning
Kevin Lee
Russell Spiewak
James Walsh
LRM
131
0
0
23 Nov 2025
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
XiangRui Zhang
Zeyu Chen
Haining Wang
Qiang Li
142
0
0
23 Nov 2025
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Yunsheng Bai
Haoxing Ren
146
0
0
21 Nov 2025
A Benchmark for Procedural Memory Retrieval in Language Agents
A Benchmark for Procedural Memory Retrieval in Language Agents
Ishant Kohar
Aswanth Krishnan
81
1
0
21 Nov 2025
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Yinjie Zhao
Heng Zhao
Bihan Wen
Joey Tianyi Zhou
LRM
148
0
0
21 Nov 2025
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
257
1
0
20 Nov 2025
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
Huseein Jawad
Nicolas Brunel
AAML
207
0
0
20 Nov 2025
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
Shanlin Zhou
Xinpeng Wang
Jianxun Lian
Zhenghao Liu
L. Lakshmanan
Xiaoyuan Yi
Yongtao Hao
LLMAG
424
0
0
19 Nov 2025
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
262
0
0
19 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
197
23
0
18 Nov 2025
1234...242526
Next
Page 1 of 26
Pageof 26