Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.11366
Cited By
v1
v2
v3
v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Github
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,280 papers shown
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
Yangtian Zi
Zixuan Wu
Aleksander Boruch-Gruszecki
Jonathan Bell
Arjun Guha
204
2
0
30 Mar 2026
ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
Haoyu Wang
Chris M. Poskitt
Jun Sun
Jiali Wei
418
8
0
30 Mar 2026
Defending Against Knowledge Poisoning Attacks During Retrieval-Augmented Generation
Kennedy Edemacu
Vinay M. Shashidhar
Micheal Tuape
Dan Abudu
Beakcheol Jang
Jong Wook Kim
SILM
AAML
KELM
216
5
0
30 Mar 2026
Compositional Image Synthesis with Inference-Time Scaling
Minsuk Ji
Sanghyeok Lee
Namhyuk Ahn
MLLM
EGVM
312
0
0
30 Mar 2026
RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design
Gyusam Chang
Jeongyoon Yoon
Shin han yi
JaeHyeok Lee
Sujin Jang
S. Kim
152
0
0
05 Dec 2025
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
Nex-AGI Team
Yuxuan Cai
Lu Chen
Qiaoling Chen
Yuyang Ding
...
Qin Chen
Liang He
Qi Zhang
Xuanjing Huang
Xipeng Qiu
LLMAG
208
8
0
04 Dec 2025
Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
Joey Hong
Kang Liu
Zhan Ling
Jiecao Chen
Sergey Levine
LLMAG
OffRL
264
4
0
04 Dec 2025
Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Gianni Molinari
Fabio Ciravegna
90
0
0
03 Dec 2025
Evaluating Long-Context Reasoning in LLM-Based WebAgents
Andy Chung
Yichi Zhang
Kaixiang Lin
Aditya Rawal
Qiaozi Gao
Joyce Chai
LLMAG
LRM
178
3
0
03 Dec 2025
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Zhongyu Yang
Yingfang Yuan
Xuanming Jiang
Baoyi An
Wei Pang
LLMAG
HILM
LRM
200
4
0
02 Dec 2025
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
Yuanhe Zhang
Weiliu Wang
Zhenhong Zhou
Kun Wang
Jie Zhang
Li Sun
Yang Liu
Sen Su
165
3
0
02 Dec 2025
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
Vishnu Sarukkai
Asanshay Gupta
James Hong
Michael Gharbi
Kayvon Fatahalian
108
0
0
02 Dec 2025
Process-Centric Analysis of Agentic Software Systems
Shuyang Liu
Yang Chen
Rahul Krishna
Saurabh Sinha
Jatin Ganhotra
Reyhan Jabbarvand
101
4
0
02 Dec 2025
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAG
MoE
142
0
0
02 Dec 2025
Self-Improving VLM Judges Without Human Annotations
Inna Wanyin Lin
Yushi Hu
Shuyue Stella Li
Scott Geng
Pang Wei Koh
Luke Zettlemoyer
Tim Althoff
Marjan Ghazvininejad
VLM
LRM
51
3
0
02 Dec 2025
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
Jack Lu
Ryan Teehan
Jinran Jin
Mengye Ren
LRM
186
4
0
02 Dec 2025
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
Shubhi Asthana
Bing Zhang
Chad DeLuca
Ruchi Mahindru
Hima Patel
87
1
0
01 Dec 2025
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Jifeng Li
Arnav Grover
Abraham Alpuerto
Yupeng Cao
Xiao-Yang Liu
AIFin
268
0
0
01 Dec 2025
The Art of Scaling Test-Time Compute for Large Language Models
Aradhye Agarwal
Ayan Sengupta
Tanmoy Chakraborty
LRM
400
6
0
01 Dec 2025
Transforming Monolithic Foundation Models into Embodied Multi-Agent Architectures for Human-Robot Collaboration
Nan Sun
Bo Mao
Yongchang Li
Chenxu Wang
Di Guo
Huaping Liu
LM&Ro
139
0
0
30 Nov 2025
Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
Jianzhe Lin
Zeyu Pan
Yun Zhu
Ruiqi Song
Jining Yang
LRM
164
0
0
28 Nov 2025
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Joongwon Chae
Runming Wang
Chen Xiong
Gong Yunhan
Lian Zhang
Ji Jiansong
Dongmei Yu
Peiwu Qin
135
0
0
28 Nov 2025
ThetaEvolve: Test-time Learning on Open Problems
Y. Wang
Shao-Rong Su
Zhiyuan Zeng
Eva Xu
Liliang Ren
...
Pengcheng He
Weizhu Chen
Shuohang Wang
S. Du
Yelong Shen
368
13
0
28 Nov 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
255
0
0
28 Nov 2025
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Aayush Garg
Zanis Ali Khan
Renzo Degiovanni
Qiang Tang
AAML
177
0
0
28 Nov 2025
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
Aaron Steiner
Ralph Peeters
Christian Bizer
LLMAG
198
1
0
28 Nov 2025
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRM
VLM
215
1
0
28 Nov 2025
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
Hongda Liu
Yunfan Liu
Changlu Wang
Yunlong Wang
Zhenan Sun
LLMAG
305
1
0
27 Nov 2025
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
Mohd Ariful Haque
Fahad Rahman
Kishor Datta Gupta
Khalil Shujaee
Roy George
LLMAG
204
1
0
27 Nov 2025
Real-Time Procedural Learning From Experience for AI Agents
Dasheng Bi
Yubin Hu
Mohammed N. Nasir
107
0
0
27 Nov 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo
Shan Zhang
Yanpeng Sun
Jingjing Wu
Qunyi Xie
...
Wei He
Xiaofan Li
Na Zhao
Jingdong Wang
Z. Li
LRM
269
3
0
26 Nov 2025
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
Junjian Wang
Lidan Zhao
Xi Sheryl Zhang
252
0
0
26 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Synthesis
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALM
LRM
489
0
0
26 Nov 2025
Improving Language Agents through BREW
Shashank Kirtania
Param Biyani
Priyanshu Gupta
Yasharth Bajpai
Roshni Iyer
Sumit Gulwani
Gustavo Soares
LLMAG
OffRL
297
1
0
25 Nov 2025
Stabilizing Off-Policy Training for Long-Horizon LLM Agent via Turn-Level Importance Sampling and Clipping-Triggered Normalization
Chenliang Li
Adel Elmahdy
Alex Boyd
Zhongruo Wang
Alfredo García
Parminder Bhatia
Taha A. Kass-Hout
Cao Xiao
Mingyi Hong
Mingyi Hong
OffRL
262
1
0
25 Nov 2025
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Tianxin Wei
Noveen Sachdeva
Benjamin Coleman
Zhankui He
Yuanchen Bei
...
C. Wang
Shuo Chen
Fernando Pereira
Wang-Cheng Kang
D. Cheng
LLMAG
251
35
0
25 Nov 2025
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
Hyeonjae Kim
Chenyue Li
Wen Deng
Mengxi Jin
Wen Huang
Mengqian Lu
Binhang Yuan
AI4CE
353
2
0
25 Nov 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
239
1
0
24 Nov 2025
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
Nuo Xu
Zhaoting Gong
Ran Ran
Jinwei Tang
Wujie Wen
Caiwen Ding
171
0
0
23 Nov 2025
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Svitlana Volkova
Peter Bautista
Avinash Hiriyanna
Gabriel Ganberg
Isabel Erickson
Zachary Klinefelter
Nick Abele
Hsien-Te Kao
Grant Engberson
159
1
0
23 Nov 2025
Reasoning With a Star: A Heliophysics Dataset and Benchmark for Agentic Scientific Reasoning
Kevin Lee
Russell Spiewak
James Walsh
LRM
131
0
0
23 Nov 2025
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
XiangRui Zhang
Zeyu Chen
Haining Wang
Qiang Li
142
0
0
23 Nov 2025
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Yunsheng Bai
Haoxing Ren
146
0
0
21 Nov 2025
A Benchmark for Procedural Memory Retrieval in Language Agents
Ishant Kohar
Aswanth Krishnan
81
1
0
21 Nov 2025
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Yinjie Zhao
Heng Zhao
Bihan Wen
Joey Tianyi Zhou
LRM
148
0
0
21 Nov 2025
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
257
1
0
20 Nov 2025
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
Huseein Jawad
Nicolas Brunel
AAML
207
0
0
20 Nov 2025
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
Shanlin Zhou
Xinpeng Wang
Jianxun Lian
Zhenghao Liu
L. Lakshmanan
Xiaoyuan Yi
Yongtao Hao
LLMAG
424
0
0
19 Nov 2025
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
262
0
0
19 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
197
23
0
18 Nov 2025
1
2
3
4
...
24
25
26
Next
Page 1 of 26
Page
of 26
Go