Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2303.11366
Cited By
v1
v2
v3
v4 (latest)
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,254 papers shown
Title
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Addison J. Wu
Ryan Liu
Xuechunzi Bai
Thomas Griffiths
140
0
0
24 Dec 2025
In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs
Vishnu Sarukkai
Asanshay Gupta
James Hong
Michael Gharbi
Kayvon Fatahalian
20
0
0
02 Dec 2025
Process-Centric Analysis of Agentic Software Systems
Shuyang Liu
Yang Chen
Rahul Krishna
Saurabh Sinha
Jatin Ganhotra
Reyhan Jabbarvand
16
0
0
02 Dec 2025
InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Zhongyu Yang
Yingfang Yuan
Xuanming Jiang
Baoyi An
Wei Pang
LLMAG
HILM
LRM
60
0
0
02 Dec 2025
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
Jack Lu
Ryan Teehan
Jinran Jin
Mengye Ren
LRM
92
0
0
02 Dec 2025
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAG
MoE
72
0
0
02 Dec 2025
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
Shubhi Asthana
Bing Zhang
Chad DeLuca
Ruchi Mahindru
Hima Patel
12
0
0
01 Dec 2025
Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading
Jifeng Li
Arnav Grover
Abraham Alpuerto
Yupeng Cao
Xiao-Yang Liu
AIFin
104
0
0
01 Dec 2025
Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting
Joongwon Chae
Runming Wang
Chen Xiong
Gong Yunhan
Lian Zhang
Ji Jiansong
Dongmei Yu
Peiwu Qin
32
0
0
28 Nov 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
80
0
0
28 Nov 2025
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRM
VLM
77
0
0
28 Nov 2025
MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)
Aaron Steiner
Ralph Peeters
Christian Bizer
LLMAG
93
0
0
28 Nov 2025
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition
Hongda Liu
Yunfan Liu
Changlu Wang
Yunlong Wang
Zhenan Sun
LLMAG
92
0
0
27 Nov 2025
TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices
Mohd Ariful Haque
Fahad Rahman
Kishor Datta Gupta
Khalil Shujaee
Roy George
LLMAG
98
0
0
27 Nov 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo
Shan Zhang
Yanpeng Sun
Jingjing Wu
Qunyi Xie
...
Wei He
Xiaofan Li
Na Zhao
Jingdong Wang
Z. Li
LRM
186
0
0
26 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Verification
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALM
LRM
361
0
0
26 Nov 2025
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
Junjian Wang
Lidan Zhao
Xi Sheryl Zhang
145
0
0
26 Nov 2025
Improving Language Agents through BREW
Shashank Kirtania
Param Biyani
Priyanshu Gupta
Yasharth Bajpai
Roshni Iyer
Sumit Gulwani
Gustavo Soares
LLMAG
OffRL
230
0
0
25 Nov 2025
CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows
Hyeonjae Kim
Chenyue Li
Wen Deng
Mengxi Jin
Wen Huang
Mengqian Lu
Binhang Yuan
AI4CE
271
0
0
25 Nov 2025
ST-PPO: Stabilized Off-Policy Proximal Policy Optimization for Multi-Turn Agents Training
Chenliang Li
Adel Elmahdy
Alex Boyd
Zhongruo Wang
Alfredo García
Parminder Bhatia
Taha A. Kass-Hout
Cao Xiao
Mingyi Hong
OffRL
143
0
0
25 Nov 2025
Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Tianxin Wei
Noveen Sachdeva
Benjamin Coleman
Zhankui He
Yuanchen Bei
...
C. Wang
Shuo Chen
Fernando Pereira
Wang-Cheng Kang
D. Cheng
LLMAG
207
1
0
25 Nov 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
161
0
0
24 Nov 2025
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
Nuo Xu
Zhaoting Gong
Ran Ran
Jinwei Tang
Wujie Wen
Caiwen Ding
92
0
0
23 Nov 2025
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
Svitlana Volkova
Peter Bautista
Avinash Hiriyanna
Gabriel Ganberg
Isabel Erickson
Zachary Klinefelter
Nick Abele
Hsien-Te Kao
Grant Engberson
88
0
0
23 Nov 2025
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
XiangRui Zhang
Zeyu Chen
Haining Wang
Qiang Li
96
0
0
23 Nov 2025
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Yunsheng Bai
Haoxing Ren
80
0
0
21 Nov 2025
Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Yinjie Zhao
Heng Zhao
Bihan Wen
Joey Tianyi Zhou
LRM
68
0
0
21 Nov 2025
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
Huseein Jawad
Nicolas Brunel
AAML
128
0
0
20 Nov 2025
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
183
0
0
20 Nov 2025
NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework
Shanlin Zhou
Xinpeng Wang
Jianxun Lian
Zhenghao Liu
L. Lakshmanan
Xiaoyuan Yi
Yongtao Hao
LLMAG
322
0
0
19 Nov 2025
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
200
0
0
19 Nov 2025
Beyond Accuracy: A Multi-Dimensional Framework for Evaluating Enterprise Agentic AI Systems
Sushant Mehta
ELM
136
0
0
18 Nov 2025
AutoTool: Efficient Tool Selection for Large Language Model Agents
Jingyi Jia
Qinbin Li
LLMAG
104
0
0
18 Nov 2025
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
Chao Yu
Qixin Tan
Jiaxuan Gao
Shi Yu
Hong Lu
Xinting Yang
Zelai Xu
Yu Wang
Yi Wu
Eugene Vinitsky
LRM
116
0
0
18 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
116
3
0
18 Nov 2025
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
Genglin Liu
Shijie Geng
Sha Li
Hejie Cui
Sarah Zhang
Xin Liu
Tianyi Liu
CLL
518
0
0
17 Nov 2025
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
Przemyslaw Chojecki
ELM
76
3
0
17 Nov 2025
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Haoyang Hong
Jiajun Yin
Y. Wang
Jingnan Liu
Zhe Chen
...
M. Yang
Chunxiao Guo
Junwei Liu
Peng Wei
Jinjie Gu
95
0
0
17 Nov 2025
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
Wenxin Zhu
Andong Chen
Yuchen Song
Kehai Chen
Conghui Zhu
Ziyan Chen
Tiejun Zhao
LRM
402
0
0
17 Nov 2025
Generative Caching for Structurally Similar Prompts and Responses
Sarthak Chakraborty
Suman Nath
Xuchao Zhang
Chetan Bansal
Indranil Gupta
130
1
0
14 Nov 2025
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Yunzhe Xu
Zhuosheng Zhang
Zhe Liu
137
0
0
13 Nov 2025
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
Zhiheng Xi
Chenyang Liao
Guanyu Li
Y. Yang
Wenxiang Chen
...
Wei Wu
Tao Ji
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
100
0
0
11 Nov 2025
Analyzing Political Text at Scale with Online Tensor LDA
Sara Kangaslahti
Danny Ebanks
Jean Kossaifi
Anqi Liu
R. Alvarez
A. Anandkumar
92
0
0
11 Nov 2025
Meta-cognitive Multi-scale Hierarchical Reasoning for Motor Imagery Decoding
Si-Hyun Kim
Heon Kwak
Byoung-Hee Kwon
Seong-Whan Lee
152
0
0
11 Nov 2025
Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Songze Li
Zhiqiang Liu
Zhaoyan Gong
Xiaoke Guo
Zhengke Gui
H. Chen
Wen Zhang
LRM
214
0
0
11 Nov 2025
Procedural Knowledge Improves Agentic LLM Workflows
Vincent Hsiao
Mark Roberts
Leslie Smith
AIFin
363
0
0
10 Nov 2025
Recursive Dynamics in Fast-Weights Homeostatic Reentry Networks: Toward Reflective Intelligence
B. G. Chae
137
2
0
10 Nov 2025
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Jinhao Chen
Zhen Yang
Jianxin Shi
Tianyu Wo
J. Tang
ReLM
LRM
228
0
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLL
LLMAG
LRM
250
3
0
09 Nov 2025
Evaluation of retrieval-based QA on QUEST-LOFT
Nathan Scales
Nathanael Scharli
Olivier Bousquet
RALM
336
0
0
08 Nov 2025
1
2
3
4
...
24
25
26
Next