2303.11366
Reflexion: Language Agents with Verbal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
Papers citing
"Reflexion: Language Agents with Verbal Reinforcement Learning"
50 / 1,270 papers shown
LLM Agents can Autonomously Hack Websites
Richard Fang
R. Bindu
Akul Gupta
Qiusi Zhan
Daniel Kang
LLMAG
282
89
0
06 Feb 2024
Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
The Web Conference (WWW), 2024
Kelvin J.L. Koa
Yunshan Ma
Ritchie Ng
Tat-Seng Chua
AIFin
LLMAG
351
50
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
234
12
0
06 Feb 2024
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
Tomoyuki Kagaya
Thong Jing Yuan
Yuxuan Lou
J. Karlekar
Sugiri Pranata
Akira Kinose
Koki Oguri
Felix Wick
Yang You
LLMAG
212
64
0
06 Feb 2024
Toward Human-AI Alignment in Large-Scale Multi-Player Games
Sugandha Sharma
Guy Davidson
Khimya Khetarpal
Anssi Kanervisto
Udit Arora
Katja Hofmann
Ida Momennejad
186
3
0
05 Feb 2024
Unified Hallucination Detection for Multimodal Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiang Chen
Chenxi Wang
Yida Xue
Ningyu Zhang
Xiaoyan Yang
Qian Li
Yue Shen
Lei Liang
Jinjie Gu
Huajun Chen
HILM
450
67
0
05 Feb 2024
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
Fangru Lin
Emanuele La Malfa
Valentin Hofmann
Elle Michelle Yang
Anthony Cohn
J. Pierrehumbert
LRM
377
29
0
05 Feb 2024
Understanding the planning of LLM agents: A survey
Xu Huang
Weiwen Liu
Xiaolong Chen
Xingmei Wang
Hao Wang
Defu Lian
Yasheng Wang
Ruiming Tang
Enhong Chen
LLMAG
LM&Ro
314
353
0
05 Feb 2024
Enhance Reasoning for Large Language Models in the Game Werewolf
Shuang Wu
Liwen Zhu
Tao Yang
Shiwei Xu
Qiang Fu
Yang Wei
Haobo Fu
LRM
LLMAG
307
32
0
04 Feb 2024
More Agents Is All You Need
Junyou Li
Qin Zhang
Yangbin Yu
Qiang Fu
Deheng Ye
LLMAG
371
117
0
03 Feb 2024
Calibration and Correctness of Language Models for Code
Claudio Spiess
David Gros
Kunal Suresh Pai
Michael Pradel
Md Rafiqul Islam Rabin
Amin Alipour
Susmit Jha
Prem Devanbu
Toufique Ahmed
339
61
0
03 Feb 2024
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Jian Xie
Kai Zhang
Jiangjie Chen
Tinghui Zhu
Renze Lou
Yuandong Tian
Yanghua Xiao
Yu-Chuan Su
LLMAG
LM&Ro
326
300
0
02 Feb 2024
Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning
D. Bhattacharjya
Junkyu Lee
Don Joven Agravante
Balaji Ganesan
Radu Marinescu
LLMAG
215
3
0
02 Feb 2024
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
Jian Guan
Wei Wu
Zujie Wen
Peng Xu
Hongning Wang
Shiyu Huang
LRM
200
31
0
02 Feb 2024
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Subbarao Kambhampati
Karthik Valmeekam
L. Guan
Mudit Verma
Kaya Stechly
Siddhant Bhambri
Lucas Saldyt
Anil Murthy
LRM
575
173
0
02 Feb 2024
Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions
Pouya Pezeshkpour
Eser Kandogan
Nikita Bhutani
Sajjadur Rahman
Tom Mitchell
Estevam R. Hruschka
LLMAG
LRM
195
11
0
02 Feb 2024
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing
Fangkai Jiao
Chengwei Qin
Zhengyuan Liu
Nancy F. Chen
Shafiq Joty
LRM
254
51
0
01 Feb 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
164
7
0
31 Jan 2024
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution
Cheng Qian
Shihao Liang
Yujia Qin
Yining Ye
Xin Cong
Yankai Lin
Yesai Wu
Zhiyuan Liu
Maosong Sun
LLMAG
221
23
0
25 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
1.0K
54
0
25 Jan 2024
Multi-granularity Knowledge Transfer for Continual Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Chaofan Pan
Lingfei Ren
Hao Wang
Linbo Xiong
Wei Wei
Yonghao Li
Xin Yang
564
2
0
25 Jan 2024
TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance
Haorui Wang
Rongzhi Zhang
Yinghao Li
Lingkai Kong
Yuchen Zhuang
Xiusi Chen
Chao Zhang
LRM
259
7
0
24 Jan 2024
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Neural Information Processing Systems (NeurIPS), 2024
Chang Ma
Junlei Zhang
Zhihao Zhu
Cheng Yang
Yujiu Yang
Yaohui Jin
Zhenzhong Lan
Lingpeng Kong
Junxian He
ELM
LLMAG
236
132
0
24 Jan 2024
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
Michael Ahn
Debidatta Dwibedi
Chelsea Finn
Montse Gonzalez Arenas
K. Gopalakrishnan
...
Fei Xia
Ted Xiao
Peng Xu
Steve Xu
Zhuo Xu
LM&Ro
244
93
0
23 Jan 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Taicheng Guo
Xiuying Chen
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
513
629
0
21 Jan 2024
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Tongxin Yuan
Zhiwei He
Lingzhong Dong
Yiming Wang
Ruijie Zhao
...
Binglin Zhou
Fangqi Li
Zhuosheng Zhang
Rui Wang
Gongshen Liu
ELM
411
142
0
18 Jan 2024
A Study on Training and Developing Large Language Models for Behavior Tree Generation
Fu Li
Xueying Wang
Bin Li
Yunlong Wu
Yanzhen Wang
Xiaodong Yi
259
10
0
16 Jan 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
International Conference on Machine Learning (ICML), 2024
Zongxin Yang
Guikun Chen
Xiaodi Li
Wenguan Wang
Yi Yang
LM&Ro
LLMAG
513
64
0
16 Jan 2024
Small Language Model Can Self-correct
AAAI Conference on Artificial Intelligence (AAAI), 2024
Haixia Han
Jiaqing Liang
Jie Shi
Qi He
Yanghua Xiao
LRM
SyDa
ReLM
KELM
269
26
0
14 Jan 2024
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Wenqi Shi
Ran Xu
Yuchen Zhuang
Yue Yu
Jieyu Zhang
Hang Wu
Yuanda Zhu
Joyce C. Ho
Carl Yang
May D. Wang
203
75
0
13 Jan 2024
Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Planning Case Study
Shangding Gu
LLMAG
357
1
0
12 Jan 2024
AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shuofei Qiao
Ningyu Zhang
Runnan Fang
Yujie Luo
Wangchunshu Zhou
Yuchen Eleanor Jiang
Chengfei Lv
Huajun Chen
LLMAG
351
68
0
10 Jan 2024
Agent Alignment in Evolving Social Norms
Shimin Li
Tianxiang Sun
Qinyuan Cheng
Xipeng Qiu
LLMAG
301
12
0
09 Jan 2024
A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates
Raphael Milliere
Cameron Buckner
LRM
ELM
191
37
0
08 Jan 2024
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
Yaojia Lv
Haojie Pan
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
214
4
0
06 Jan 2024
AFSPP: Agent Framework for Shaping Preference and Personality with Large Language Models
Zihong He
Changwang Zhang
LLMAG
185
6
0
05 Jan 2024
Using LLM to select the right SQL Query from candidates
Zhenwen Li
Tao Xie
LLMAG
164
14
0
04 Jan 2024
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Aleksandar Stanić
Sergi Caelles
Michael Tschannen
LRM
VLM
322
13
0
03 Jan 2024
State Machine of Thoughts: Leveraging Past Reasoning Trajectories for Enhancing Problem Solving
Jia Liu
Jie Shuai
Xiyao Li
LRM
189
3
0
29 Dec 2023
Experiential Co-Learning of Software-Developing Agents
Cheng Qian
Yufan Dang
Jiahao Li
Wei Liu
Zihao Xie
...
Cheng Yang
Xin Cong
Xiaoyin Che
Zhiyuan Liu
Maosong Sun
LLMAG
369
64
0
28 Dec 2023
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI
Lihang Pan
Bowen Wang
Chun Yu
Yuxuan Chen
Xiangyu Zhang
Yuanchun Shi
166
5
0
26 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
247
23
0
24 Dec 2023
NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes
Lizhou Fan
Qingfeng Lan
Jinkui Chi
Haoyang Ling
Yongfeng Zhang
LRM
357
90
0
22 Dec 2023
Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities
Alaa Saleh
Roberto Morabito
Sasu Tarkoma
Susanna Pirttikangas
Lauri Lovén
340
10
0
22 Dec 2023
ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation
Difei Gao
Lei Ji
Zechen Bai
Mingyu Ouyang
Peiran Li
...
Peiyi Wang
Xiangwu Guo
Hengxu Wang
Luowei Zhou
Mike Zheng Shou
LLMAG
323
37
0
20 Dec 2023
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation
Dong Huang
Jie M. Zhang
Michael Luck
Qi Bu
Yuhao Qing
Heming Cui
LLMAG
237
0
0
20 Dec 2023
Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives
Chen Gao
Xiaochong Lan
Nian Li
Yuan Yuan
Jingtao Ding
Zhilun Zhou
Fengli Xu
Yong Li
LLMAG
AI4CE
LM&Ro
287
291
0
19 Dec 2023
Evaluating Language-Model Agents on Realistic Autonomous Tasks
Megan Kinniment
Lucas Jun Koba Sato
Haoxing Du
Brian Goodrich
Max Hasin
...
H. Wijk
Joel Burget
Aaron Ho
Elizabeth Barnes
Paul Christiano
ELM
LLMAG
323
96
0
18 Dec 2023
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs
Yuxuan Huang
Lida Shi
Anqi Liu
Hao Xu
LLMAG
ELM
KELM
LRM
167
4
0
18 Dec 2023
CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update
Zhi Gao
Yuntao Du
Xintong Zhang
Xiaojian Ma
Wenjuan Han
Song-Chun Zhu
Qing Li
LLMAG
VLM
395
45
0
18 Dec 2023