ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.11366
  4. Cited By
Reflexion: Language Agents with Verbal Reinforcement Learning
v1v2v3v4 (latest)

Reflexion: Language Agents with Verbal Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023
20 March 2023
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
    LLMAGKELM
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)

Papers citing "Reflexion: Language Agents with Verbal Reinforcement Learning"

50 / 1,268 papers shown
A Review of Repository Level Prompting for LLMs
A Review of Repository Level Prompting for LLMs
Douglas Schonholtz
51
1
0
15 Dec 2023
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Haoran Liao
Qinyi Du
Shaohua Hu
Hao He
Yanyan Xu
Jidong Tian
Yaohui Jin
LRMAI4CE
194
2
0
14 Dec 2023
Can It Edit? Evaluating the Ability of Large Language Models to Follow
  Code Editing Instructions
Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
Federico Cassano
Luisa Li
Akul Sethi
Noah Shinn
Abby Brennan-Jones
...
Edward Berman
George Chakhnashvili
Anton Lozhkov
C. Anderson
Arjun Guha
ELMKELM
674
45
0
11 Dec 2023
Language Models, Agent Models, and World Models: The LAW for Machine
  Reasoning and Planning
Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
Zhiting Hu
Tianmin Shu
LLMAGLM&RoLRM
331
47
0
08 Dec 2023
KwaiAgents: Generalized Information-seeking Agent System with Large
  Language Models
KwaiAgents: Generalized Information-seeking Agent System with Large Language Models
Haojie Pan
Zepeng Zhai
Hao Yuan
Yaojia Lv
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
LLMAGRALM
255
14
0
08 Dec 2023
Fortify the Shortest Stave in Attention: Enhancing Context Awareness of
  Large Language Models for Effective Tool Use
Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Yuhan Chen
Ang Lv
Ting-En Lin
Cai Chen
Yuchuan Wu
Fei Huang
Yongbin Li
Rui Yan
231
39
0
07 Dec 2023
Towards Knowledge-driven Autonomous Driving
Towards Knowledge-driven Autonomous Driving
Xin Li
Yeqi Bai
Pinlong Cai
Licheng Wen
Daocheng Fu
...
Yikang Li
Ding Wang
Yong-Jin Liu
Xiaoling Wang
Yu Qiao
409
36
0
07 Dec 2023
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent
  Ecosystem
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem
Yingqiang Ge
Yujie Ren
Qingfeng Lan
Shuyuan Xu
Juntao Tan
Zelong Li
LLMAG
248
38
0
06 Dec 2023
D-Bot: Database Diagnosis System using Large Language Models
D-Bot: Database Diagnosis System using Large Language ModelsProceedings of the VLDB Endowment (PVLDB), 2023
Xuanhe Zhou
Guoliang Li
Zhaoyan Sun
Zhiyuan Liu
Weize Chen
Jianming Wu
Jiesi Liu
Ruohang Feng
Guoyang Zeng
LLMAG
217
33
0
03 Dec 2023
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorldComputer Vision and Pattern Recognition (CVPR), 2023
Yijun Yang
Tianyi Zhou
Kanxue Li
Dapeng Tao
Lusong Li
Li Shen
Xiaodong He
Jing Jiang
Yuhui Shi
LLMAGLM&Ro
184
70
0
28 Nov 2023
Agents meet OKR: An Object and Key Results Driven Agent System with
  Hierarchical Self-Collaboration and Self-Evaluation
Agents meet OKR: An Object and Key Results Driven Agent System with Hierarchical Self-Collaboration and Self-Evaluation
Yi Zheng
Chongyang Ma
Kanle Shi
Haibin Huang
174
4
0
28 Nov 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
(Ir)rationality in AI: State of the Art, Research Challenges and Open QuestionsArtificial Intelligence Review (AIR), 2023
Olivia Macmillan-Scott
Mirco Musolesi
412
3
0
28 Nov 2023
Function-constrained Program Synthesis
Function-constrained Program Synthesis
Patrick Hajali
Ignas Budvytis
188
1
0
27 Nov 2023
LLM-Assisted Code Cleaning For Training Accurate Code Generators
LLM-Assisted Code Cleaning For Training Accurate Code GeneratorsInternational Conference on Learning Representations (ICLR), 2023
Naman Jain
Tianjun Zhang
Wei-Lin Chiang
Joseph E. Gonzalez
Koushik Sen
Ion Stoica
185
43
0
25 Nov 2023
Large Language Model as a Policy Teacher for Training Reinforcement
  Learning Agents
Large Language Model as a Policy Teacher for Training Reinforcement Learning AgentsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Zihao Zhou
Bin-Bin Hu
Chenyang Zhao
Pu Zhang
Yinan Han
LLMAG
520
29
0
22 Nov 2023
AcademicGPT: Empowering Academic Research
AcademicGPT: Empowering Academic Research
Shufa Wei
Xiaolong Xu
Xianbiao Qi
Xi Yin
Jun Xia
...
Chihao Dai
Lihua Wang
Xiaohui Liu
Lei Zhang
Yutao Xie
LM&MA
211
5
0
21 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From
  Chain-of-Thought Reasoning to Language Agents
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAGLM&RoLRM
363
92
0
20 Nov 2023
Meta Prompting for AI Systems
Meta Prompting for AI Systems
Yifan Zhang
Yang Yuan
Andrew Chi-Chih Yao
LLMAGLRM
741
16
0
20 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language
  Model-based Agents in Real-world Systems
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
292
50
0
19 Nov 2023
Understanding the Effectiveness of Large Language Models in Detecting
  Security Vulnerabilities
Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities
Avishree Khare
Saikat Dutta
Ziyang Li
Alaia Solko-Breslin
Rajeev Alur
Mayur Naik
ELM
368
85
0
16 Nov 2023
INTERVENOR: Prompting the Coding Ability of Large Language Models with
  the Interactive Chain of Repair
INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
Hanbin Wang
Zhenghao Liu
Shuo Wang
Ganqu Cui
Ning Ding
Zhiyuan Liu
Ge Yu
KELMLRM
405
19
0
16 Nov 2023
ML-Bench: Evaluating Large Language Models and Agents for Machine
  Learning Tasks on Repository-Level Code
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code
Xiangru Tang
Yuliang Liu
Zefan Cai
Yan Shao
Junjie Lu
...
Yujia Qin
Wangchunshu Zhou
Yilun Zhao
Arman Cohan
Mark B. Gerstein
ELMLLMAG
330
44
0
16 Nov 2023
On Evaluating the Integration of Reasoning and Action in LLM Agents with
  Database Question Answering
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan
Ellen Zhang
Weijin Zou
Yilun Zhao
Wenfei Zhou
Arman Cohan
LLMAG
294
16
0
16 Nov 2023
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response
  Generation
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yikun Wang
Rui Zheng
Haoming Li
Tao Gui
Tao Gui
Fei Liu
OffRL
250
5
0
15 Nov 2023
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Shumin Deng
Ningyu Zhang
Nay Oo
Bryan Hooi
LRM
298
3
0
15 Nov 2023
MAgIC: Investigation of Large Language Model Powered Multi-Agent in
  Cognition, Adaptability, Rationality and Collaboration
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and CollaborationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lin Xu
Zhiyuan Hu
Daquan Zhou
Hongyu Ren
Zhen Dong
Kurt Keutzer
See Kiong Ng
Jiashi Feng
LRMLLMAGELM
228
51
0
14 Nov 2023
LLMs cannot find reasoning errors, but can correct them given the error
  location
LLMs cannot find reasoning errors, but can correct them given the error locationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Gladys Tyen
Hassan Mansoor
Victor Carbune
Peter Chen
Tony Mak
LRM
498
86
0
14 Nov 2023
Towards Reasoning in Large Language Models via Multi-Agent Peer Review
  Collaboration
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Zhenran Xu
Senbao Shi
Baotian Hu
Jindi Yu
Dongfang Li
Min Zhang
Yuxiang Wu
LRMLLMAGALM
266
47
0
14 Nov 2023
A Closer Look at the Self-Verification Abilities of Large Language
  Models in Logical Reasoning
A Closer Look at the Self-Verification Abilities of Large Language Models in Logical ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Ruixin Hong
Hongming Zhang
Xinyu Pang
Dong Yu
Changshui Zhang
LRM
224
43
0
14 Nov 2023
CPopQA: Ranking Cultural Concept Popularity by LLMs
CPopQA: Ranking Cultural Concept Popularity by LLMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Ming Jiang
Mansi Joshi
202
8
0
14 Nov 2023
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone
  GUI Navigation
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
An Yan
Zhengyuan Yang
Wanrong Zhu
Kevin Qinghong Lin
Linjie Li
...
Yiwu Zhong
Julian McAuley
Jianfeng Gao
Zicheng Liu
Lijuan Wang
LLMAGLM&Ro
388
143
0
13 Nov 2023
Past as a Guide: Leveraging Retrospective Learning for Python Code
  Completion
Past as a Guide: Leveraging Retrospective Learning for Python Code Completion
Seunggyoon Shin
Seunggyu Chang
Sungjoon Choi
KELM
182
1
0
13 Nov 2023
Volcano: Mitigating Multimodal Hallucination through Self-Feedback
  Guided Revision
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided RevisionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Seongyun Lee
Sue Hyun Park
Yongrae Jo
Minjoon Seo
285
88
0
13 Nov 2023
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback
Seungjun Moon
Hyungjoo Chae
Yongho Song
Taeyoon Kwon
Dongjin Kang
Kai Tzu-iunn Ong
Seung-won Hwang
Jinyoung Yeo
KELM
207
16
0
13 Nov 2023
On the Discussion of Large Language Models: Symmetry of Agents and
  Interplay with Prompts
On the Discussion of Large Language Models: Symmetry of Agents and Interplay with Prompts
Qineng Wang
Zihao Wang
Ying Su
Yangqiu Song
AI4CELLMAG
302
2
0
13 Nov 2023
Large Language Models are Zero Shot Hypothesis Proposers
Large Language Models are Zero Shot Hypothesis Proposers
Biqing Qi
Kaiyan Zhang
Haoxiang Li
Kai Tian
Sihang Zeng
Zhang-Ren Chen
Bowen Zhou
265
49
0
10 Nov 2023
AI-native Interconnect Framework for Integration of Large Language Model
  Technologies in 6G Systems
AI-native Interconnect Framework for Integration of Large Language Model Technologies in 6G Systems
Sasu Tarkoma
Roberto Morabito
Jaakko Sauvola
351
32
0
10 Nov 2023
Large Language Models can Strategically Deceive their Users when Put
  Under Pressure
Large Language Models can Strategically Deceive their Users when Put Under Pressure
Jérémy Scheurer
Mikita Balesni
Marius Hobbhahn
LLMAG
434
91
0
09 Nov 2023
Prompt Engineering a Prompt Engineer
Prompt Engineering a Prompt Engineer
Qinyuan Ye
Maxamed Axmed
Reid Pryzant
Fereshte Khani
VLMLLMAGLRM
334
84
0
09 Nov 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
259
140
0
08 Nov 2023
Human-Centered Planning
Human-Centered Planning
Yuliang Li
Nitin Kamra
Ruta Desai
A. Halevy
134
1
0
08 Nov 2023
Rephrase and Respond: Let Large Language Models Ask Better Questions for
  Themselves
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Yihe Deng
Weitong Zhang
Zixiang Chen
Quanquan Gu
LRM
480
132
0
07 Nov 2023
Large Language Models Illuminate a Progressive Pathway to Artificial
  Healthcare Assistant: A Review
Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review
Mingze Yuan
Peng Bao
Jiajia Yuan
Yunhao Shen
Zi Chen
...
Jie Zhao
Yang Chen
Li Zhang
Lin Shen
Bin Dong
ELMLM&MA
279
20
0
03 Nov 2023
Multi-Agent Consensus Seeking via Large Language Models
Multi-Agent Consensus Seeking via Large Language Models
Huaben Chen
Wenkang Ji
Lufeng Xu
Shiyu Zhao
LM&RoLLMAG
382
46
0
31 Oct 2023
LILO: Learning Interpretable Libraries by Compressing and Documenting
  Code
LILO: Learning Interpretable Libraries by Compressing and Documenting CodeInternational Conference on Learning Representations (ICLR), 2023
Gabriel Grand
L. Wong
Matthew Bowers
Theo X. Olausson
Muxin Liu
Joshua B. Tenenbaum
Jacob Andreas
315
31
0
30 Oct 2023
MM-VID: Advancing Video Understanding with GPT-4V(ision)
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Kevin Qinghong Lin
Faisal Ahmed
Linjie Li
Chung-Ching Lin
E. Azarnasab
...
Lin Liang
Zicheng Liu
Yumao Lu
Ce Liu
Lijuan Wang
MLLM
232
84
0
30 Oct 2023
N-Critics: Self-Refinement of Large Language Models with Ensemble of
  Critics
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics
Sajad Mousavi
Ricardo Luna Gutierrez
Desik Rengarajan
Vineet Gundecha
Ashwin Ramesh Babu
Avisek Naug
Antonio Guillen-Perez
Soumyendu Sarkar
LRMHILMKELM
187
7
0
28 Oct 2023
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive
  Learning for Code Generation
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Hailin Chen
Amrita Saha
Steven C. H. Hoi
Shafiq Joty
222
9
0
28 Oct 2023
ASPIRO: Any-shot Structured Parsing-error-Induced ReprOmpting for
  Consistent Data-to-Text Generation
ASPIRO: Any-shot Structured Parsing-error-Induced ReprOmpting for Consistent Data-to-Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Martin Vejvar
Yasutaka Fujimoto
168
1
0
27 Oct 2023
PromptAgent: Strategic Planning with Language Models Enables
  Expert-level Prompt Optimization
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt OptimizationInternational Conference on Learning Representations (ICLR), 2023
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
458
188
0
25 Oct 2023
Previous
123...23242526
Next