Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.16291
Cited By
Voyager: An Open-Ended Embodied Agent with Large Language Models
25 May 2023
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Voyager: An Open-Ended Embodied Agent with Large Language Models"
50 / 134 papers shown
Title
Adaptive Stress Testing Black-Box LLM Planners
Neeloy Chakraborty
John Pohovey
Melkior Ornik
Katherine Driggs-Campbell
23
0
0
08 May 2025
Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions
Stéphane Aroca-Ouellette
Miguel Aroca-Ouellette
K. Wense
A. Roncone
32
0
0
07 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
44
0
0
07 May 2025
The Cognitive Foundations of Economic Exchange: A Modular Framework Grounded in Behavioral Evidence
Egil Diau
25
0
0
05 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng
Weihao Tan
Zhiyi Lyu
Longtao Zheng
Haiyang Xu
M. Yan
Fei Huang
Bo An
22
0
0
01 May 2025
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks
Vishnu Sarukkai
Zhiqiang Xie
Kayvon Fatahalian
LLMAG
68
0
0
01 May 2025
Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability
Zishen Wan
Jiayi Qian
Yuhang Du
Jason J. Jabbour
Yilun Du
Yang Katie Zhao
A. Raychowdhury
Tushar Krishna
Vijay Janapa Reddi
LM&Ro
86
0
0
26 Apr 2025
AI Awareness
X. Li
Haoyuan Shi
Rongwu Xu
Wei Xu
54
0
0
25 Apr 2025
Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning
Isadora White
Kolby Nottingham
Ayush Maniar
Max Robinson
Hansen Lillemark
Mehul Maheshwari
Lianhui Qin
Prithviraj Ammanabrolu
LLMAG
LM&Ro
115
0
0
24 Apr 2025
Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution
Junjie Chen
H. Li
Jingli Yang
Y. Liu
Qingyao Ai
LLMAG
82
0
0
23 Apr 2025
Planet as a Brain: Towards Internet of AgentSites based on AIOS Server
Xiang Zhang
Yongfeng Zhang
39
0
0
19 Apr 2025
Exploring Expert Failures Improves LLM Agent Tuning
Li-Cheng Lan
Andrew Bai
Minhao Cheng
Ruochen Wang
Cho-Jui Hsieh
LRM
86
0
0
17 Apr 2025
AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents
Haoyu Wang
Christopher M. Poskitt
Jun Sun
37
0
0
24 Mar 2025
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms
Seungwon Lim
Sungwoong Kim
Jihwan Yu
Sungjae Lee
Jiwan Chung
Youngjae Yu
64
1
0
18 Mar 2025
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
54
0
0
15 Mar 2025
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Z. Wang
Yurui Dong
Fuwen Luo
Minyuan Ruan
Zhili Cheng
C. L. P. Chen
Peng Li
Yang Liu
LRM
79
0
0
13 Mar 2025
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Hanyang Zhao
Haoxian Chen
Yucheng Guo
Genta Indra Winata
Tingting Ou
Ziyu Huang
D. Yao
Wenpin Tang
54
0
0
13 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
56
0
0
11 Mar 2025
Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent
Xingzuo Li
Kehai Chen
Yunfei Long
X. Bai
Yong-mei Xu
Min Zhang
LRM
LLMAG
79
1
0
04 Mar 2025
SFO: Piloting VLM Feedback for Offline RL
Jacob Beck
OffRL
31
0
0
02 Mar 2025
Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing
Masaya Kobayashi
Masane Fuchi
Amar Zanashir
Tomonori Yoneda
Tomohiro Takagi
LLMAG
42
1
0
24 Feb 2025
Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
Shuo Tang
Xianghe Pang
Zexi Liu
Bohan Tang
Rui Ye
Xiaowen Dong
Y. Wang
Yanfeng Wang
S. Chen
SyDa
LLMAG
127
3
0
21 Feb 2025
AIDE: AI-Driven Exploration in the Space of Code
Zhengyao Jiang
Dominik Schmidt
Dhruv Srikanth
Dixing Xu
Ian Kaplan
Deniss Jacenko
Yuxiang Wu
64
5
0
18 Feb 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Xinyi Yang
Liang Zeng
Heng Dong
C. Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
68
2
0
18 Feb 2025
AgentStudio: A Toolkit for Building General Virtual Agents
Longtao Zheng
Zhiyuan Huang
Zhenghai Xue
Xinrun Wang
Bo An
Shuicheng Yan
77
14
0
17 Feb 2025
EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Guibin Zhang
Kaijie Chen
Guancheng Wan
Heng Chang
Hong Cheng
K. Wang
Shuyue Hu
Lei Bai
73
2
0
11 Feb 2025
Self-Supervised Prompt Optimization
Jinyu Xiang
Jiayi Zhang
Zhaoyang Yu
Fengwei Teng
Jinhao Tu
Xinbing Liang
Sirui Hong
Chenglin Wu
Yuyu Luo
OffRL
LRM
66
5
0
07 Feb 2025
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Zehan Qi
Xiao-Chang Liu
Iat Long Iong
Hanyu Lai
X. Sun
...
Shuntian Yao
Tianjie Zhang
Wei Xu
J. Tang
Yuxiao Dong
93
14
0
28 Jan 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
41
1
0
28 Jan 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
57
1
0
20 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
85
13
0
03 Jan 2025
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents
Fanzeng Xia
Hao Liu
Yisong Yue
Tongxin Li
57
1
0
03 Jan 2025
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Timothée Anne
Noah Syrkis
Meriem Elhosni
Florian Turati
Franck Legendre
Alain Jaquier
Sebastian Risi
LLMAG
90
2
0
16 Dec 2024
Cocoa: Co-Planning and Co-Execution with AI Agents
K. J. Kevin Feng
Kevin Pu
Matt Latzke
Tal August
Pao Siangliulue
Jonathan Bragg
Daniel S. Weld
Amy X. Zhang
Joseph Chee Chang
LM&Ro
LLMAG
87
4
0
14 Dec 2024
Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation
Rithvik Prakki
LLMAG
AI4CE
128
0
0
10 Dec 2024
Simulating Human-like Daily Activities with Desire-driven Autonomy
Yiding Wang
Yuxuan Chen
Fangwei Zhong
Long Ma
Yizhou Wang
70
2
0
09 Dec 2024
AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation
Guanxing Lu
Tengbo Yu
Haoyuan Deng
Season Si Chen
Yansong Tang
Ziwei Wang
70
3
0
09 Dec 2024
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
Xueqing Wu
Yuheng Ding
Bingxuan Li
Pan Lu
Da Yin
Kai-Wei Chang
Nanyun Peng
LRM
100
3
0
03 Dec 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
106
10
0
20 Nov 2024
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
Jie Liu
Pan Zhou
Yingjun Du
Ah-Hwee Tan
Cees G. M. Snoek
J. Sonke
E. Gavves
LLMAG
29
1
0
07 Nov 2024
Interacting Large Language Model Agents. Interpretable Models and Social Learning
Adit Jain
Vikram Krishnamurthy
LLMAG
28
0
0
02 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
77
2
0
01 Nov 2024
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Jingxuan Chen
Derek Yuen
Bin Xie
Y. Yang
Gongwei Chen
...
Liqiang Nie
Yasheng Wang
Jianye Hao
Jun Wang
Kun Shao
LLMAG
38
5
0
19 Oct 2024
In-Context Learning Enables Robot Action Prediction in LLMs
Yida Yin
Zekai Wang
Yuvan Sharma
Dantong Niu
Trevor Darrell
Roei Herzig
LM&Ro
80
1
0
16 Oct 2024
Denial-of-Service Poisoning Attacks against Large Language Models
Kuofeng Gao
Tianyu Pang
Chao Du
Yong Yang
Shu-Tao Xia
Min-Bin Lin
SILM
AAML
54
4
0
14 Oct 2024
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks
Mahmood Hegazy
LLMAG
LRM
AI4CE
26
0
0
10 Oct 2024
AgentSquare: Automatic LLM Agent Search in Modular Design Space
Yu Shang
Yu Li
Keyu Zhao
Likai Ma
J. Liu
Fengli Xu
Yong Li
LLMAG
42
9
0
08 Oct 2024
Open-World Reinforcement Learning over Long Short-Term Imagination
Jiajian Li
Q. Wang
Yunbo Wang
Xin Jin
Yang Li
Wenjun Zeng
Xiaokang Yang
OCL
VLM
47
1
0
04 Oct 2024
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
Yu Ying Chiu
Liwei Jiang
Yejin Choi
51
2
0
03 Oct 2024
SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments
Zachary Ravichandran
Varun Murali
Mariliza Tzes
George J. Pappas
Vijay Kumar
LRM
51
6
0
03 Oct 2024
1
2
3
Next