Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.11807
Cited By
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
18 March 2024
Jen-tse Huang
E. Li
Man Ho Lam
Tian Liang
Wenxuan Wang
Youliang Yuan
Wenxiang Jiao
Xing Wang
Zhaopeng Tu
Michael R. Lyu
ELM
LLMAG
Re-assign community
ArXiv
PDF
HTML
Papers citing
"How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments"
23 / 23 papers shown
Title
PLANET: A Collection of Benchmarks for Evaluating LLMs' Planning Capabilities
Haoming Li
Zhaoliang Chen
Jonathan Zhang
Fei Liu
LLMAG
27
0
0
21 Apr 2025
Self-Resource Allocation in Multi-Agent LLM Systems
Alfonso Amayuelas
Jingbo Yang
Saaket Agashe
Ashwin Nagarajan
Antonis Antoniades
X. Wang
William Wang
79
0
0
02 Apr 2025
Are Large Vision Language Models Good Game Players?
Xinyu Wang
Bohan Zhuang
Qi Wu
MLLM
ELM
LRM
79
3
0
04 Mar 2025
MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions
Yuxuan Liu
Hongda Sun
Wei Liu
Jian Luan
Bo Du
Rui Yan
38
1
0
24 Feb 2025
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology
Fatemeh Ghezloo
M. S. Seyfioglu
Rustin Soraki
Wisdom O. Ikezogwo
Beibin Li
Tejoram Vivekanandan
J. Elmore
Ranjay Krishna
Linda G. Shapiro
73
2
0
13 Feb 2025
Game Theory Meets Large Language Models: A Systematic Survey
Haoran Sun
Yusen Wu
Yukun Cheng
Xu Chu
LM&MA
OffRL
AI4CE
44
1
0
13 Feb 2025
Game-theoretic LLM: Agent Workflow for Negotiation Games
Wenyue Hua
Ollie Liu
Lingyao Li
Alfonso Amayuelas
Julie Chen
...
Lizhou Fan
Fei Sun
William Yang Wang
X. Wang
Yongfeng Zhang
38
2
0
08 Nov 2024
TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
H. Wang
Xiachong Feng
Lei Li
Z. Qin
Dianbo Sui
Lingpeng Kong
LRM
ELM
22
1
0
14 Oct 2024
Intelligent Computing Social Modeling and Methodological Innovations in Political Science in the Era of Large Language Models
Zhenyu Wang
Yi Xu
Dequan Wang
Lingfeng Zhou
Yiqi Zhou
22
0
0
07 Oct 2024
Instigating Cooperation among LLM Agents Using Adaptive Information Modulation
Qiliang Chen
Sepehr Ilami
Nunzio Lore
Babak Heydari
21
1
0
16 Sep 2024
Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory of Mind in Large Language Models
Nunzio Lorè
Alireza Ilami
Babak Heydari
LRM
23
0
0
05 Aug 2024
Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games
Nathan Herr
Fernando Acero
Roberta Raileanu
María Pérez-Ortiz
Zhibin Li
LRM
48
2
0
05 Jul 2024
MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data
Yaobin Ling
Xiaoqian Jiang
Yejin Kim
SyDa
28
3
0
15 Jun 2024
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Ruihan Yang
Jiangjie Chen
Yikai Zhang
Siyu Yuan
Aili Chen
Kyle Richardson
Yanghua Xiao
Deqing Yang
AI4CE
LM&Ro
41
8
0
07 Jun 2024
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making
Chuanhao Li
Runhan Yang
Tiankai Li
Milad Bafarassat
Kourosh Sharifi
Dirk Bergemann
Zhuoran Yang
LLMAG
19
5
0
25 May 2024
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
Chuanneng Sun
Songjun Huang
D. Pompili
LLMAG
21
21
0
17 May 2024
How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO
Man Tik Ng
Hui Tung Tse
Jen-tse Huang
Jingjing Li
Wenxuan Wang
Michael R. Lyu
LLMAG
24
9
0
22 Apr 2024
LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models
Yadong Zhang
Shaoguang Mao
Tao Ge
Xun Wang
Adrian de Wynter
Yan Xia
Wenshan Wu
Ting Song
Man Lan
Furu Wei
LRM
67
48
0
01 Apr 2024
Can Large Language Model Agents Simulate Human Trust Behaviors?
Chengxing Xie
Canyu Chen
Feiran Jia
Ziyu Ye
Kai Shu
Adel Bibi
Ziniu Hu
Philip H. S. Torr
Bernard Ghanem
G. Li
LM&Ro
LLMAG
52
51
0
07 Feb 2024
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Jiangjie Chen
Siyu Yuan
Rong Ye
Bodhisattwa Prasad Majumder
Kyle Richardson
LLMAG
ELM
17
46
0
09 Oct 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
197
2,232
0
22 Mar 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
313
8,261
0
28 Jan 2022
1