Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.18134
Cited By
v1
v2 (latest)
VideoGameBench: Can Vision-Language Models complete popular video games?
23 May 2025
Alex Zhang
Thomas Griffiths
Karthik Narasimhan
Ofir Press
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VideoGameBench: Can Vision-Language Models complete popular video games?"
24 / 24 papers shown
Title
GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps
Muhammad Umair Nasir
Steven D. James
Julian Togelius
ELM
LRM
81
5
0
10 Oct 2024
Diffusion Models Are Real-Time Game Engines
Dani Valevski
Yaniv Leviathan
Moab Arar
Shlomi Fruchter
DiffM
VGen
AI4CE
136
91
0
27 Aug 2024
Language-Guided World Models: A Model-Based Approach to AI Control
Alex Zhang
Khanh Nguyen
Jens Tuyls
Albert Lin
Karthik Narasimhan
LLMAG
88
7
0
24 Jan 2024
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
Tianrui Guan
Fuxiao Liu
Xiyang Wu
Ruiqi Xian
Zongxia Li
...
Lichang Chen
Furong Huang
Yaser Yacoob
Dinesh Manocha
Dinesh Manocha
VLM
MLLM
161
196
0
23 Oct 2023
Eureka: Human-Level Reward Design via Coding Large Language Models
Yecheng Jason Ma
William Liang
Guanzhi Wang
De-An Huang
Osbert Bastani
Dinesh Jayaraman
Yuke Zhu
Linxi Fan
A. Anandkumar
83
324
0
19 Oct 2023
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Xizhou Zhu
Yuntao Chen
Hao Tian
Chenxin Tao
Weijie Su
...
Lewei Lu
Xiaogang Wang
Yu Qiao
Zhaoxiang Zhang
Jifeng Dai
LLMAG
LM&Ro
110
240
0
25 May 2023
Reflexion: Language Agents with Verbal Reinforcement Learning
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG
KELM
141
1,328
0
20 Mar 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
92
616
0
10 Jan 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
470
2,996
0
06 Oct 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAG
LM&Ro
174
522
0
04 Jul 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
144
388
0
17 Jun 2022
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
174
1,437
0
08 Feb 2022
Multi-Stage Episodic Control for Strategic Exploration in Text Games
Jens Tuyls
Shunyu Yao
Sham Kakade
Karthik Narasimhan
81
26
0
04 Jan 2022
The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI
Chuang Gan
Siyuan Zhou
Jeremy Schwartz
S. Alter
Abhishek Bhandwaldar
...
Daniel L. K. Yamins
J. DiCarlo
Josh H. McDermott
Antonio Torralba
J. Tenenbaum
LM&Ro
118
81
0
25 Mar 2021
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar
Xingdi Yuan
Marc-Alexandre Côté
Yonatan Bisk
Adam Trischler
Matthew J. Hausknecht
LM&Ro
LLMAG
115
450
0
08 Oct 2020
Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Shunyu Yao
Rohan Rao
Matthew J. Hausknecht
Karthik Narasimhan
LLMAG
LM&Ro
86
133
0
06 Oct 2020
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
113
181
0
24 Jun 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
128
193
0
23 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
176
1,839
0
13 Dec 2019
Recurrent World Models Facilitate Policy Evolution
David R Ha
Jürgen Schmidhuber
SyDa
TPM
126
957
0
04 Sep 2018
Investigating Human Priors for Playing Video Games
Rachit Dubey
Pulkit Agrawal
Deepak Pathak
Thomas Griffiths
Alexei A. Efros
OffRL
123
146
0
28 Feb 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
183
1,784
0
05 Dec 2017
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Michal Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Ja'skowski
117
701
0
06 May 2016
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
137
12,288
0
19 Dec 2013
1