Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.00690
Cited By
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents
1 March 2024
Dominik Jeurissen
Diego Perez-Liebana
Jeremy Gow
Duygu Cakmak
James Kwan
LLMAG
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents"
5 / 5 papers shown
Title
Codenames as a Benchmark for Large Language Models
Matthew Stephenson
Matthew Sidji
Benoît Ronval
LLMAG
LRM
ELM
103
1
0
16 Dec 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
106
10
0
20 Nov 2024
GPT for Games: An Updated Scoping Review (2020-2024)
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAG
AI4TS
AI4CE
46
3
0
01 Nov 2024
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
226
89
0
27 Sep 2021
1