2412.13602
Cited By
Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games
18 December 2024
Wenye Lin, Jonathan Roberts, Yunhan Yang, Samuel Albanie, Zongqing Lu, Kai Han
LRM
ELM
Papers citing "Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games"
1 / 1 papers shown
Title
MastermindEval: A Simple But Scalable Reasoning Benchmark
Jonas Golde, Patrick Haller, Fabio Barth, Alan Akbik
LRM
ReLM
ELM
51
2
0
07 Mar 2025