2410.05656
Cited By
On the Modeling Capabilities of Large Language Models for Sequential Decision Making
8 October 2024
Martin Klissarov
Devon Hjelm
Alexander Toshev
Bogdan Mazoure
LM&Ro
ELM
OffRL
LRM
Papers citing
"On the Modeling Capabilities of Large Language Models for Sequential Decision Making"
2 / 2 papers shown
RM-R1: Reward Modeling as Reasoning
X. Chen
Gaotang Li
Z. Wang
Bowen Jin
Cheng Qian
...
Y. Zhang
D. Zhang
Tong Zhang
Hanghang Tong
Heng Ji
ReLM
OffRL
LRM
42
0
0
05 May 2025
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
82
0
0
29 Apr 2025