ResearchTrend.AI


On the Modeling Capabilities of Large Language Models for Sequential Decision Making


8 October 2024
Martin Klissarov
Devon Hjelm
Alexander Toshev
Bogdan Mazoure
    LM&Ro
    ELM
    OffRL
    LRM

Papers citing "On the Modeling Capabilities of Large Language Models for Sequential Decision Making"

2 / 2 papers shown
Title
RM-R1: Reward Modeling as Reasoning
X. Chen
Gaotang Li
Z. Wang
Bowen Jin
Cheng Qian
...
Y. Zhang
D. Zhang
Tong Zhang
Hanghang Tong
Heng Ji
ReLM
OffRL
LRM
42
0
0
05 May 2025
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
82
0
0
29 Apr 2025