2501.09080
Average-Reward Reinforcement Learning with Entropy Regularization
17 January 2025
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
Papers citing
"Average-Reward Reinforcement Learning with Entropy Regularization"
1 / 1 papers shown
Title
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
46
0
0
28 Mar 2025