2409.09702
Cited By
GFlowNet Pretraining with Inexpensive Rewards
15 September 2024
Mohit Pandey
G. Subbaraj
Emmanuel Bengio
AI4CE
Papers citing
"GFlowNet Pretraining with Inexpensive Rewards"
2 / 2 papers shown
Title
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Adam Younsi
Abdalgader Abubaker
M. Seddik
Hakim Hacid
Salem Lahlou
LRM
54
0
0
28 Apr 2025
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Maciej Wołczyk
Bartłomiej Cupiał
M. Ostaszewski
Michał Bortkiewicz
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
43
13
0
05 Feb 2024