2307.13824
Offline Reinforcement Learning with On-Policy Q-Function Regularization
25 July 2023
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
Matthieu Geist
OffRL
Papers citing "Offline Reinforcement Learning with On-Policy Q-Function Regularization"
2 / 2 papers shown
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
296
11
0
06 Feb 2025
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
227
0
0
19 Aug 2024