ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.13824
  4. Cited By
Offline Reinforcement Learning with On-Policy Q-Function Regularization

Offline Reinforcement Learning with On-Policy Q-Function Regularization

25 July 2023
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
Matthieu Geist
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Offline Reinforcement Learning with On-Policy Q-Function Regularization"

2 / 2 papers shown
Title
The Best Instruction-Tuning Data are Those That Fit
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
296
11
0
06 Feb 2025
Enhancing Reinforcement Learning Through Guided Search
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
227
0
0
19 Aug 2024
1