ResearchTrend.AI

A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle

22 March 2022
Ziniu Li, Tian Xu, Yang Yu

Papers citing "A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle"

2 / 2 papers shown
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette, Martin J. Wainwright
OOD
28
5
0
01 Jun 2022
Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs
Naman Agarwal, Syomantak Chaudhuri, Prateek Jain, Dheeraj M. Nagaraj, Praneeth Netrapalli
OffRL
34
21
0
16 Oct 2021