ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.06858
  4. Cited By
Soft policy optimization using dual-track advantage estimator

Soft policy optimization using dual-track advantage estimator

15 September 2020
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
ArXivPDFHTML

Papers citing "Soft policy optimization using dual-track advantage estimator"

1 / 1 papers shown
Title
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
152
3,777
0
18 Nov 2015
1