ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.10027
  4. Cited By
DNA: Proximal Policy Optimization with a Dual Network Architecture

DNA: Proximal Policy Optimization with a Dual Network Architecture

20 June 2022
Mathew H. Aitchison
Penny Sweetser
    OffRL
ArXivPDFHTML

Papers citing "DNA: Proximal Policy Optimization with a Dual Network Architecture"

2 / 2 papers shown
Title
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
30
0
0
07 Apr 2025
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison
Penny Sweetser
Marcus Hutter
50
19
0
05 Oct 2022
1