Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.10027
Cited By
DNA: Proximal Policy Optimization with a Dual Network Architecture
20 June 2022
Mathew H. Aitchison
Penny Sweetser
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DNA: Proximal Policy Optimization with a Dual Network Architecture"
2 / 2 papers shown
Title
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
30
0
0
07 Apr 2025
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison
Penny Sweetser
Marcus Hutter
50
19
0
05 Oct 2022
1