ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.03176
  4. Cited By
MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible
  Reinforcement Learning Experiments

MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments

7 March 2019
K. Young
Tian Tian
ArXivPDFHTML

Papers citing "MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments"

6 / 6 papers shown
Title
Handling Delay in Real-Time Reinforcement Learning
Handling Delay in Real-Time Reinforcement Learning
Ivan Anokhin
Rishav Rishav
Matthew D Riemer
Stephen Chung
Irina Rish
Samira Ebrahimi Kahou
52
0
0
30 Mar 2025
Memory-efficient Reinforcement Learning with Value-based Knowledge
  Consolidation
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
36
8
0
22 May 2022
Balanced Q-learning: Combining the Influence of Optimistic and
  Pessimistic Targets
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
25
5
0
03 Nov 2021
Automating Control of Overestimation Bias for Reinforcement Learning
Automating Control of Overestimation Bias for Reinforcement Learning
Arsenii Kuznetsov
Alexander Grishin
Artem Tsypin
Arsenii Ashukha
Artur Kadurin
Dmitry Vetrov
OffRL
8
2
0
26 Oct 2021
Greedification Operators for Policy Optimization: Investigating Forward
  and Reverse KL Divergences
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
25
29
0
17 Jul 2021
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
21
176
0
16 Feb 2020
1