Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

18 September 2017

Matthew J. Hausknecht

Michael Bowling

Papers citing "Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents"

50 / 146 papers shown

Title
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning Simo Alami C. Rim Kaddah Jesse Read Marie-Paule Cani 51 0 0 07 May 2025
Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents Xiang Li Yiyang Hao Doug Fulop 26 0 0 06 May 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning Yongshuai Liu Xin Liu 95 1 0 26 Mar 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models Xintong Duan Yutong He Fahim Tajwar Wen-Tse Chen Ruslan Salakhutdinov Jeff Schneider OffRL AI4CE 101 0 0 22 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC Tyler Clark Mark Towers Christine Evers Jonathon Hare OffRL 38 0 0 06 Nov 2024
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation Bowen Li Zhaoyu Li Qiwei Du Jinqi Luo Wenshan Wang ... Katia P. Sycara Pradeep Kumar Ravikumar Alexander G. Gray X. Si Sebastian A. Scherer AI4CE LRM 81 3 0 01 Nov 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient Wenlong Wang Ivana Dusparic Yucheng Shi Ke Zhang Vinny Cahill Mamba 221 0 0 11 Oct 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning Alihan Hüyük A. R. Koblitz Atefeh Mohajeri M. Andrews OffRL 40 0 0 19 Sep 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Taehyun Cho Seung Han Kyungjae Lee Seokhun Ju Dohyeong Kim Jungwoo Lee 72 0 0 31 Jul 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations Yupei Yang Erdun Gao Fan Feng Xinyue Wang Shikui Tu Lei Xu CML OOD TTA 43 1 0 30 Jul 2024
Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification Thomas Kwa Drake Thomas Adrià Garriga-Alonso 41 1 0 19 Jul 2024
Massively Multiagent Minigames for Training Generalist Agents Kyoung Whan Choe Ryan Sullivan Joseph Suárez AI4CE 34 0 0 07 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets Haoran He C. Chang Huazhe Xu Ling Pan 89 6 0 03 Jun 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. D'Oro Evgenii Nikishin Aaron Courville 44 1 0 07 May 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning Zun Li Michael P. Wellman 42 1 0 30 Apr 2024
Calibration of Continual Learning Models Lanpei Li Elia Piccoli Andrea Cossu Davide Bacciu Vincenzo Lomonaco CLL 45 2 0 11 Apr 2024
The Effective Horizon Explains Deep RL Performance in Stochastic Environments Cassidy Laidlaw Banghua Zhu Stuart J. Russell Anca Dragan 36 2 0 13 Dec 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning Md Masudur Rahman Yexiang Xue 29 4 0 29 Aug 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions Nishil Patel Sebastian Lee Stefano Sarao Mannelli Sebastian Goldt Andrew Saxe OffRL 36 3 0 17 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency Max Schwarzer J. Obando-Ceron Aaron Courville Marc G. Bellemare Rishabh Agarwal Pablo Samuel Castro OffRL 54 85 0 30 May 2023
Approximate Shielding of Atari Agents for Safe Exploration Alexander W. Goodall Francesco Belardinelli 27 2 0 21 Apr 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning Ji-Yun Oh Joonkee Kim Minchan Jeong Se-Young Yun 38 1 0 03 Mar 2023
Stochastic Generative Flow Networks L. Pan Dinghuai Zhang Moksh Jain Longbo Huang Yoshua Bengio BDL 49 31 0 19 Feb 2023
Sample Efficient Deep Reinforcement Learning via Local Planning Dong Yin S. Thiagarajan N. Lazić Nived Rajaraman Botao Hao Csaba Szepesvári 25 4 0 29 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration Martin Klissarov Marlos C. Machado OffRL 26 19 0 26 Jan 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar Rishabh Agarwal Xinyang Geng George Tucker Sergey Levine OffRL 44 48 0 28 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments Daniel Jarrett Corentin Tallec Florent Altché Thomas Mesnard Rémi Munos Michal Valko 48 5 0 18 Nov 2022
Agent-State Construction with Auxiliary Inputs Ruo Yu Tao Adam White Marlos C. Machado 30 5 0 15 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering Andrei A. Rusu Sebastian Flennerhag Dushyant Rao Razvan Pascanu R. Hadsell 39 6 0 22 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning Henrique Donancio L. Vercouter H. Roclawski AI4CE 18 1 0 20 Oct 2022
Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach Tomás Delgado Marco Sánchez Sorondo V. Braberman Sebastián Uchitel OffRL 26 1 0 07 Oct 2022
Hyperbolic Deep Reinforcement Learning Edoardo Cetin B. Chamberlain Michael M. Bronstein Jonathan J. Hunt 48 21 0 04 Oct 2022
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body A. Chiappa Alessandro Marin Vargas Alexander Mathis 34 7 0 28 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL R. Gorsane Omayma Mahjoub Ruan de Kock Roland Dubb Siddarth S. Singh Arnu Pretorius OffRL 44 50 0 21 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning David Bertoin Adil Zouitine Mehdi Zouitine Emmanuel Rachelson 38 30 0 16 Sep 2022
Cell-Free Latent Go-Explore Quentin Gallouedec Emmanuel Dellandrea 19 1 0 31 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter Aleksandar Stanić Yujin Tang David R Ha Jürgen Schmidhuber ELM 29 13 0 05 Aug 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning Adam R. Villaflor Zheng Huang Swapnil Pande John M. Dolan J. Schneider OffRL 25 24 0 21 Jul 2022
MLGOPerf: An ML Guided Inliner to Optimize Performance Amir H. Ashouri Mostafa Elhoushi Yu-Wei Hua Xiang Wang Muhammad Asif Manzoor Bryan Chan Yaoqing Gao 31 13 0 18 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels Edoardo Cetin Philip J. Ball Steve Roberts Oya Celiktutan 35 36 0 03 Jul 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses Eliza Kosoy David M. Chan Adrian Liu Jasmine Collins Bryanna Kaufmann Sandy Han Huang Jessica B. Hamrick John F. Canny Nan Rosemary Ke Alison Gopnik CML AI4CE 28 18 0 16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron Courville Marc G. Bellemare OffRL OnRL 37 63 0 03 Jun 2022
Chain of Thought Imitation with Procedure Cloning Mengjiao Yang Dale Schuurmans Pieter Abbeel Ofir Nachum OffRL 35 30 0 22 May 2022
Characterizing the Action-Generalization Gap in Deep Q-Learning Zhiyuan Zhou Cameron Allen Kavosh Asadi George Konidaris 28 2 0 11 May 2022
Local Feature Swapping for Generalization in Reinforcement Learning David Bertoin Emmanuel Rachelson OOD 29 14 0 13 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand Leon Sievers Johannes Pitz Berthold Bäuml 29 38 0 07 Apr 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach J. Pan Jingwei Huang G. Cheng Yong Zeng AI4CE 24 40 0 19 Mar 2022
Orchestrated Value Mapping for Reinforcement Learning Mehdi Fatemi Arash Tavakoli 27 8 0 14 Mar 2022
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation Alex Long Alan Blair H. V. Hoof 26 3 0 07 Mar 2022