v1v2v3 (latest)

Behaviour Suite for Reinforcement Learning

International Conference on Learning Representations (ICLR), 2019

9 August 2019

David Silver

ArXiv (abs)PDF HTML Github (1522★)

Papers citing "Behaviour Suite for Reinforcement Learning"

50 / 138 papers shown

Octax: Accelerated CHIP-8 Arcade Environments for Reinforcement Learning in JAX

Waris Radji

Thomas Michel

Hector Piteau

228

02 Oct 2025

On the Limits of Tabular Hardness Metrics for Deep RL: A Study with the Pharos Benchmark

Michelangelo Conserva

Remo Sasso

Paulo E. Rauber

OffRL LMTD

165

21 Sep 2025

Priors Matter: Addressing Misspecification in Bayesian Deep Q-Learning

Pascal R. van der Vaart

Neil Yorke-Smith

M. Spaan

BDL UQCV

206

29 Aug 2025

Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling

160

06 Aug 2025

Benchmarking Partial Observability in Reinforcement Learning with a Suite of Memory-Improvable Domains

221

31 Jul 2025

T-GRAB: A Synthetic Diagnostic Benchmark for Learning on Temporal Graphs

Alireza Dizaji

Benedict Aaron Tjandra

Mehrab Hamidi

Shenyang Huang

Guillaume Rabusseau

373

14 Jul 2025

Measurement-Aligned Sampling for Inverse Problem

353

13 Jun 2025

Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL

...

Daniel Rajaonarivonivelomanantsoa

315

28 May 2025

Multi-Agent Reinforcement Learning Simulation for Environmental Policy SynthesisAdaptive Agents and Multi-Agent Systems (AAMAS), 2025

James Rudd-Jones

Mirco Musolesi

María Pérez-Ortiz

233

17 Apr 2025

KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies

325

23 Mar 2025

SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas

412

18 Mar 2025

Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning

554

14 Feb 2025

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

229

09 Dec 2024

IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024

242

19 Oct 2024

Can we hop in general? A discussion of benchmark selection and design using the Hopper environment

382

11 Oct 2024

D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning

Rafael Rafailov

Kyle Hatch

Anikait Singh

Laura Smith

...

Chelsea Finn

Sergey Levine

OffRL

252

15 Aug 2024

The Need for a Big World Simulator: A Scientific Challenge for Continual Learning

235

06 Aug 2024

NAVIX: Scaling MiniGrid Environments with JAX

425

28 Jul 2024

Gymnasium: A Standard Interface for Reinforcement Learning Environments

...

521

588

24 Jul 2024

Evaluating AI Evaluation: Perils and Prospects

John Burden

ELM

248

12 Jul 2024

Can Learned Optimization Make Reinforcement Learning Less Difficult?

Alexander David Goldie

561

09 Jul 2024

Simplifying Deep Temporal Difference Learning

684

05 Jul 2024

Model-Free Active Exploration in Reinforcement Learning

Alessio Russo

Alexandre Proutiere

OffRL

383

30 Jun 2024

RRLS : Robust Reinforcement Learning Suite

Matthieu Geist

313

12 Jun 2024

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

327

06 May 2024

Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityNeural Information Processing Systems (NeurIPS), 2024

Vahid Balazadeh Meresht

485

10 Apr 2024

Policy Mirror Descent with Lookahead

Kimon Protopapas

Anas Barakat

268

21 Mar 2024

Mastering Memory Tasks with World Models

Mohammad Reza Samsami

Artem Zholus

Janarthanan Rajendran

Sarath Chandar

CLL OffRL

383

07 Mar 2024

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

422

26 Feb 2024

Learning mirror maps in policy mirror descent

333

07 Feb 2024

Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent

335

05 Feb 2024

Efficiently Quantifying Individual Agent Importance in Cooperative MARL

386

13 Dec 2023

Probabilistic Inference in Reinforcement Learning Done RightNeural Information Processing Systems (NeurIPS), 2023

362

22 Nov 2023

minimax: Efficient Baselines for Autocurricula in JAX

376

21 Nov 2023

EduGym: An Environment and Notebook Suite for Reinforcement Learning Education

Thomas M. Moerland

Matthias Muller-Brockhausen

Zhao Yang

Andrius Bernatavicius

418

17 Nov 2023

Real-Time Recurrent Reinforcement Learning

Julian Lemmel

Radu Grosu

497

08 Nov 2023

Towards model-free RL algorithms that scale well with unstructured data

Joseph Modayil

Zaheer Abbas

OffRL

195

03 Nov 2023

Improving Intrinsic Exploration by Creating Stationary ObjectivesInternational Conference on Learning Representations (ICLR), 2023

Roger Creus Castanyer

Javier Civera

Taihú Pire

OffRL

470

27 Oct 2023

LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision ScenariosNeural Information Processing Systems (NeurIPS), 2023

428

12 Oct 2023

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

379

10 Oct 2023

Hieros: Hierarchical Imagination on Structured State Space Sequence World ModelsInternational Conference on Machine Learning (ICML), 2023

Paul Mattes

Rainer Schlosser

R. Herbrich

444

08 Oct 2023

Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation TasksIEEE International Conference on Robotics and Automation (ICRA), 2023

Wenke Huang

Filippos Christianos

Zhibin Li

335

28 Sep 2023

Inferring Capabilities from Task Performance with Bayesian Triangulation

John Burden

Konstantinos Voudouris

Ryan Burnell

Danaja Rutar

Lucy G. Cheke

José Hernández-Orallo

196

21 Sep 2023

A State Representation for Diminishing RewardsNeural Information Processing Systems (NeurIPS), 2023

238

07 Sep 2023

Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity

Achkan Salehi

Stéphane Doncieux

182

25 Aug 2023

Multi-Dimensional Ability Diagnosis for Machine Learning AlgorithmsScience China Information Sciences (Sci China Inf Sci), 2023

Qi Liu

Hengshu Zhu

Enhong Chen

177

14 Jul 2023

When Do Transformers Shine in RL? Decoupling Memory from Credit AssignmentNeural Information Processing Systems (NeurIPS), 2023

Pierre-Luc Bacon

554

07 Jul 2023

LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning

Outongyi Lv

Bingxin Zhou

OffRL

385

05 Jul 2023

Comparing Reinforcement Learning and Human Learning using the Game of Hidden RulesIEEE Access (IEEE Access), 2023

180

30 Jun 2023

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

...

Daniel Furelos-Blanco

Victor Le

Arnu Pretorius

Alexandre Laterre

349

16 Jun 2023