Behaviour Suite for Reinforcement Learning

International Conference on Learning Representations (ICLR), 2019
9 August 2019
Ian Osband
Yotam Doron
Matteo Hessel
John Aslanides
Eren Sezener
Andre Saraiva
Katrina McKinney
Tor Lattimore
Csaba Szepesvári
Satinder Singh
Benjamin Van Roy
R. Sutton
David Silver
H. V. Hasselt
    OffRL

Papers citing "Behaviour Suite for Reinforcement Learning"

50 / 138 papers shown
Octax: Accelerated CHIP-8 Arcade Environments for Reinforcement Learning in JAX
Waris Radji
Thomas Michel
Hector Piteau
228
1
0
02 Oct 2025
On the Limits of Tabular Hardness Metrics for Deep RL: A Study with the Pharos Benchmark
Michelangelo Conserva
Remo Sasso
Paulo E. Rauber
OffRL, LMTD
165
1
0
21 Sep 2025
Priors Matter: Addressing Misspecification in Bayesian Deep Q-Learning
Pascal R. van der Vaart
Neil Yorke-Smith
M. Spaan
BDL, UQCV
206
0
0
29 Aug 2025
Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling
Yongyi Wang
Lingfeng Li
Bozhou Chen
Ang Li
Hanyu Liu
Qirui Zheng
Xionghui Yang
Wenxin Li
160
1
0
06 Aug 2025
Benchmarking Partial Observability in Reinforcement Learning with a Suite of Memory-Improvable Domains
Ruo Yu Tao
Kaicheng Guo
Cameron Allen
George Konidaris
221
4
0
31 Jul 2025
T-GRAB: A Synthetic Diagnostic Benchmark for Learning on Temporal Graphs
Alireza Dizaji
Benedict Aaron Tjandra
Mehrab Hamidi
Shenyang Huang
Guillaume Rabusseau
373
0
0
14 Jul 2025
Measurement-Aligned Sampling for Inverse Problem
Shaorong Zhang
Rob Brekelmans
Yunshu Wu
Greg Ver Steeg
DiffM
353
0
0
13 Jun 2025
Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL
Claude Formanek
Omayma Mahjoub
Louay Ben Nessir
Sasha Abramowitz
Ruan de Kock
...
Daniel Rajaonarivonivelomanantsoa
Arnol Fokam
Siddarth S. Singh
Ulrich A. Mbou Sob
Arnu Pretorius
OffRL
315
0
0
28 May 2025
Multi-Agent Reinforcement Learning Simulation for Environmental Policy Synthesis
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
James Rudd-Jones
Mirco Musolesi
María Pérez-Ortiz
233
4
0
17 Apr 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
325
1
0
23 Mar 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
412
5
0
18 Mar 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
554
14
0
14 Feb 2025
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Egor Cherepanov
Nikita Kachaev
Artem Zholus
A. Kovalev
Aleksandr I. Panov
229
4
0
09 Dec 2024
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
International Conference on Learning Representations (ICLR), 2024
Vindula Jayawardana
Baptiste Freydt
Ao Qu
Cameron Hickert
Zhongxia Yan
Cathy Wu
242
4
0
19 Oct 2024
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
382
7
0
11 Oct 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
252
12
0
15 Aug 2024
The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Saurabh Kumar
Hong Jun Jeon
Alex Lewandowski
Benjamin Van Roy
235
5
0
06 Aug 2024
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
425
12
0
28 Jul 2024
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM, OffRL
521
588
0
24 Jul 2024
Evaluating AI Evaluation: Perils and Prospects
John Burden
ELM
248
16
0
12 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
561
13
0
09 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
684
63
0
05 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
383
6
0
30 Jun 2024
RRLS : Robust Reinforcement Learning Suite
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OffRL
313
3
0
12 Jun 2024
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Aditya A. Ramesh
Kenny Young
Louis Kirsch
Jürgen Schmidhuber
327
2
0
06 May 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Neural Information Processing Systems (NeurIPS), 2024
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Fahad Razak
Vasilis Syrgkanis
485
2
0
10 Apr 2024
Policy Mirror Descent with Lookahead
Kimon Protopapas
Anas Barakat
268
4
0
21 Mar 2024
Mastering Memory Tasks with World Models
Mohammad Reza Samsami
Artem Zholus
Janarthanan Rajendran
Sarath Chandar
CLL, OffRL
383
41
0
07 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
422
69
0
26 Feb 2024
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
333
2
0
07 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL, OffRL
335
8
0
05 Feb 2024
Efficiently Quantifying Individual Agent Importance in Cooperative MARL
Omayma Mahjoub
Ruan de Kock
Siddarth S. Singh
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
FAtt
386
4
0
13 Dec 2023
Probabilistic Inference in Reinforcement Learning Done Right
Neural Information Processing Systems (NeurIPS), 2023
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDL, OffRL
362
11
0
22 Nov 2023
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
376
11
0
21 Nov 2023
EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
Thomas M. Moerland
Matthias Muller-Brockhausen
Zhao Yang
Andrius Bernatavicius
Koen Ponse
Tom Kouwenhoven
Andreas Sauter
Michiel van der Meer
Bram M. Renting
Aske Plaat
OffRL
418
2
0
17 Nov 2023
Real-Time Recurrent Reinforcement Learning
Julian Lemmel
Radu Grosu
497
7
0
08 Nov 2023
Towards model-free RL algorithms that scale well with unstructured data
Joseph Modayil
Zaheer Abbas
OffRL
195
5
0
03 Nov 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
International Conference on Learning Representations (ICLR), 2023
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
470
4
0
27 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Neural Information Processing Systems (NeurIPS), 2023
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Jiaming Song
Yu Liu
428
23
0
12 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
379
9
0
10 Oct 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
International Conference on Machine Learning (ICML), 2023
Paul Mattes
Rainer Schlosser
R. Herbrich
444
8
0
08 Oct 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
IEEE International Conference on Robotics and Automation (ICRA), 2023
Wenke Huang
Filippos Christianos
Zhibin Li
335
16
0
28 Sep 2023
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden
Konstantinos Voudouris
Ryan Burnell
Danaja Rutar
Lucy G. Cheke
José Hernández-Orallo
196
11
0
21 Sep 2023
A State Representation for Diminishing Rewards
Neural Information Processing Systems (NeurIPS), 2023
Ted Moskovitz
Samo Hromadka
Ahmed Touati
Diana Borsa
M. Sahani
238
4
0
07 Sep 2023
Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity
Achkan Salehi
Stéphane Doncieux
182
1
0
25 Aug 2023
Multi-Dimensional Ability Diagnosis for Machine Learning Algorithms
Science China Information Sciences (Sci China Inf Sci), 2023
Qi Liu
Zhengze Gong
Zhenya Huang
Chuanren Liu
Hengshu Zhu
Zhi Li
Enhong Chen
Hui Xiong
177
3
0
14 Jul 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Neural Information Processing Systems (NeurIPS), 2023
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
554
62
0
07 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
385
0
0
05 Jul 2023
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
IEEE Access, 2023
Eric Pulick
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
OffRL
180
2
0
30 Jun 2023
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX
Matthew Macfarlane
Daniel Luo
Donal Byrne
Shikha Surana
Sasha Abramowitz
...
Siddarth S. Singh
Daniel Furelos-Blanco
Victor Le
Arnu Pretorius
Alexandre Laterre
349
50
0
16 Jun 2023