Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.12044
Cited By
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
19 December 2023
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX"
24 / 24 papers shown
Title
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Kunal Jha
Wilka Carvalho
Yancheng Liang
S. Du
Max Kleiman-Weiner
Natasha Jaques
22
0
0
17 Apr 2025
Thinking agents for zero-shot generalization to qualitatively novel tasks
Thomas Miconi
Kevin L McKee
Yicong Zheng
Jed McCaleb
LRM
AI4CE
41
0
0
25 Mar 2025
Partially Observable Reinforcement Learning with Memory Traces
Onno Eberhard
Michael Muehlebach
Claire Vernade
OffRL
33
0
0
19 Mar 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Shuqing Shi
Tristan Tomilin
Joel Z. Leibo
Yali Du
50
0
0
18 Mar 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
76
0
0
24 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
68
4
0
21 Feb 2025
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
18
2
0
17 Nov 2024
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
Ilya Zisman
Alexander Nikulin
Andrei Polubarov
Nikita Lyubaykin
Vladislav Kurenkov
Andrei Polubarov
Igor Kiselev
Vladislav Kurenkov
OffRL
41
1
0
04 Nov 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
36
2
0
30 Oct 2024
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery
Alexander Rutherford
Michael Beukman
Timon Willi
Bruno Lacerda
Nick Hawes
Jakob Foerster
38
4
0
27 Aug 2024
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
29
3
0
28 Jul 2024
RobocupGym: A challenging continuous control benchmark in Robocup
Michael Beukman
Branden Ingram
Geraud Nangue Tasse
Benjamin Rosman
Pravesh Ranchod
OffRL
40
1
0
03 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
44
3
0
25 Jun 2024
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Alexander Nikulin
Ilya Zisman
Alexey Zemtsov
Viacheslav Sinii
99
4
0
13 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
30
0
0
07 Jun 2024
A Batch Sequential Halving Algorithm without Performance Degradation
Sotetsu Koyamada
Soichiro Nishimori
Shin Ishii
23
0
0
01 Jun 2024
Benchmarking General-Purpose In-Context Learning
Fan Wang
Chuan Lin
Yang Cao
Yu Kang
25
1
0
27 May 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
21
24
0
26 Feb 2024
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
13
14
0
20 Dec 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
Ilya Zisman
Vladislav Kurenkov
Alexander Nikulin
Viacheslav Sinii
Sergey Kolesnikov
OffRL
19
9
0
19 Dec 2023
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
19
8
0
21 Nov 2023
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
84
80
0
07 Mar 2023
Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning
H. Wang
Victor Zhong
Karthik Narasimhan
76
44
0
19 Jan 2021
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
237
11,568
0
09 Mar 2017
1