ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.12044
  4. Cited By
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

19 December 2023
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
ArXivPDFHTML

Papers citing "XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX"

24 / 24 papers shown
Title
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Kunal Jha
Wilka Carvalho
Yancheng Liang
S. Du
Max Kleiman-Weiner
Natasha Jaques
22
0
0
17 Apr 2025
Thinking agents for zero-shot generalization to qualitatively novel tasks
Thinking agents for zero-shot generalization to qualitatively novel tasks
Thomas Miconi
Kevin L McKee
Yicong Zheng
Jed McCaleb
LRM
AI4CE
41
0
0
25 Mar 2025
Partially Observable Reinforcement Learning with Memory Traces
Partially Observable Reinforcement Learning with Memory Traces
Onno Eberhard
Michael Muehlebach
Claire Vernade
OffRL
33
0
0
19 Mar 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Shuqing Shi
Tristan Tomilin
Joel Z. Leibo
Yali Du
50
0
0
18 Mar 2025
Yes, Q-learning Helps Offline In-Context RL
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
76
0
0
24 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
68
4
0
21 Feb 2025
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
18
2
0
17 Nov 2024
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
Ilya Zisman
Alexander Nikulin
Andrei Polubarov
Nikita Lyubaykin
Vladislav Kurenkov
Andrei Polubarov
Igor Kiselev
Vladislav Kurenkov
OffRL
41
1
0
04 Nov 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
36
2
0
30 Oct 2024
No Regrets: Investigating and Improving Regret Approximations for
  Curriculum Discovery
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery
Alexander Rutherford
Michael Beukman
Timon Willi
Bruno Lacerda
Nick Hawes
Jakob Foerster
38
4
0
27 Aug 2024
NAVIX: Scaling MiniGrid Environments with JAX
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
29
3
0
28 Jul 2024
RobocupGym: A challenging continuous control benchmark in Robocup
RobocupGym: A challenging continuous control benchmark in Robocup
Michael Beukman
Branden Ingram
Geraud Nangue Tasse
Benjamin Rosman
Pravesh Ranchod
OffRL
40
1
0
03 Jul 2024
The Overcooked Generalisation Challenge
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
44
3
0
25 Jun 2024
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Alexander Nikulin
Ilya Zisman
Alexey Zemtsov
Viacheslav Sinii
99
4
0
13 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
30
0
0
07 Jun 2024
A Batch Sequential Halving Algorithm without Performance Degradation
A Batch Sequential Halving Algorithm without Performance Degradation
Sotetsu Koyamada
Soichiro Nishimori
Shin Ishii
23
0
0
01 Jun 2024
Benchmarking General-Purpose In-Context Learning
Benchmarking General-Purpose In-Context Learning
Fan Wang
Chuan Lin
Yang Cao
Yu Kang
25
1
0
27 May 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement
  Learning
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
21
24
0
26 Feb 2024
In-Context Reinforcement Learning for Variable Action Spaces
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
13
14
0
20 Dec 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
Emergence of In-Context Reinforcement Learning from Noise Distillation
Ilya Zisman
Vladislav Kurenkov
Alexander Nikulin
Viacheslav Sinii
Sergey Kolesnikov
OffRL
19
9
0
19 Dec 2023
minimax: Efficient Baselines for Autocurricula in JAX
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
19
8
0
21 Nov 2023
Structured State Space Models for In-Context Reinforcement Learning
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
84
80
0
07 Mar 2023
Grounding Language to Entities and Dynamics for Generalization in
  Reinforcement Learning
Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning
H. Wang
Victor Zhong
Karthik Narasimhan
76
44
0
19 Jan 2021
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
237
11,568
0
09 Mar 2017
1