Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.01780
Cited By
Mix&Match - Agent Curricula for Reinforcement Learning
5 June 2018
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Max Jaderberg
Leonard Hasenclever
Yee Whye Teh
Simon Osindero
N. Heess
Razvan Pascanu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Mix&Match - Agent Curricula for Reinforcement Learning"
26 / 26 papers shown
Title
Self-Composing Policies for Scalable Continual Reinforcement Learning
Mikel Malagón
Josu Ceberio
Jose A. Lozano
CLL
22
5
0
04 Jun 2025
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
81
0
0
26 Apr 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Dinesh Manocha
Jieyu Zhao
LRM
168
15
0
07 Apr 2025
Knowledge Distillation Performs Partial Variance Reduction
M. Safaryan
Alexandra Peste
Dan Alistarh
82
7
0
27 May 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
68
37
0
09 Jan 2023
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
79
25
0
26 Jun 2022
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
79
1
0
27 May 2022
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
91
43
0
21 Jan 2022
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
102
46
0
04 Apr 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
25
5
0
18 Mar 2021
LISPR: An Options Framework for Policy Reuse with Reinforcement Learning
D. Graves
Jun Jin
Jun Luo
72
2
0
29 Dec 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
106
211
0
04 Oct 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
70
9
0
29 Sep 2020
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
Ozsel Kilinc
Giovanni Montana
52
5
0
05 Aug 2020
Adaptive Procedural Task Generation for Hard-Exploration Problems
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
76
26
0
01 Jul 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
125
88
0
10 Jun 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
Rémy Portelas
Cédric Colas
Lilian Weng
Katja Hofmann
Pierre-Yves Oudeyer
ODL
112
176
0
10 Mar 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
53
4
0
01 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
128
193
0
23 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
176
1,839
0
13 Dec 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
144
18
0
05 Jul 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
72
38
0
28 Jun 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
64
104
0
03 May 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
65
44
0
18 Mar 2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Joel Z Leibo
Edward Hughes
Marc Lanctot
T. Graepel
79
110
0
02 Mar 2019
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
77
134
0
06 Feb 2019
1