ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.01780
  4. Cited By
Mix&Match - Agent Curricula for Reinforcement Learning

Mix&Match - Agent Curricula for Reinforcement Learning

5 June 2018
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Max Jaderberg
Leonard Hasenclever
Yee Whye Teh
Simon Osindero
N. Heess
Razvan Pascanu
ArXiv (abs)PDFHTML

Papers citing "Mix&Match - Agent Curricula for Reinforcement Learning"

26 / 26 papers shown
Title
Self-Composing Policies for Scalable Continual Reinforcement Learning
Self-Composing Policies for Scalable Continual Reinforcement Learning
Mikel Malagón
Josu Ceberio
Jose A. Lozano
CLL
22
5
0
04 Jun 2025
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
81
0
0
26 Apr 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Dinesh Manocha
Jieyu Zhao
LRM
168
15
0
07 Apr 2025
Knowledge Distillation Performs Partial Variance Reduction
Knowledge Distillation Performs Partial Variance Reduction
M. Safaryan
Alexandra Peste
Dan Alistarh
82
7
0
27 May 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time
  Multi-Robot Cooperative Exploration
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
68
37
0
09 Jan 2023
Improving Policy Optimization with Generalist-Specialist Learning
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
79
25
0
26 Jun 2022
Non-Markovian policies occupancy measures
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
79
1
0
27 May 2022
Environment Generation for Zero-Shot Compositional Reinforcement
  Learning
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
91
43
0
21 Jan 2022
Efficient Transformers in Reinforcement Learning using Actor-Learner
  Distillation
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
99
46
0
04 Apr 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
23
5
0
18 Mar 2021
LISPR: An Options Framework for Policy Reuse with Reinforcement Learning
LISPR: An Options Framework for Policy Reuse with Reinforcement Learning
D. Graves
Jun Jin
Jun Luo
67
2
0
29 Dec 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
106
211
0
04 Oct 2020
Learning to Play against Any Mixture of Opponents
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
70
9
0
29 Sep 2020
Follow the Object: Curriculum Learning for Manipulation Tasks with
  Imagined Goals
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
Ozsel Kilinc
Giovanni Montana
52
5
0
05 Aug 2020
Adaptive Procedural Task Generation for Hard-Exploration Problems
Adaptive Procedural Task Generation for Hard-Exploration Problems
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
76
26
0
01 Jul 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement
  Learning
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
125
88
0
10 Jun 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
Automatic Curriculum Learning For Deep RL: A Short Survey
Rémy Portelas
Cédric Colas
Lilian Weng
Katja Hofmann
Pierre-Yves Oudeyer
ODL
109
176
0
10 Mar 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement
  Learning
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
53
4
0
01 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRLAI4TS
128
193
0
23 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNNVLMCLLAI4CELRM
176
1,839
0
13 Dec 2019
Attentive Multi-Task Deep Reinforcement Learning
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
144
18
0
05 Jul 2019
Growing Action Spaces
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
72
38
0
28 Jun 2019
Information asymmetry in KL-regularized RL
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
61
104
0
03 May 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
65
44
0
18 Mar 2019
Autocurricula and the Emergence of Innovation from Social Interaction: A
  Manifesto for Multi-Agent Intelligence Research
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Joel Z Leibo
Edward Hughes
Marc Lanctot
T. Graepel
79
110
0
02 Mar 2019
Distilling Policy Distillation
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
75
134
0
06 Feb 2019
1