Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.05407
Cited By
Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
15 March 2017
Sainbayar Sukhbaatar
Zeming Lin
Ilya Kostrikov
Gabriel Synnaeve
Arthur Szlam
Rob Fergus
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play"
50 / 61 papers shown
Title
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Dinesh Manocha
Jieyu Zhao
LRM
78
1
0
07 Apr 2025
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
61
3
0
21 Mar 2025
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
42
3
0
09 Jul 2024
MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning
Jiali Cheng
Hadi Amiri
BDL
43
3
0
21 Jun 2024
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
31
28
0
22 May 2024
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes
Sherif M. Abdelfattah
K. Merrick
Jiankun Hu
23
4
0
18 Aug 2023
MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula
Lijun Sun
Yu-Cheng Chang
Chao Lyu
Chin-Teng Lin
Yuhui Shi
38
1
0
27 Jul 2023
Policy-Based Self-Competition for Planning Problems
Jonathan Pirnay
Q. Göttl
Jakob Burger
D. G. Grimm
34
3
0
07 Jun 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
22
3
0
17 May 2023
Proximal Curriculum for Reinforcement Learning Agents
Georgios Tzannetos
Bárbara Gomes Ribeiro
Parameswaran Kamalaruban
Adish Singla
32
5
0
25 Apr 2023
Automaton-Guided Curriculum Generation for Reinforcement Learning Agents
Yash Shukla
A. Kulkarni
Robert Wright
Alvaro Velasquez
Jivko Sinapov
18
1
0
11 Apr 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
26
2
0
02 Feb 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
35
108
0
18 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
41
18
0
05 Jan 2023
Solving Collaborative Dec-POMDPs with Deep Reinforcement Learning Heuristics
Nitsan Soffair
28
0
0
09 Nov 2022
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Shuo Cheng
Danfei Xu
54
37
0
23 Oct 2022
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
29
39
0
11 Oct 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
S. Feizi
OffRL
22
2
0
28 Aug 2022
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment
Yilei Zeng
Jiali Duan
Yongqian Li
Emilio Ferrara
Lerrel Pinto
Chloe Kuo
Stefanos Nikolaidis
48
3
0
04 Aug 2022
Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks
David M. Klee
Ondrej Biza
Robert W. Platt
OffRL
24
1
0
22 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
43
13
0
11 Jul 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
29
4
0
23 Jun 2022
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
43
4
0
12 Apr 2022
ACuTE: Automatic Curriculum Transfer from Simple to Complex Environments
Yash Shukla
Christopher Thierauf
Ramtin Hosseini
Gyan Tatiya
Jivko Sinapov
19
9
0
11 Apr 2022
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder
Minqi Jiang
Michael Dennis
Mikayel Samvelyan
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
31
117
0
02 Mar 2022
Rule Mining over Knowledge Graphs via Reinforcement Learning
Lihan Chen
Sihang Jiang
Jingping Liu
Chao Wang
Shenmin Zhang
Chenhao Xie
Jiaqing Liang
Yanghua Xiao
Rui Song
36
19
0
21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
28
43
0
21 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Situated Dialogue Learning through Procedural Environment Generation
Prithviraj Ammanabrolu
Renee Jia
Mark O. Riedl
109
14
0
07 Oct 2021
Learning Multi-Objective Curricula for Robotic Policy Learning
Jikun Kang
Miao Liu
Abhinav Gupta
C. Pal
Xue Liu
Jie Fu
34
4
0
06 Oct 2021
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
129
95
0
06 Oct 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Oliver Groth
Markus Wulfmeier
Giulia Vezzani
Vibhavari Dasagi
Tim Hertweck
Roland Hafner
N. Heess
Martin Riedmiller
LRM
38
20
0
17 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
28
181
0
27 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
Jaekyeom Kim
Seohong Park
Gunhee Kim
32
32
0
27 Jun 2021
Training a First-Order Theorem Prover from Synthetic Data
Vlad Firoiu
Eser Aygun
Ankit Anand
Zafarali Ahmed
Xavier Glorot
Laurent Orseau
Lei Zhang
Doina Precup
Shibl Mourad
NAI
21
13
0
05 Mar 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
79
76
0
13 Jan 2021
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
33
33
0
10 Nov 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
32
125
0
22 Jun 2020
Automatic Curriculum Learning through Value Disagreement
Yunzhi Zhang
Pieter Abbeel
Lerrel Pinto
29
103
0
17 Jun 2020
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey
Sanmit Narvekar
Bei Peng
Matteo Leonetti
Jivko Sinapov
Matthew E. Taylor
Peter Stone
ODL
152
458
0
10 Mar 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
Rémy Portelas
Cédric Colas
Lilian Weng
Katja Hofmann
Pierre-Yves Oudeyer
ODL
17
167
0
10 Mar 2020
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
41
1,794
0
13 Dec 2019
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
Juncheng Li
Qing Guo
Siliang Tang
Haizhou Shi
Fei Wu
Yueting Zhuang
William Yang Wang
SSL
40
68
0
18 Nov 2019
A New Framework for Multi-Agent Reinforcement Learning -- Centralized Training and Exploration with Decentralized Execution via Policy Distillation
Gang Chen
17
40
0
21 Oct 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
37
0
28 Jun 2019
Curriculum Learning for Cumulative Return Maximization
Francesco Foglino
Christiano Coletto Christakou
Ricardo Luna Gutierrez
Matteo Leonetti
31
9
0
13 Jun 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
27
85
0
10 Jun 2019
1
2
Next