Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.02096
Cited By
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
3 December 2020
Michael Dennis
Natasha Jaques
Eugene Vinitsky
Alexandre M. Bayen
Stuart J. Russell
Andrew Critch
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design"
49 / 49 papers shown
Title
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
W. Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
85
0
0
27 Apr 2025
Improving Human-AI Coordination through Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
64
0
0
21 Apr 2025
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Kunal Jha
Wilka Carvalho
Yancheng Liang
S. Du
Max Kleiman-Weiner
Natasha Jaques
22
0
0
17 Apr 2025
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
56
3
0
21 Mar 2025
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
91
0
0
17 Mar 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun-Jie Luo
74
1
0
28 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
81
1
0
22 Jan 2025
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
23
0
0
07 Nov 2024
Environment as Policy: Learning to Race in Unseen Tracks
Hongze Wang
Jiaxu Xing
Nico Messikommer
Davide Scaramuzza
29
1
0
29 Oct 2024
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
Elias Stengel-Eskin
Jaemin Cho
Mohit Bansal
VGen
36
1
0
08 Oct 2024
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
Maria Bauzá
José Enrique Chen
Valentin Dalibard
Nimrod Gileadi
Roland Hafner
...
Martin Riedmiller
Jon Scholz
Konstantinos Bousmalis
Francesco Nori
Nicolas Heess
28
4
0
10 Sep 2024
Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation
Cheems Wang
Yiqin Lv
Yixiu Mao
Yun Qu
Yi Tian Xu
Xiangyang Ji
OOD
TTA
49
6
0
28 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
40
3
0
09 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
46
3
0
25 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
32
18
0
06 Jun 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
35
1
0
26 Mar 2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Zheng Xiong
Risto Vuorio
Jacob Beck
Matthieu Zimmer
Kun Shao
Shimon Whiteson
27
1
0
09 Feb 2024
Curriculum Learning for Cooperation in Multi-Agent Reinforcement Learning
R. Bhati
S. Gottipati
Clodéric Mars
Matthew E. Taylor
30
0
0
19 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
67
5
0
13 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
21
1
0
12 Dec 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRL
OOD
19
6
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
30
119
0
19 Jan 2023
Generalization through Diversity: Improving Unsupervised Environment Design
Wenjun Li
Pradeep Varakantham
Dexun Li
17
7
0
19 Jan 2023
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
34
9
0
23 Nov 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
11
8
0
07 Nov 2022
Environment Design for Inverse Reinforcement Learning
Thomas Kleine Buening
Victor Villin
Christos Dimitrakakis
30
1
0
26 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
22
3
0
20 Oct 2022
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
Abdus Salam Azad
Izzeddin Gur
Jasper Emhoff
Nathaniel Alexis
Aleksandra Faust
Pieter Abbeel
Ion Stoica
SSL
16
12
0
19 Oct 2022
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
13
39
0
11 Oct 2022
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Zifan Xu
Bo Liu
Xuesu Xiao
Anirudh Nair
Peter Stone
21
39
0
10 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
11
6
0
03 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo-wen Li
Ding Zhao
61
45
0
16 Sep 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
14
15
0
19 Jul 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
42
347
0
17 Jun 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
25
233
0
14 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
16
0
0
13 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
32
35
0
29 May 2022
Flexible Multiple-Objective Reinforcement Learning for Chip Placement
Fu-Chieh Chang
Yu-Wei Tseng
Ya-Wen Yu
Ssu-Rui Lee
Alexandru Cioba
...
Chien-Yi Yang
Ren-Chu Wang
Yao-Wen Chang
Tai-Chen Chen
Tung-Chieh Chen
OffRL
25
5
0
13 Apr 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
24
11
0
01 Mar 2022
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training
Peide Huang
Mengdi Xu
Fei Fang
Ding Zhao
59
37
0
19 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
29
7
0
16 Feb 2022
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
18
43
0
21 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
21
100
0
11 Jan 2022
Social Neuro AI: Social Interaction as the "dark matter" of AI
Samuele Bolotta
G. Dumas
11
22
0
31 Dec 2021
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
116
95
0
06 Oct 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
226
89
0
27 Sep 2021
Distributionally Robust Policy Learning via Adversarial Environment Generation
Allen Z. Ren
Anirudha Majumdar
OOD
96
15
0
13 Jul 2021
Training a First-Order Theorem Prover from Synthetic Data
Vlad Firoiu
Eser Aygun
Ankit Anand
Zafarali Ahmed
Xavier Glorot
Laurent Orseau
Lei Zhang
Doina Precup
Shibl Mourad
NAI
19
13
0
05 Mar 2021
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
216
809
0
13 Nov 2016
1