ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.02096
  4. Cited By
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment
  Design
v1v2 (latest)

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design

Neural Information Processing Systems (NeurIPS), 2020
3 December 2020
Michael Dennis
Natasha Jaques
Eugene Vinitsky
Alexandre M. Bayen
Stuart J. Russell
Andrew Critch
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design"

50 / 202 papers shown
MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization
MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization
Boyuan Wu
OffRL
274
0
0
24 Nov 2025
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Daniel Furelos-Blanco
Charles Pert
Frederik Kelbel
Alex F Spies
Alessandra Russo
Michael Dennis
184
0
0
16 Nov 2025
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient
Niklas Lauffer
Ameesh Shah
Micah Carroll
Sanjit A. Seshia
Stuart J. Russell
Michael Dennis
AAML
125
3
0
12 Nov 2025
Scaling Multi-Agent Environment Co-Design with Diffusion Models
Scaling Multi-Agent Environment Co-Design with Diffusion Models
Hao Xiang Li
Michael Amir
Amanda Prorok
328
0
0
05 Nov 2025
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
210
2
0
04 Nov 2025
RobustVLA: Robustness-Aware Reinforcement Post-Training for Vision-Language-Action Models
RobustVLA: Robustness-Aware Reinforcement Post-Training for Vision-Language-Action Models
Hongyin Zhang
Shuo Zhang
Junxi Jin
Qixin Zeng
Runze Li
Donglin Wang
VLM
468
3
0
03 Nov 2025
Automating Benchmark Design
Automating Benchmark Design
Amanda Dsouza
Harit Vishwakarma
Zhengyang Qi
Justin Bauer
Derek Pham
Thomas Walshe
Armin Parchami
Frederic Sala
P. Varma
166
0
0
28 Oct 2025
Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability
Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability
Eline M. Bovy
Caleb Probine
Marnix Suilen
Ufuk Topcu
Nils Jansen
169
0
0
27 Oct 2025
Heterogeneous Adversarial Play in Interactive Environments
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu
Xinyi Yang
Jiayu Zhan
Wei Liang
Chi Zhang
Yixin Zhu
198
1
0
21 Oct 2025
Procedural Game Level Design with Deep Reinforcement Learning
Procedural Game Level Design with Deep Reinforcement Learning
Miraç Buğra Özkan
134
0
0
16 Oct 2025
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
James Pedley
Benjamin Etheridge
Stephen J. Roberts
Francesco Quinzan
OffRLAAML
155
0
0
14 Oct 2025
BuilderBench: The Building Blocks of Intelligent Agents
BuilderBench: The Building Blocks of Intelligent Agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
Karthik Narasimhan
Benjamin Eysenbach
ELM
168
2
0
07 Oct 2025
Video Game Level Design as a Multi-Agent Reinforcement Learning Problem
Video Game Level Design as a Multi-Agent Reinforcement Learning Problem
Sam Earle
Zehua Jiang
Eugene Vinitsky
Julian Togelius
176
0
0
06 Oct 2025
Adversarial Reinforcement Learning Framework for ESP Cheater Simulation
Adversarial Reinforcement Learning Framework for ESP Cheater Simulation
Inkyu Park
J. Lee
Taehwan Kwon
Juheon Choi
Seungku Kim
Junsu Kim
Kimin Lee
AAML
271
0
0
29 Sep 2025
A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
Isaac Peterson
Christopher Allred
Jacob Morrey
Mario Harper
190
0
0
26 Sep 2025
Imagined Autocurricula
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktaschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
283
2
0
11 Sep 2025
Bootstrapping Task Spaces for Self-Improvement
Bootstrapping Task Spaces for Self-Improvement
Minqi Jiang
Andrei Lupu
Yoram Bachrach
LRM
223
5
0
04 Sep 2025
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments
Wilka Carvalho
Vikram Goddla
Ishaan Sinha
Hoon Shin
Kunal Jha
OffRL
206
3
0
21 Aug 2025
Towards Agent-based Test Support Systems: An Unsupervised Environment Design Approach
Towards Agent-based Test Support Systems: An Unsupervised Environment Design Approach
Collins O.Ogbodo
Timothy J. Rogers
Mattia Dal Borgo
David J. Wagg
146
0
0
19 Aug 2025
Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem
Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem
Michael Li
Eric Bae
Christopher Haberland
Natasha Jaques
137
0
0
12 Aug 2025
GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring
GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring
Linji Wang
Zifan Xu
Peter Stone
Xuesu Xiao
212
0
0
05 Aug 2025
Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL
Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL
Ahmed Abouelazm
Johannes Ratz
Philip Schorner
J. M. Zöllner
270
0
0
25 Jul 2025
How Should We Meta-Learn Reinforcement Learning Algorithms?
How Should We Meta-Learn Reinforcement Learning Algorithms?
Alexander David Goldie
Zilin Wang
Jakob Foerster
Jakob N. Foerster
Shimon Whiteson
OffRL
323
5
0
23 Jul 2025
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design
Geonwoo Cho
Jaegyun Im
Jihwan Lee
Hojun Yi
Sejin Kim
Sundong Kim
278
0
0
24 Jun 2025
Robust Dynamic Material Handling via Adaptive Constrained Evolutionary Reinforcement Learning
Robust Dynamic Material Handling via Adaptive Constrained Evolutionary Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Chengpeng Hu
Ziming Wang
Bo Yuan
Jialin Liu
Chengqi Zhang
Xin Yao
242
0
0
20 Jun 2025
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Tennison Liu
M. Schaar
AIFinLRM
446
8
0
05 Jun 2025
Deep learning image burst stacking to reconstruct high-resolution ground-based solar observations
Deep learning image burst stacking to reconstruct high-resolution ground-based solar observations
Christoph Schirninger
Robert Jarolim
Astrid M. Veronig
Christoph Kuckein
453
1
0
05 Jun 2025
ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real
ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real
Youwei Yu
Junhong Xu
Lantao Liu
384
2
0
02 Jun 2025
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
Caroline Wang
Arrasy Rahman
Jiaxun Cui
Yoonchang Sung
Peter Stone
435
4
0
29 May 2025
An Optimisation Framework for Unsupervised Environment Design
An Optimisation Framework for Unsupervised Environment Design
Nathan Monette
Alistair Letcher
Michael Beukman
Matthew Jackson
Alexander Rutherford
Alexander David Goldie
Jakob N. Foerster
392
5
0
27 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
352
7
0
26 May 2025
Self-Evolving Curriculum for LLM Reasoning
Self-Evolving Curriculum for LLM Reasoning
Xiaoyin Chen
Jiarui Lu
Minsu Kim
Dinghuai Zhang
Jian Tang
Alexandre Piché
Nicolas Angelard-Gontier
Yoshua Bengio
Ehsan Kamalloo
ReLMLRM
737
60
0
20 May 2025
Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning
Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning
Ahmed Abouelazm
Tim Weinstein
Tim Joseph
Philip Schorner
J. M. Zöllner
434
1
0
13 May 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
595
10
0
27 Apr 2025
Improving Human-AI Coordination through Online Adversarial Training and Generative Models
Improving Human-AI Coordination through Online Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
527
2
0
21 Apr 2025
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
Kunal Jha
Wilka Carvalho
Yancheng Liang
S. Du
Max Kleiman-Weiner
Natasha Jaques
558
9
0
17 Apr 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
OvercookedV2: Rethinking Overcooked for Zero-Shot CoordinationInternational Conference on Learning Representations (ICLR), 2025
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
452
10
0
22 Mar 2025
Causally Aligned Curriculum Learning
Causally Aligned Curriculum LearningInternational Conference on Learning Representations (ICLR), 2025
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
342
9
0
21 Mar 2025
A Generalist Hanabi Agent
A Generalist Hanabi AgentInternational Conference on Learning Representations (ICLR), 2025
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
972
4
0
17 Mar 2025
Automatic Curriculum Design for Zero-Shot Human-AI Coordination
Automatic Curriculum Design for Zero-Shot Human-AI CoordinationIEEE Access (IEEE Access), 2025
Won-Sang You
Tae-Gwan Ha
Seo-Young Lee
Kyung-Joong Kim
489
0
0
10 Mar 2025
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Alon Albalak
Duy Phung
Nathan Lile
Rafael Rafailov
Kanishk Gandhi
...
Anikait Singh
Chase Blagden
Robert Z. Sparks
Dakota Mahan
Nick Haber
OffRLLRM
355
64
0
24 Feb 2025
Training a Generally Curious Agent
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
644
19
0
24 Feb 2025
Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network
Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian NetworkAdaptive Agents and Multi-Agent Systems (AAMAS), 2025
Vincent Hsiao
Mark Roberts
Laura M. Hiatt
George Konidaris
Dana Nau
329
0
0
21 Feb 2025
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Improving Environment Novelty Quantification for Effective Unsupervised Environment DesignNeural Information Processing Systems (NeurIPS), 2025
Jayden Teoh
Wenjun Li
Pradeep Varakantham
309
4
0
08 Feb 2025
A Minimax Approach to Ad Hoc Teamwork
A Minimax Approach to Ad Hoc TeamworkAdaptive Agents and Multi-Agent Systems (AAMAS), 2025
Victor Villin
Thomas Kleine Buening
Christos Dimitrakakis
345
2
0
04 Feb 2025
Reinforcement Teaching
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
498
2
0
28 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
382
7
0
22 Jan 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRLAI4CE
459
2
0
22 Jan 2025
A Research Agenda for Usability and Generalisation in Reinforcement Learning
A Research Agenda for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
470
1
0
22 Dec 2024
Neuromodulated Meta-Learning
Neuromodulated Meta-Learning
Wenwen Qiang
Huijie Guo
Jingyao Wang
Jiangmeng Li
Changwen Zheng
Hui Xiong
Gang Hua
432
1
0
11 Nov 2024
12345
Next
Page 1 of 5