
Title |
|---|
![]() Simple Sensor Intentions for Exploration Tim Hertweck Martin Riedmiller Michael Bloesch Jost Tobias Springenberg Noah Y. Siegel Markus Wulfmeier Agrim Gupta N. Heess |
![]() The Variational Bandwidth Bottleneck: Stochastic Evaluation on an
Information BudgetInternational Conference on Learning Representations (ICLR), 2020 |
![]() Agent57: Outperforming the Atari Human BenchmarkInternational Conference on Machine Learning (ICML), 2020 |
![]() RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated
EnvironmentsInternational Conference on Learning Representations (ICLR), 2020 |
![]() Explore, Discover and Learn: Unsupervised Discovery of State-Covering
SkillsInternational Conference on Machine Learning (ICML), 2020 |
![]() Unsupervised Curricula for Visual Meta-Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2019 |
![]() Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill
DiscoveryAdaptive Agents and Multi-Agent Systems (AAMAS), 2019 |
![]() Hindsight Credit AssignmentNeural Information Processing Systems (NeurIPS), 2019 |
![]() Implicit Generative Modeling for Efficient ExplorationInternational Conference on Machine Learning (ICML), 2019 |
![]() MAVEN: Multi-Agent Variational ExplorationNeural Information Processing Systems (NeurIPS), 2019 |
![]() Dynamics-Aware Unsupervised Discovery of SkillsInternational Conference on Learning Representations (ICLR), 2019 |
![]() Fast Task Inference with Variational Intrinsic Successor FeaturesInternational Conference on Learning Representations (ICLR), 2019 |
![]() Self-Supervised Exploration via DisagreementInternational Conference on Machine Learning (ICML), 2019 |
![]() Learning Novel Policies For TasksInternational Conference on Machine Learning (ICML), 2019 |
![]() Mega-Reward: Achieving Human-Level Play without Extrinsic RewardsAAAI Conference on Artificial Intelligence (AAAI), 2019 |
![]() The Termination CriticInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019 |
![]() Reward learning from human preferences and demonstrations in AtariNeural Information Processing Systems (NeurIPS), 2018 |