Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.02693
Cited By
Ready Policy One: World Building Through Active Learning
International Conference on Machine Learning (ICML), 2020
7 February 2020
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Ready Policy One: World Building Through Active Learning"
38 / 38 papers shown
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
James Pedley
Benjamin Etheridge
Stephen J. Roberts
Francesco Quinzan
OffRL
AAML
155
0
0
14 Oct 2025
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
281
0
0
01 Mar 2024
Active Inference as a Model of Agency
Lancelot Da Costa
Samuel Tenka
Dominic Zhao
Noor Sajid
382
15
0
23 Jan 2024
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
504
5
0
30 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
International Conference on Learning Representations (ICLR), 2023
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
OffRL
338
19
0
11 Oct 2023
Predicting Routine Object Usage for Proactive Robot Assistance
Conference on Robot Learning (CoRL), 2023
Maithili Patel
Aswin Prakash
Sonia Chernova
AI4TS
312
11
0
12 Sep 2023
Reward-Free Curricula for Training Robust World Models
International Conference on Learning Representations (ICLR), 2023
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
408
13
0
15 Jun 2023
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
336
4
0
23 May 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
International Conference on Machine Learning (ICML), 2023
Seohong Park
Sergey Levine
298
11
0
08 Feb 2023
Active Exploration based on Information Gain by Particle Filter for Efficient Spatial Concept Formation
Akira Taniguchi
Y. Tabuchi
Tomochika Ishikawa
Lotfi El Hafi
Y. Hagiwara
T. Taniguchi
459
8
0
20 Nov 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
The European Symposium on Artificial Neural Networks (ESANN), 2022
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
223
47
0
25 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Neural Information Processing Systems (NeurIPS), 2022
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
279
13
0
23 Oct 2022
Exploration via Planning for Information about the Optimal Trajectory
Neural Information Processing Systems (NeurIPS), 2022
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
Willie Neiswanger
OffRL
229
9
0
06 Oct 2022
Modelling non-reinforced preferences using selective attention
Noor Sajid
P. Tigas
Zafeirios Fountas
Qinghai Guo
Alexey Zakharov
Lancelot Da Costa
177
1
0
25 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
195
22
0
19 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
International Conference on Machine Learning (ICML), 2022
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
249
42
0
03 Jul 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Journal of Artificial Intelligence Research (JAIR), 2022
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Katharina Eggensperger
Marius Lindauer
AI4CE
484
135
0
11 Jan 2022
An Experimental Design Perspective on Model-Based Reinforcement Learning
Viraj Mehta
Biswajit Paria
J. Schneider
Stefano Ermon
Willie Neiswanger
OffRL
265
24
0
09 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
449
8
0
08 Dec 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Conference on Robot Learning (CoRL), 2021
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
292
14
0
10 Nov 2021
Discovering and Achieving Goals via World Models
Russell Mendonca
Oleh Rybkin
Kostas Daniilidis
Danijar Hafner
Deepak Pathak
343
163
0
18 Oct 2021
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2021
Cong Lu
Philip J. Ball
Jack Parker-Holder
Michael A. Osborne
Stephen J. Roberts
OffRL
300
63
0
08 Oct 2021
Customs Fraud Detection in the Presence of Concept Drift
Tung Mai
Kien Hoang
Aitolkyn Baigutanova
Gaukhartas Alina
Sundong Kim
205
9
0
29 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Zhenxing Ge
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
493
172
0
14 Sep 2021
Centralized Model and Exploration Policy for Multi-Agent RL
Qizhen Zhang
Chris Xiaoxuan Lu
Animesh Garg
Jakob N. Foerster
277
19
0
14 Jul 2021
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Zafeirios Fountas
Karl J. Friston
372
25
0
08 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
AAAI Conference on Artificial Intelligence (AAAI), 2021
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
267
57
0
05 Jun 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction
International Conference on Machine Learning (ICML), 2021
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
240
46
0
13 May 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
International Conference on Machine Learning (ICML), 2021
Philip J. Ball
Cong Lu
Jack Parker-Holder
Stephen J. Roberts
OffRL
384
54
0
12 Apr 2021
Bayesian Distributional Policy Gradients
AAAI Conference on Artificial Intelligence (AAAI), 2021
Luchen Li
A. Faisal
BDL
OffRL
255
11
0
20 Mar 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Sai Li
478
71
0
07 Feb 2021
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Conference on Robot Learning (CoRL), 2020
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
328
15
0
27 Oct 2020
Active Learning for Human-in-the-Loop Customs Inspection
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020
Sundong Kim
Tung Mai
Sungwon Han
Sungwon Park
Thien Khanh
Jaechan So
Karandeep Singh
M. Cha
361
11
0
27 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
214
1
0
24 Oct 2020
Towards Tractable Optimism in Model-Based Reinforcement Learning
Aldo Pacchiano
Philip J. Ball
Jack Parker-Holder
K. Choromanski
Stephen J. Roberts
OffRL
198
12
0
21 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
Machine-mediated learning (ML), 2020
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
211
47
0
12 Jun 2020
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
438
481
0
12 May 2020
Effective Diversity in Population Based Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
486
184
0
03 Feb 2020
1
Page 1 of 1