v1v2 (latest)

On the role of planning in model-based deep reinforcement learning

8 November 2020

Jessica B. Hamrick

A. Friesen

Feryal M. P. Behbahani

Papers citing "On the role of planning in model-based deep reinforcement learning"

50 / 50 papers shown

Bootstrap Off-policy with World Model

482

01 Nov 2025

Path Channels and Plan Extension Kernels: a Mechanistic Description of Planning in a Sokoban RNN

313

11 Jun 2025

Trust-Region Twisted Policy Improvement

555

08 Apr 2025

Extendable Planning via Multiscale Diffusion

490

25 Mar 2025

On-line Policy Improvement using Monte-Carlo SearchNeural Information Processing Systems (NeurIPS), 1996

Gerald Tesauro

Gregory R. Galperin

460

276

09 Jan 2025

Demystifying MuZero Planning: Interpreting the Learned ModelIEEE Transactions on Artificial Intelligence (IEEE TAI), 2024

327

07 Nov 2024

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

469

15 Oct 2024

How to Choose a Reinforcement-Learning Algorithm

Julian Rodemann

228

30 Jul 2024

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Qian Liu

327

132

13 Jun 2024

Learning to Play Atari in a World of Tokens

Pranav Agarwal

Sheldon Andrews

Samira Ebrahimi Kahou

OffRL

262

03 Jun 2024

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

237

22 May 2024

How does the primate brain combine generative and discriminative computations in vision?

Todd Gureckis

...

262

11 Jan 2024

Simple Hierarchical Planning with Diffusion

287

05 Jan 2024

Predictive auxiliary objectives in deep RL mimic learning in the brainInternational Conference on Learning Representations (ICLR), 2023

Ching Fang

Kimberly L. Stachenfeld

310

09 Oct 2023

Efficient Planning with Latent DiffusionInternational Conference on Learning Representations (ICLR), 2023

Wenhao Li

DiffM

435

30 Sep 2023

Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023

317

27 Jul 2023

What model does MuZero learn?European Conference on Artificial Intelligence (ECAI), 2023

Jinke He

Thomas M. Moerland

F. Oliehoek

357

01 Jun 2023

Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseInformation Sciences (Inf. Sci.), 2023

216

29 May 2023

The Update-Equivalence Framework for Decision-Time PlanningInternational Conference on Learning Representations (ICLR), 2023

J. Zico Kolter

345

25 Apr 2023

235

09 Feb 2023

Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving

244

08 Feb 2023

PushWorld: A benchmark for manipulation planning with tools and movable obstacles

Miguel Lazaro-Gredilla

Dileep George

357

24 Jan 2023

Safe Reinforcement Learning using Data-Driven Predictive ControlInternational Conference on Communications, Signal Processing, and their Applications (ICCSPA), 2022

248

20 Nov 2022

Continuous Monte Carlo Graph SearchAdaptive Agents and Multi-Agent Systems (AAMAS), 2022

973

04 Oct 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

387

18 Sep 2022

A model-based approach to meta-Reinforcement Learning: Transformers and tree searchThe European Symposium on Artificial Neural Networks (ESANN), 2022

Brieuc Pinon

Jean-Charles Delvenne

Raphaël Jungers

OffRL

230

24 Aug 2022

Efficient Planning in a Compact Latent Action SpaceInternational Conference on Learning Representations (ICLR), 2022

Tianjun Zhang

320

22 Aug 2022

Intelligent problem-solving as integrated hierarchical reinforcement learningNature Machine Intelligence (Nat. Mach. Intell.), 2022

293

18 Aug 2022

Symphony: Learning Realistic and Diverse Agents for Autonomous Driving SimulationIEEE International Conference on Robotics and Automation (ICRA), 2022

259

06 May 2022

Physical Design using Differentiable Learned Simulators

Kelsey R. Allen

Tatiana López-Guevara

Kimberly L. Stachenfeld

Alvaro Sanchez-Gonzalez

277

01 Feb 2022

Inferring perceptual decision making parameters from behavior in production and reproduction tasks

Nils Neupärtl

Constantin Rothkopf

191

31 Dec 2021

Learning Generalizable Behavior via Visual Rewrite Rules

Michael Littman

231

09 Dec 2021

Procedural Generalization by Planning with Self-Supervised World ModelsInternational Conference on Learning Representations (ICLR), 2021

191

02 Nov 2021

Self-Consistent Models and ValuesNeural Information Processing Systems (NeurIPS), 2021

David Silver

257

25 Oct 2021

Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-levelApplied and Computational Engineering (ACE), 2021

122

21 Oct 2021

Neural Algorithmic Reasoners are Implicit PlannersNeural Information Processing Systems (NeurIPS), 2021

Andreea Deac

Petar Velivcković

Ognjen Milinković

Pierre-Luc Bacon

Jian Tang

Mladen Nikolic

OffRL

174

11 Oct 2021

Evaluating model-based planning and planner amortization for continuous control

Alessandro Davide Ialongo

...

Jost Tobias Springenberg

A. Abdolmaleki

N. Heess

J. Merel

Martin Riedmiller

192

07 Oct 2021

Potential-based Reward Shaping in Sokoban

176

10 Sep 2021

Subgoal Search For Complex Reasoning TasksNeural Information Processing Systems (NeurIPS), 2021

267

25 Aug 2021

Deep Multiagent Reinforcement Learning: Challenges and DirectionsArtificial Intelligence Review (AIR), 2021

Thomas Bäck

304

159

29 Jun 2021

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021

Sitao Luan

466

03 Jun 2021

Towards Deeper Deep Reinforcement Learning with Spectral NormalizationNeural Information Processing Systems (NeurIPS), 2021

Johan Bjorck

Daniel Schwalbe-Koda

Kilian Q. Weinberger

349

02 Jun 2021

Learning Neuro-Symbolic Relational Transition Models for Bilevel PlanningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Tomas Lozano-Perez

371

28 May 2021

Transfer Learning and Curriculum Learning in Sokoban

290

25 May 2021

MBRL-Lib: A Modular Library for Model-based Reinforcement Learning

370

20 Apr 2021

Muesli: Combining Improvements in Policy OptimizationInternational Conference on Machine Learning (ICML), 2021

Ivo Danihelka

David Silver

274

13 Apr 2021

Planning and Learning Using Adaptive Entropy Tree SearchIEEE International Joint Conference on Neural Network (IJCNN), 2021

Piotr Kozakowski

Mikolaj Pacek

Piotr Milo's

210

12 Feb 2021

Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short SurveyJournal of Artificial Intelligence Research (JAIR), 2020

903

122

17 Dec 2020

On the model-based stochastic value gradient for continuous reinforcement learningConference on Learning for Dynamics & Control (L4DC), 2020

416

28 Aug 2020

A Unifying Framework for Reinforcement Learning and Planning

535

26 Jun 2020