Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1901.03559
Cited By
An investigation of model-free planning
11 January 2019
A. Guez
M. Berk Mirza
Karol Gregor
Rishabh Kabra
S. Racanière
T. Weber
David Raposo
Adam Santoro
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy Lillicrap
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An investigation of model-free planning"
19 / 19 papers shown
Title
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
30
0
0
08 Apr 2025
Learning a Hierarchical Planner from Humans in Multiple Generations
Leonardo Hernandez Cano
Yewen Pu
Robert D. Hawkins
Josh Tenenbaum
Armando Solar-Lezama
23
2
0
17 Oct 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
Ken Kansky
Skanda Vaidyanath
Scott Swingle
Xinghua Lou
Miguel Lazaro-Gredilla
Dileep George
26
4
0
24 Jan 2023
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
56
183
0
30 Aug 2022
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin G. Walters
Lawson L. S. Wong
20
7
0
08 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygó'zd'z
Damian Stachura
Piotr Pikekos
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
LRM
18
8
0
01 Jun 2022
Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac
Petar Velivcković
Ognjen Milinković
Pierre-Luc Bacon
Jian Tang
Mladen Nikolic
OffRL
32
23
0
11 Oct 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
238
89
0
27 Sep 2021
Solving Sokoban with forward-backward reinforcement learning
Yaron Shoham
G. Elidan
OffRL
32
6
0
05 May 2021
Policy-Guided Heuristic Search with Guarantees
Laurent Orseau
Levi H. S. Lelis
29
26
0
21 Mar 2021
HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving
Sirui Xie
Xiaojian Ma
Peiyu Yu
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
42
20
0
22 Feb 2021
REALab: An Embedded Perspective on Tampering
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
19
10
0
17 Nov 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
24
83
0
10 Jun 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
24
32
0
07 Feb 2020
Combining Q-Learning and Search with Amortized Value Estimates
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
OffRL
27
47
0
05 Dec 2019
The Principle of Unchanged Optimality in Reinforcement Learning Generalization
A. Irpan
Xingyou Song
OffRL
25
7
0
02 Jun 2019
Omega-Regular Objectives in Model-Free Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
8
144
0
26 Sep 2018
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
233
7,904
0
13 Jun 2015
1