ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.08810
  4. Cited By
The Predictron: End-To-End Learning and Planning

The Predictron: End-To-End Learning and Planning

28 December 2016
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
Tim Harley
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
ArXivPDFHTML

Papers citing "The Predictron: End-To-End Learning and Planning"

50 / 173 papers shown
Title
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
67
0
0
06 Mar 2025
FACTS: A Factored State-Space Framework For World Modelling
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
139
0
0
28 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
39
1
0
11 Oct 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of
  Dyna-style Planning
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael H. Bowling
34
0
0
27 Jun 2024
AlphaZeroES: Direct score maximization outperforms planning loss
  minimization
AlphaZeroES: Direct score maximization outperforms planning loss minimization
Carlos Martin
Tuomas Sandholm
28
0
0
12 Jun 2024
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term
  Planning
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang
Qingyuan Wu
Weida Li
Dylan R. Ashley
Francesco Faccio
Chao Huang
Jürgen Schmidhuber
AI4CE
26
0
0
12 Jun 2024
A New View on Planning in Online Reinforcement Learning
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
16
0
0
03 Jun 2024
MuDreamer: Learning Predictive World Models without Reconstruction
MuDreamer: Learning Predictive World Models without Reconstruction
Maxime Burchi
Radu Timofte
34
3
0
23 May 2024
World Models for Autonomous Driving: An Initial Survey
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
32
31
0
05 Mar 2024
Video as the New Language for Real-World Decision Making
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
29
45
0
27 Feb 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
27
2
0
07 Feb 2024
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Jian Xie
Kai Zhang
Jiangjie Chen
Tinghui Zhu
Renze Lou
Yuandong Tian
Yanghua Xiao
Yu-Chuan Su
LLMAG
LM&Ro
50
127
0
02 Feb 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous
  Vehicles
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles
Hemanth Manjunatha
Panagiotis Tsiotras
17
0
0
10 Dec 2023
A Unified View on Solving Objective Mismatch in Model-Based
  Reinforcement Learning
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
28
6
0
10 Oct 2023
Multi-timestep models for Model-based Reinforcement Learning
Multi-timestep models for Model-based Reinforcement Learning
Abdelhakim Benechehab
Giuseppe Paolo
Albert Thomas
Maurizio Filippone
Balázs Kégl
OffRL
19
0
0
09 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement
  Learning
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
25
3
0
03 Oct 2023
Consistent Aggregation of Objectives with Diverse Time Preferences
  Requires Non-Markovian Rewards
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
Silviu Pitis
35
6
0
30 Sep 2023
AI planning in the imagination: High-level planning on learned abstract
  search spaces
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
T. Sandholm
29
0
0
16 Aug 2023
$λ$-models: Effective Decision-Aware Reinforcement Learning with
  Latent Models
λλλ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
51
0
0
30 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Juho Kannala
J. Pajarinen
OffRL
30
12
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Deep Generative Models for Decision-Making and Control
Michael Janner
32
1
0
15 Jun 2023
What model does MuZero learn?
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
28
4
0
01 Jun 2023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
Query-Policy Misalignment in Preference-Based Reinforcement Learning
Xiao Hu
Jianxiong Li
Xianyuan Zhan
Qing-Shan Jia
Ya-Qin Zhang
22
8
0
27 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical
  Guarantees
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
24
3
0
24 May 2023
KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning
  World Models in Autonomous Driving Tasks
KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning World Models in Autonomous Driving Tasks
Hemanth Manjunatha
A. Pak
Dimitar Filev
Panagiotis Tsiotras
25
5
0
24 May 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
11
2
0
23 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
34
8
0
05 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential
  Decision Making
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
Learning How to Infer Partial MDPs for In-Context Adaptation and
  Exploration
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration
Chentian Jiang
Nan Rosemary Ke
Hado van Hasselt
16
3
0
08 Feb 2023
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning
  in Lifelong Reinforcement Learning
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning
Safa Alver
Doina Precup
OffRL
14
5
0
24 Jan 2023
Reinforcement Learning in System Identification
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
15
0
0
14 Dec 2022
Operator Splitting Value Iteration
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
25
7
0
25 Nov 2022
Reward-Predictive Clustering
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
17
0
0
07 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement
  Learning
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
28
4
0
30 Oct 2022
Auxiliary task discovery through generate-and-test
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun-Jie Luo
Adam White
16
0
0
25 Oct 2022
Distributional Reward Estimation for Effective Multi-Agent Deep
  Reinforcement Learning
Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Jifeng Hu
Yanchao Sun
Hechang Chen
Sili Huang
Haiyin Piao
Yi-Ju Chang
Lichao Sun
11
5
0
14 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep
  Reinforcement Learning in Complex Problems
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
8
0
0
10 Oct 2022
Mastering Spatial Graph Prediction of Road Networks
Mastering Spatial Graph Prediction of Road Networks
Sotiris Anagnostidis
Aurélien Lucchi
Thomas Hofmann
GNN
27
1
0
03 Oct 2022
A model-based approach to meta-Reinforcement Learning: Transformers and
  tree search
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
19
3
0
24 Aug 2022
Value-Consistent Representation Learning for Data-Efficient
  Reinforcement Learning
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
32
13
0
25 Jun 2022
Goal-Space Planning with Subgoal Models
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
11
5
0
06 Jun 2022
Deciding What to Model: Value-Equivalent Sampling for Reinforcement
  Learning
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
20
14
0
04 Jun 2022
Between Rate-Distortion Theory & Value Equivalence in Model-Based
  Reinforcement Learning
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
26
1
0
04 Jun 2022
Should Models Be Accurate?
Should Models Be Accurate?
Esraá Saleh
John D. Martin
Anna Koop
Arash Pourzarabi
Michael H. Bowling
23
2
0
22 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous
  Driving Tasks
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
14
5
0
18 May 2022
Investigating the Properties of Neural Network Representations in
  Reinforcement Learning
Investigating the Properties of Neural Network Representations in Reinforcement Learning
Han Wang
Erfan Miahi
Martha White
Marlos C. Machado
Zaheer Abbas
Raksha Kumaraswamy
Vincent Liu
Adam White
17
26
0
30 Mar 2022
Retrieval-Augmented Reinforcement Learning
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters
Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters
Luc Brogat-Motte
Rémi Flamary
Céline Brouard
Juho Rousu
Florence dÁlché-Buc
25
19
0
08 Feb 2022
ExPoSe: Combining State-Based Exploration with Gradient-Based Online
  Search
ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search
Dixant Mittal
Siddharth Aravindan
W. Lee
OnRL
11
3
0
03 Feb 2022
1234
Next