Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1504.05539
Cited By
Temporal-Difference Networks
21 April 2015
R. Sutton
B. Tanner
PINN
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal-Difference Networks"
25 / 25 papers shown
Exploring through Random Curiosity with General Value Functions
Neural Information Processing Systems (NeurIPS), 2022
Aditya A. Ramesh
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
294
13
0
18 Nov 2022
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Gheorghe Comanici
Amelia Glaese
Anita Gergely
Daniel Toyama
Zafarali Ahmed
Tyler Jackson
P. Hamel
Doina Precup
166
4
0
21 Apr 2022
Learning Agent State Online with Recurrent Generate-and-Test
Alireza Samani
R. Sutton
CLL
OffRL
192
2
0
30 Dec 2021
Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions
IEEE Access (IEEE Access), 2021
Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
736
220
0
21 Dec 2021
Representing Knowledge as Predictions (and State as Knowledge)
Mark B. Ring
80
7
0
12 Dec 2021
Towards Safe, Explainable, and Regulated Autonomous Driving
Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
529
14
0
20 Nov 2021
A Unified Off-Policy Evaluation Approach for General Value Function
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
212
2
0
06 Jul 2021
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
International Conference on Machine Learning (ICML), 2021
Scott Fujimoto
David Meger
Doina Precup
248
17
0
12 Jun 2021
Predictive Representation Learning for Language Modeling
Qingfeng Lan
Luke N. Kumar
Martha White
Alona Fyshe
OffRL
AI4TS
190
1
0
29 May 2021
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey
Journal of Artificial Intelligence Research (JAIR), 2020
Cédric Colas
Tristan Karch
Olivier Sigaud
Pierre-Yves Oudeyer
963
127
0
17 Dec 2020
C-Learning: Learning to Achieve Goals via Recursive Classification
International Conference on Learning Representations (ICLR), 2020
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
402
92
0
17 Nov 2020
Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2020
Jun Jin
D. Graves
Cameron Haigh
Jun Luo
Martin Jägersand
SSL
OffRL
380
6
0
11 Nov 2020
What's a Good Prediction? Challenges in evaluating an agent's knowledge
Alex Kearney
Anna Koop
P. Pilarski
ELM
219
2
0
23 Jan 2020
Discovery of Useful Questions as Auxiliary Tasks
Neural Information Processing Systems (NeurIPS), 2019
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
205
88
0
10 Sep 2019
Meta-descent for Online, Continual Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2019
Andrew Jacobsen
M. Schlegel
Cam Linke
T. Degris
Adam White
Martha White
292
25
0
17 Jul 2019
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
422
139
0
15 Oct 2018
General Value Function Networks
Journal of Artificial Intelligence Research (JAIR), 2018
M. Schlegel
Andrew Jacobsen
Zaheer Abbas
Andrew Patterson
Adam White
Martha White
382
31
0
18 Jul 2018
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati
Pierre-Luc Bacon
Doina Precup
Pascal Vincent
338
41
0
25 May 2017
Learning to Make Predictions In Partially Observable Environments Without a Generative Model
Journal of Artificial Intelligence Research (JAIR), 2011
Erik Talvitie
Satinder Singh
224
20
0
16 Jan 2014
Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation
International Conference on Learning Representations (ICLR), 2013
Patrick C. Connor
Thomas Trappenberg
TDI
113
0
0
19 Dec 2013
Scaling Life-long Off-policy Learning
International Conference on Development and Learning (ICDL), 2012
Adam White
Joseph Modayil
R. Sutton
CLL
OffRL
223
26
0
27 Jun 2012
Temporal-Difference Networks for Dynamical Systems with Continuous Observations and Actions
Conference on Uncertainty in Artificial Intelligence (UAI), 2009
Christopher M. Vigorito
AI4CE
154
2
0
09 May 2012
Multi-timescale Nexting in a Reinforcement Learning Robot
Adaptive Behavior (AB), 2011
Joseph Modayil
Adam White
R. Sutton
503
132
0
06 Dec 2011
Toward a Classification of Finite Partial-Monitoring Games
Theoretical Computer Science (TCS), 2010
András Antos
Gábor Bartók
D. Pál
Csaba Szepesvári
678
99
0
10 Feb 2011
A Monte Carlo AIXI Approximation
J. Veness
K. S. Ng
Marcus Hutter
W. Uther
David Silver
384
3
0
04 Sep 2009
1
Page 1 of 1