ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1504.05539
  4. Cited By
Temporal-Difference Networks

Temporal-Difference Networks

21 April 2015
R. Sutton
B. Tanner
    PINNOOD
ArXiv (abs)PDFHTML

Papers citing "Temporal-Difference Networks"

25 / 25 papers shown
Exploring through Random Curiosity with General Value Functions
Exploring through Random Curiosity with General Value FunctionsNeural Information Processing Systems (NeurIPS), 2022
Aditya A. Ramesh
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
294
13
0
18 Nov 2022
Learning how to Interact with a Complex Interface using Hierarchical
  Reinforcement Learning
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Gheorghe Comanici
Amelia Glaese
Anita Gergely
Daniel Toyama
Zafarali Ahmed
Tyler Jackson
P. Hamel
Doina Precup
166
4
0
21 Apr 2022
Learning Agent State Online with Recurrent Generate-and-Test
Learning Agent State Online with Recurrent Generate-and-Test
Alireza Samani
R. Sutton
CLLOffRL
192
2
0
30 Dec 2021
Explainable Artificial Intelligence for Autonomous Driving: A
  Comprehensive Overview and Field Guide for Future Research Directions
Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research DirectionsIEEE Access (IEEE Access), 2021
Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
736
220
0
21 Dec 2021
Representing Knowledge as Predictions (and State as Knowledge)
Representing Knowledge as Predictions (and State as Knowledge)
Mark B. Ring
80
7
0
12 Dec 2021
Towards Safe, Explainable, and Regulated Autonomous Driving
Towards Safe, Explainable, and Regulated Autonomous Driving
Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
529
14
0
20 Nov 2021
A Unified Off-Policy Evaluation Approach for General Value Function
A Unified Off-Policy Evaluation Approach for General Value Function
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
212
2
0
06 Jul 2021
A Deep Reinforcement Learning Approach to Marginalized Importance
  Sampling with the Successor Representation
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor RepresentationInternational Conference on Machine Learning (ICML), 2021
Scott Fujimoto
David Meger
Doina Precup
248
17
0
12 Jun 2021
Predictive Representation Learning for Language Modeling
Predictive Representation Learning for Language Modeling
Qingfeng Lan
Luke N. Kumar
Martha White
Alona Fyshe
OffRLAI4TS
190
1
0
29 May 2021
Autotelic Agents with Intrinsically Motivated Goal-Conditioned
  Reinforcement Learning: a Short Survey
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short SurveyJournal of Artificial Intelligence Research (JAIR), 2020
Cédric Colas
Tristan Karch
Olivier Sigaud
Pierre-Yves Oudeyer
963
127
0
17 Dec 2020
C-Learning: Learning to Achieve Goals via Recursive Classification
C-Learning: Learning to Achieve Goals via Recursive ClassificationInternational Conference on Learning Representations (ICLR), 2020
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
402
92
0
17 Nov 2020
Offline Learning of Counterfactual Predictions for Real-World Robotic
  Reinforcement Learning
Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2020
Jun Jin
D. Graves
Cameron Haigh
Jun Luo
Martin Jägersand
SSLOffRL
380
6
0
11 Nov 2020
What's a Good Prediction? Challenges in evaluating an agent's knowledge
What's a Good Prediction? Challenges in evaluating an agent's knowledge
Alex Kearney
Anna Koop
P. Pilarski
ELM
219
2
0
23 Jan 2020
Discovery of Useful Questions as Auxiliary Tasks
Discovery of Useful Questions as Auxiliary TasksNeural Information Processing Systems (NeurIPS), 2019
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
205
88
0
10 Sep 2019
Meta-descent for Online, Continual Prediction
Meta-descent for Online, Continual PredictionAAAI Conference on Artificial Intelligence (AAAI), 2019
Andrew Jacobsen
M. Schlegel
Cam Linke
T. Degris
Adam White
Martha White
292
25
0
17 Jul 2019
Deep Reinforcement Learning
Deep Reinforcement Learning
Yuxi Li
VLMOffRL
422
139
0
15 Oct 2018
General Value Function Networks
General Value Function NetworksJournal of Artificial Intelligence Research (JAIR), 2018
M. Schlegel
Andrew Jacobsen
Zaheer Abbas
Andrew Patterson
Adam White
Martha White
382
31
0
18 Jul 2018
Convergent Tree Backup and Retrace with Function Approximation
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati
Pierre-Luc Bacon
Doina Precup
Pascal Vincent
338
41
0
25 May 2017
Learning to Make Predictions In Partially Observable Environments
  Without a Generative Model
Learning to Make Predictions In Partially Observable Environments Without a Generative ModelJournal of Artificial Intelligence Research (JAIR), 2011
Erik Talvitie
Satinder Singh
224
20
0
16 Jan 2014
Avoiding Confusion between Predictors and Inhibitors in Value Function
  Approximation
Avoiding Confusion between Predictors and Inhibitors in Value Function ApproximationInternational Conference on Learning Representations (ICLR), 2013
Patrick C. Connor
Thomas Trappenberg
TDI
113
0
0
19 Dec 2013
Scaling Life-long Off-policy Learning
Scaling Life-long Off-policy LearningInternational Conference on Development and Learning (ICDL), 2012
Adam White
Joseph Modayil
R. Sutton
CLLOffRL
223
26
0
27 Jun 2012
Temporal-Difference Networks for Dynamical Systems with Continuous
  Observations and Actions
Temporal-Difference Networks for Dynamical Systems with Continuous Observations and ActionsConference on Uncertainty in Artificial Intelligence (UAI), 2009
Christopher M. Vigorito
AI4CE
154
2
0
09 May 2012
Multi-timescale Nexting in a Reinforcement Learning Robot
Multi-timescale Nexting in a Reinforcement Learning RobotAdaptive Behavior (AB), 2011
Joseph Modayil
Adam White
R. Sutton
503
132
0
06 Dec 2011
Toward a Classification of Finite Partial-Monitoring Games
Toward a Classification of Finite Partial-Monitoring GamesTheoretical Computer Science (TCS), 2010
András Antos
Gábor Bartók
D. Pál
Csaba Szepesvári
678
99
0
10 Feb 2011
A Monte Carlo AIXI Approximation
A Monte Carlo AIXI Approximation
J. Veness
K. S. Ng
Marcus Hutter
W. Uther
David Silver
384
3
0
04 Sep 2009
1
Page 1 of 1