Temporal-Difference Networks

21 April 2015

Papers citing "Temporal-Difference Networks"

25 / 25 papers shown

Exploring through Random Curiosity with General Value FunctionsNeural Information Processing Systems (NeurIPS), 2022

Aditya A. Ramesh

Louis Kirsch

Sjoerd van Steenkiste

Jürgen Schmidhuber

294

18 Nov 2022

Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning

Daniel Toyama

166

21 Apr 2022

Learning Agent State Online with Recurrent Generate-and-Test

Alireza Samani

R. Sutton

CLL OffRL

192

30 Dec 2021

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research DirectionsIEEE Access (IEEE Access), 2021

736

220

21 Dec 2021

Representing Knowledge as Predictions (and State as Knowledge)

Mark B. Ring

12 Dec 2021

Towards Safe, Explainable, and Regulated Autonomous Driving

529

20 Nov 2021

A Unified Off-Policy Evaluation Approach for General Value Function

212

06 Jul 2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor RepresentationInternational Conference on Machine Learning (ICML), 2021

Scott Fujimoto

David Meger

Doina Precup

248

12 Jun 2021

Predictive Representation Learning for Language Modeling

190

29 May 2021

Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short SurveyJournal of Artificial Intelligence Research (JAIR), 2020

963

127

17 Dec 2020

C-Learning: Learning to Achieve Goals via Recursive ClassificationInternational Conference on Learning Representations (ICLR), 2020

402

17 Nov 2020

Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2020

380

11 Nov 2020

What's a Good Prediction? Challenges in evaluating an agent's knowledge

219

23 Jan 2020

Discovery of Useful Questions as Auxiliary TasksNeural Information Processing Systems (NeurIPS), 2019

Janarthanan Rajendran

David Silver

205

10 Sep 2019

Meta-descent for Online, Continual PredictionAAAI Conference on Artificial Intelligence (AAAI), 2019

292

17 Jul 2019

Deep Reinforcement Learning

Yuxi Li

VLM OffRL

422

139

15 Oct 2018

General Value Function NetworksJournal of Artificial Intelligence Research (JAIR), 2018

382

18 Jul 2018

Convergent Tree Backup and Retrace with Function Approximation

Ahmed Touati

Pierre-Luc Bacon

Doina Precup

Pascal Vincent

338

25 May 2017

Learning to Make Predictions In Partially Observable Environments Without a Generative ModelJournal of Artificial Intelligence Research (JAIR), 2011

Erik Talvitie

Satinder Singh

224

16 Jan 2014

Avoiding Confusion between Predictors and Inhibitors in Value Function ApproximationInternational Conference on Learning Representations (ICLR), 2013

Patrick C. Connor

Thomas Trappenberg

TDI

113

19 Dec 2013

Scaling Life-long Off-policy LearningInternational Conference on Development and Learning (ICDL), 2012

223

27 Jun 2012

Temporal-Difference Networks for Dynamical Systems with Continuous Observations and ActionsConference on Uncertainty in Artificial Intelligence (UAI), 2009

Christopher M. Vigorito

AI4CE

154

09 May 2012

Multi-timescale Nexting in a Reinforcement Learning RobotAdaptive Behavior (AB), 2011

Joseph Modayil

Adam White

R. Sutton

503

132

06 Dec 2011

Toward a Classification of Finite Partial-Monitoring GamesTheoretical Computer Science (TCS), 2010

678

10 Feb 2011

A Monte Carlo AIXI Approximation

J. Veness

K. S. Ng

Marcus Hutter

W. Uther

David Silver

384

04 Sep 2009