ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.01839
  4. Cited By
Expected Eligibility Traces
v1v2 (latest)

Expected Eligibility Traces

3 July 2020
H. V. Hasselt
Sephora Madjiheurem
Matteo Hessel
David Silver
André Barreto
Diana Borsa
ArXiv (abs)PDFHTML

Papers citing "Expected Eligibility Traces"

21 / 21 papers shown
Streaming Deep Reinforcement Learning Finally Works
Streaming Deep Reinforcement Learning Finally Works
Mohamed Elsayed
Gautham Vasan
A. R. Mahmood
OffRL
369
21
0
18 Oct 2024
Stacked Universal Successor Feature Approximators for Safety in
  Reinforcement Learning
Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning
Ian Cannon
Washington Garcia
Thomas Gresavage
Joseph Saurine
Ian Leong
Jared Culbertson
OffRL
177
1
0
06 Sep 2024
Sequence Compression Speeds Up Credit Assignment in Reinforcement
  Learning
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Aditya A. Ramesh
Kenny Young
Louis Kirsch
Jürgen Schmidhuber
334
2
0
06 May 2024
From Past to Future: Rethinking Eligibility Traces
From Past to Future: Rethinking Eligibility Traces
Dhawal Gupta
Scott M. Jordan
Shreyas Chaudhari
Bo Liu
Philip S. Thomas
Bruno Castro da Silva
348
4
0
20 Dec 2023
Towards model-free RL algorithms that scale well with unstructured data
Towards model-free RL algorithms that scale well with unstructured data
Joseph Modayil
Zaheer Abbas
OffRL
212
5
0
03 Nov 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
340
4
0
21 Jul 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysisNeural Information Processing Systems (NeurIPS), 2023
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
415
7
0
29 Jun 2023
Maximum State Entropy Exploration using Predecessor and Successor
  Representations
Maximum State Entropy Exploration using Predecessor and Successor RepresentationsNeural Information Processing Systems (NeurIPS), 2023
A. Jain
Lucas Lehnert
Irina Rish
Glen Berseth
281
23
0
26 Jun 2023
Exploring the Noise Resilience of Successor Features and Predecessor
  Features Algorithms in One and Two-Dimensional Environments
Exploring the Noise Resilience of Successor Features and Predecessor Features Algorithms in One and Two-Dimensional Environments
Hyunsung Lee
305
1
0
14 Apr 2023
The Nature of Temporal Difference Errors in Multi-step Distributional
  Reinforcement Learning
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
180
12
0
15 Jul 2022
Predecessor Features
Predecessor Features
D. Bailey
Marcelo G. Mattar
OffRL
136
7
0
01 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
142
2
0
31 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Off-Beat Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
230
2
0
27 May 2022
Marginalized Operators for Off-policy Reinforcement Learning
Marginalized Operators for Off-policy Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
262
0
0
30 Mar 2022
Selective Credit Assignment
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
221
3
0
20 Feb 2022
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining
  Value and Feature Predictions
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature PredictionsAAAI Conference on Artificial Intelligence (AAAI), 2022
Anthony GX-Chen
Veronica Chelu
Blake A. Richards
Joelle Pineau
TTA
231
1
0
05 Jan 2022
Learning Expected Emphatic Traces for Deep RL
Learning Expected Emphatic Traces for Deep RL
Ray Jiang
Shangtong Zhang
Veronica Chelu
Adam White
Hado van Hasselt
OffRL
305
13
0
12 Jul 2021
Emphatic Algorithms for Deep Reinforcement Learning
Emphatic Algorithms for Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Ray Jiang
Tom Zahavy
Zhongwen Xu
Adam White
Matteo Hessel
Charles Blundell
Hado van Hasselt
OffRL
183
20
0
21 Jun 2021
An Information-Theoretic Perspective on Credit Assignment in
  Reinforcement Learning
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
174
20
0
10 Mar 2021
Adaptive Pairwise Weights for Temporal Credit Assignment
Adaptive Pairwise Weights for Temporal Credit AssignmentAAAI Conference on Artificial Intelligence (AAAI), 2021
Zeyu Zheng
Risto Vuorio
Richard L. Lewis
Satinder Singh
257
5
0
09 Feb 2021
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit AssignmentNeural Information Processing Systems (NeurIPS), 2020
Veronica Chelu
Doina Precup
H. V. Hasselt
303
28
0
26 Oct 2020
1
Page 1 of 1