ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08329
  4. Cited By
Value-driven Hindsight Modelling
v1v2 (latest)

Value-driven Hindsight Modelling

Neural Information Processing Systems (NeurIPS), 2020
19 February 2020
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Value-driven Hindsight Modelling"

7 / 7 papers shown
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysisNeural Information Processing Systems (NeurIPS), 2023
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
418
7
0
29 Jun 2023
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
469
8
0
08 Dec 2021
Free Will Belief as a consequence of Model-based Reinforcement Learning
Free Will Belief as a consequence of Model-based Reinforcement LearningArtificial General Intelligence (AGI), 2021
E. Rehn
3DV
120
1
0
14 Nov 2021
Self-Consistent Models and Values
Self-Consistent Models and ValuesNeural Information Processing Systems (NeurIPS), 2021
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
274
9
0
25 Oct 2021
Policy Gradients Incorporating the Future
Policy Gradients Incorporating the FutureInternational Conference on Learning Representations (ICLR), 2021
David Venuto
Elaine Lau
Doina Precup
Ofir Nachum
OffRL
328
9
0
04 Aug 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy OptimizationInternational Conference on Machine Learning (ICML), 2021
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
314
69
0
13 Apr 2021
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Counterfactual Credit Assignment in Model-Free Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
Thomas Mesnard
T. Weber
Fabio Viola
S. Thakoor
Alaa Saade
...
A. Guez
Éric Moulines
Marcus Hutter
Lars Buesing
Rémi Munos
CMLOffRL
296
69
0
18 Nov 2020
1
Page 1 of 1