Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.06865
Cited By
Hyperbolic Discounting and Learning over Multiple Horizons
19 February 2019
W. Fedus
Carles Gelada
Yoshua Bengio
Marc G. Bellemare
Hugo Larochelle
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hyperbolic Discounting and Learning over Multiple Horizons"
24 / 24 papers shown
Title
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
A. Harutyunyan
38
5
0
15 Jul 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Aaron C. Courville
42
1
0
07 May 2024
Markov Decision Processes with Time-Varying Geometric Discounting
Jiarui Gan
Ann-Kathrin Hennes
R. Majumdar
Debmalya Mandal
Goran Radanović
13
1
0
19 Jul 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
23
1
0
29 May 2023
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Charline Le Lan
Joshua Greaves
Jesse Farebrother
Mark Rowland
Fabian Pedregosa
Rishabh Agarwal
Marc G. Bellemare
49
8
0
08 Dec 2022
Correlation of the importances of neural network weights calculated by modern methods of overcoming catastrophic forgetting
Alexey Kutalev
11
0
0
24 Oct 2022
Limited or Biased: Modeling Sub-Rational Human Investors in Financial Markets
Penghang Liu
Kshama Dwarakanath
Svitlana Vyetrenko
Tucker Balch
AIFin
31
5
0
16 Oct 2022
Factors of Influence of the Overestimation Bias of Q-Learning
Julius Wagenbach
M. Sabatelli
15
1
0
11 Oct 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
29
18
0
07 Jun 2022
Streaming Inference for Infinite Non-Stationary Clustering
Rylan Schaeffer
Gabrielle K. Liu
Yilun Du
Scott W. Linderman
Ila Rani Fiete
32
1
0
02 May 2022
Intelligent Autonomous Intersection Management
Udesh Gunarathna
S. Karunasekera
Renata Borovica-Gajic
E. Tanin
18
3
0
09 Feb 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
On the Expressivity of Markov Reward
David Abel
Will Dabney
A. Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
26
82
0
01 Nov 2021
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks
Tianjun Zhang
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
Joseph E. Gonzalez
OffRL
37
16
0
22 Oct 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
25
6
0
13 Sep 2021
Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Kate Rakelly
Abhishek Gupta
Carlos Florensa
Sergey Levine
SSL
26
38
0
14 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
26
5
0
11 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
43
35
0
03 Jun 2021
Scalable Transfer Learning with Expert Models
J. Puigcerver
C. Riquelme
Basil Mustafa
Cédric Renggli
André Susano Pinto
Sylvain Gelly
Daniel Keysers
N. Houlsby
34
62
0
28 Sep 2020
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
17
32
0
08 Jun 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
349
0
27 Apr 2020
Discounted Reinforcement Learning Is Not an Optimization Problem
T. Reiher
R. Shariff
J. Castrillón
Hengshuai Yao
R. Sutton
8
48
0
04 Oct 2019
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
13
32
0
09 Sep 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
22
32
0
18 Feb 2019
1