Hyperbolic Discounting and Learning over Multiple Horizons

19 February 2019

Papers citing "Hyperbolic Discounting and Learning over Multiple Horizons"

24 / 24 papers shown

Title
Three Dogmas of Reinforcement Learning David Abel Mark K. Ho A. Harutyunyan 38 5 0 15 Jul 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. DÓro Evgenii Nikishin Aaron C. Courville 42 1 0 07 May 2024
Markov Decision Processes with Time-Varying Geometric Discounting Jiarui Gan Ann-Kathrin Hennes R. Majumdar Debmalya Mandal Goran Radanović 13 1 0 19 Jul 2023
Towards a Better Understanding of Representation Dynamics under TD-learning Yunhao Tang Rémi Munos OffRL 23 1 0 29 May 2023
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces Charline Le Lan Joshua Greaves Jesse Farebrother Mark Rowland Fabian Pedregosa Rishabh Agarwal Marc G. Bellemare 49 8 0 08 Dec 2022
Correlation of the importances of neural network weights calculated by modern methods of overcoming catastrophic forgetting Alexey Kutalev 11 0 0 24 Oct 2022
Limited or Biased: Modeling Sub-Rational Human Investors in Financial Markets Penghang Liu Kshama Dwarakanath Svitlana Vyetrenko Tucker Balch AIFin 31 5 0 16 Oct 2022
Factors of Influence of the Overestimation Bias of Q-Learning Julius Wagenbach M. Sabatelli 15 1 0 11 Oct 2022
On the Role of Discount Factor in Offline Reinforcement Learning Haotian Hu Yiqin Yang Qianchuan Zhao Chongjie Zhang OffRL 29 18 0 07 Jun 2022
Streaming Inference for Infinite Non-Stationary Clustering Rylan Schaeffer Gabrielle K. Liu Yilun Du Scott W. Linderman Ila Rani Fiete 32 1 0 02 May 2022
Intelligent Autonomous Intersection Management Udesh Gunarathna S. Karunasekera Renata Borovica-Gajic E. Tanin 18 3 0 09 Feb 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
On the Expressivity of Markov Reward David Abel Will Dabney A. Harutyunyan Mark K. Ho Michael L. Littman Doina Precup Satinder Singh 26 82 0 01 Nov 2021
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks Tianjun Zhang Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine Joseph E. Gonzalez OffRL 37 16 0 22 Oct 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods Xin Guo Anran Hu Junzi Zhang OffRL 25 6 0 13 Sep 2021
Which Mutual-Information Representation Learning Objectives are Sufficient for Control? Kate Rakelly Abhishek Gupta Carlos Florensa Sergey Levine SSL 26 38 0 14 Jun 2021
Taylor Expansion of Discount Factors Yunhao Tang Mark Rowland Rémi Munos Michal Valko OffRL 26 5 0 11 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes Pablo Samuel Castro Tyler Kastner Prakash Panangaden Mark Rowland 43 35 0 03 Jun 2021
Scalable Transfer Learning with Expert Models J. Puigcerver C. Riquelme Basil Mustafa Cédric Renggli André Susano Pinto Sylvain Gelly Daniel Keysers N. Houlsby 34 62 0 28 Sep 2020
Reinforcement Learning Under Moral Uncertainty Adrien Ecoffet Joel Lehman 17 32 0 08 Jun 2020
First return, then explore Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune 47 349 0 27 Apr 2020
Discounted Reinforcement Learning Is Not an Optimization Problem T. Reiher R. Shariff J. Castrillón Hengshuai Yao R. Sutton 8 48 0 04 Oct 2019
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning Kristopher De Asis Alan Chan Silviu Pitis R. Sutton D. Graves 13 32 0 09 Sep 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients Supratik Paul Vitaly Kurin Shimon Whiteson 22 32 0 18 Feb 2019