Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2007.02040
Cited By
Discount Factor as a Regularizer in Reinforcement Learning
4 July 2020
Ron Amit
Ron Meir
K. Ciosek
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Discount Factor as a Regularizer in Reinforcement Learning"
36 / 36 papers shown
From Projection to Prediction: Beyond Logits for Scalable Language Models
Jianbing Dong
Jianbin Chang
142
1
0
18 Nov 2025
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
Peiwen Sun
Shiqiang Lang
Dongming Wu
Yi Ding
Kaituo Feng
...
Zhen Ye
Rui Liu
Y. Liu
Jianan Wang
Xiangyu Yue
LRM
194
3
0
10 Oct 2025
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
Junzhe Li
Yutao Cui
Tao Huang
Yinping Ma
Chun-Kai Fan
Miles Yang
Zhao Zhong
Liefeng Bo
394
87
0
29 Jul 2025
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
Zhiliang Chen
Xinyuan Niu
Chuan-Sheng Foo
Bryan Kian Hsiang Low
546
1
0
14 Mar 2025
On the Effective Horizon of Inverse Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Yiqing Xu
Finale Doshi-Velez
David Hsu
403
1
0
21 Feb 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
519
1
0
25 Jan 2025
Bootstrapped Reward Shaping
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OffRL
279
6
0
02 Jan 2025
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
269
2
0
22 Jul 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
450
20
0
25 Jun 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
259
1
0
27 May 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
343
10
0
04 Jan 2024
Behavior Alignment via Reward Function Optimization
Neural Information Processing Systems (NeurIPS), 2023
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
433
23
0
29 Oct 2023
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
Neural Information Processing Systems (NeurIPS), 2023
Silviu Pitis
239
10
0
30 Sep 2023
The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Sarah Rathnam
S. Parbhoo
Weiwei Pan
Susan A. Murphy
Finale Doshi-Velez
OffRL
197
6
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Kang Xu
Chenjia Bai
Delin Qu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
240
2
0
28 May 2023
A Tale of Sampling and Estimation in Discounted Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Alberto Maria Metelli
Mirco Mutti
Marcello Restelli
OffRL
254
4
0
11 Apr 2023
UGAE: A Novel Approach to Non-exponential Discounting
Ariel Kwiatkowski
Vicky Kalogeiton
Julien Pettré
Marie-Paule Cani
OffRL
231
3
0
11 Feb 2023
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
Khimya Khetarpal
Claire Vernade
Brendan O'Donoghue
Satinder Singh
Tom Zahavy
OffRL
196
0
0
30 Dec 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Matthew Macfarlane
Laurence Midgley
Alexandre Laterre
293
0
0
19 Nov 2022
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
266
16
0
18 Oct 2022
Applications of Reinforcement Learning in Finance -- Trading with a Double Deep Q-Network
Frensi Zejnullahu
Maurice Moser
Joerg Osterrieder
AIFin
212
8
0
28 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
322
26
0
07 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL
LRM
216
5
0
03 Jun 2022
Learning to Transfer Role Assignment Across Team Sizes
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
D. Nguyen
Phuoc Nguyen
Svetha Venkatesh
T. Tran
163
12
0
17 Apr 2022
A Survey on Reinforcement Learning Methods in Character Animation
Ariel Kwiatkowski
Eduardo Alvarado
Vicky Kalogeiton
Chenxi Liu
Julien Pettré
M. van de Panne
Marie-Paule Cani
AI4CE
337
66
0
07 Mar 2022
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Matthew Macfarlane
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
200
6
0
30 Oct 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
256
8
0
26 Oct 2021
Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning
Sarah Rathnam
Susan Murphy
Finale Doshi-Velez
OffRL
192
1
0
16 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
244
10
0
13 Sep 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
Shengcai Liu
280
0
0
05 Aug 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
209
7
0
16 Jun 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
International Conference on Machine Learning (ICML), 2021
Yiming Zhang
George Andriopoulos
OffRL
337
57
0
14 Jun 2021
Taylor Expansion of Discount Factors
International Conference on Machine Learning (ICML), 2021
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
233
8
0
11 Jun 2021
Heuristic-Guided Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
380
77
0
05 Jun 2021
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Adaptive Agents and Multi-Agent Systems (AAMAS), 2020
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
536
15
0
02 Oct 2020
Forward and inverse reinforcement learning sharing network weights and hyperparameters
E. Uchibe
Kenji Doya
193
22
0
17 Aug 2020
1
Page 1 of 1