Discount Factor as a Regularizer in Reinforcement Learning

4 July 2020
Ron Amit
Ron Meir
K. Ciosek
    OffRL
arXiv: 2007.02040

Papers citing "Discount Factor as a Regularizer in Reinforcement Learning"

36 / 36 papers shown
From Projection to Prediction: Beyond Logits for Scalable Language Models
Jianbing Dong
Jianbin Chang
142
1
0
18 Nov 2025
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
Peiwen Sun
Shiqiang Lang
Dongming Wu
Yi Ding
Kaituo Feng
...
Zhen Ye
Rui Liu
Y. Liu
Jianan Wang
Xiangyu Yue
LRM
194
3
0
10 Oct 2025
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
Junzhe Li
Yutao Cui
Tao Huang
Yinping Ma
Chun-Kai Fan
Miles Yang
Zhao Zhong
Liefeng Bo
394
87
0
29 Jul 2025
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
Zhiliang Chen
Xinyuan Niu
Chuan-Sheng Foo
Bryan Kian Hsiang Low
546
1
0
14 Mar 2025
On the Effective Horizon of Inverse Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Yiqing Xu
Finale Doshi-Velez
David Hsu
403
1
0
21 Feb 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
519
1
0
25 Jan 2025
Bootstrapped Reward Shaping
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OffRL
279
6
0
02 Jan 2025
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
269
2
0
22 Jul 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
450
20
0
25 Jun 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
259
1
0
27 May 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
343
10
0
04 Jan 2024
Behavior Alignment via Reward Function Optimization
Neural Information Processing Systems (NeurIPS), 2023
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
433
23
0
29 Oct 2023
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
Neural Information Processing Systems (NeurIPS), 2023
Silviu Pitis
239
10
0
30 Sep 2023
The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Sarah Rathnam
S. Parbhoo
Weiwei Pan
Susan A. Murphy
Finale Doshi-Velez
OffRL
197
6
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Kang Xu
Chenjia Bai
Delin Qu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
240
2
0
28 May 2023
A Tale of Sampling and Estimation in Discounted Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Alberto Maria Metelli
Mirco Mutti
Marcello Restelli
OffRL
254
4
0
11 Apr 2023
UGAE: A Novel Approach to Non-exponential Discounting
Ariel Kwiatkowski
Vicky Kalogeiton
Julien Pettré
Marie-Paule Cani
OffRL
231
3
0
11 Feb 2023
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
Khimya Khetarpal
Claire Vernade
Brendan O'Donoghue
Satinder Singh
Tom Zahavy
OffRL
196
0
0
30 Dec 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Matthew Macfarlane
Laurence Midgley
Alexandre Laterre
293
0
0
19 Nov 2022
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
266
16
0
18 Oct 2022
Applications of Reinforcement Learning in Finance -- Trading with a Double Deep Q-Network
Frensi Zejnullahu
Maurice Moser
Joerg Osterrieder
AIFin
212
8
0
28 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
322
26
0
07 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL
LRM
216
5
0
03 Jun 2022
Learning to Transfer Role Assignment Across Team Sizes
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
D. Nguyen
Phuoc Nguyen
Svetha Venkatesh
T. Tran
163
12
0
17 Apr 2022
A Survey on Reinforcement Learning Methods in Character Animation
Ariel Kwiatkowski
Eduardo Alvarado
Vicky Kalogeiton
Chenxi Liu
Julien Pettré
M. van de Panne
Marie-Paule Cani
AI4CE
337
66
0
07 Mar 2022
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Matthew Macfarlane
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
200
6
0
30 Oct 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
256
8
0
26 Oct 2021
Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning
Sarah Rathnam
Susan Murphy
Finale Doshi-Velez
OffRL
192
1
0
16 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
244
10
0
13 Sep 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
Shengcai Liu
280
0
0
05 Aug 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
209
7
0
16 Jun 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
International Conference on Machine Learning (ICML), 2021
Yiming Zhang
George Andriopoulos
OffRL
337
57
0
14 Jun 2021
Taylor Expansion of Discount Factors
International Conference on Machine Learning (ICML), 2021
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
233
8
0
11 Jun 2021
Heuristic-Guided Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
380
77
0
05 Jun 2021
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Adaptive Agents and Multi-Agent Systems (AAMAS), 2020
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
536
15
0
02 Oct 2020
Forward and inverse reinforcement learning sharing network weights and hyperparameters
E. Uchibe
Kenji Doya
193
22
0
17 Aug 2020