Discount Factor as a Regularizer in Reinforcement Learning

4 July 2020

Papers citing "Discount Factor as a Regularizer in Reinforcement Learning"

From Projection to Prediction: Beyond Logits for Scalable Language Models

Jianbing Dong

Jianbin Chang

142

18 Nov 2025

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

...

194

10 Oct 2025

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Liefeng Bo

394

29 Jul 2025

Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space

Zhiliang Chen

Xinyuan Niu

Chuan-Sheng Foo

Bryan Kian Hsiang Low

546

14 Mar 2025

On the Effective Horizon of Inverse Reinforcement Learning

Adaptive Agents and Multi-Agent Systems (AAMAS), 2023

Yiqing Xu

Finale Doshi-Velez

David Hsu

403

21 Feb 2025

EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning

Bowen Zheng

Ran Cheng

Kay Chen Tan

519

25 Jan 2025

Bootstrapped Reward Shaping

AAAI Conference on Artificial Intelligence (AAAI), 2025

279

02 Jan 2025

On shallow planning under partial observability

Randy Lefebvre

Audrey Durand

OffRL

269

22 Jul 2024

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

J. Obando-Ceron

J. G. Araújo

Rameswar Panda

Pablo Samuel Castro

450

25 Jun 2024

Oracle-Efficient Reinforcement Learning for Max Value Ensembles

259

27 May 2024

A Survey Analyzing Generalization in Deep Reinforcement Learning

Ezgi Korkmaz

OffRL

343

04 Jan 2024

Behavior Alignment via Reward Function Optimization

Neural Information Processing Systems (NeurIPS), 2023

Bruno Castro da Silva

433

29 Oct 2023

Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards

Neural Information Processing Systems (NeurIPS), 2023

Silviu Pitis

239

30 Sep 2023

The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

International Conference on Machine Learning (ICML), 2023

Sarah Rathnam

S. Parbhoo

Weiwei Pan

Susan A. Murphy

Finale Doshi-Velez

OffRL

197

20 Jun 2023

On the Value of Myopic Behavior in Policy Reuse

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Zhen Wang

Xuelong Li

240

28 May 2023

A Tale of Sampling and Estimation in Discounted Reinforcement Learning

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Alberto Maria Metelli

Mirco Mutti

Marcello Restelli

OffRL

254

11 Apr 2023

UGAE: A Novel Approach to Non-exponential Discounting

231

11 Feb 2023

POMRL: No-Regret Learning-to-Plan with Increasing Horizons

196

30 Dec 2022

Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

Matthew Macfarlane

Laurence Midgley

Alexandre Laterre

293

19 Nov 2022

Rethinking Value Function Learning for Generalization in Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

266

18 Oct 2022

Applications of Reinforcement Learning in Finance -- Trading with a Double Deep Q-Network

212

28 Jun 2022

On the Role of Discount Factor in Offline Reinforcement Learning

International Conference on Machine Learning (ICML), 2022

322

07 Jun 2022

Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks

Pashootan Vaezipoor

216

03 Jun 2022

Learning to Transfer Role Assignment Across Team Sizes

Adaptive Agents and Multi-Agent Systems (AAMAS), 2022

163

17 Apr 2022

A Survey on Reinforcement Learning Methods in Character Animation

337

07 Mar 2022

One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning

200

30 Oct 2021

EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization

Sahar Roostaie

M. Ebadzadeh

256

26 Oct 2021

Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning

Sarah Rathnam

Susan Murphy

Finale Doshi-Velez

OffRL

192

16 Sep 2021

Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods

244

13 Sep 2021

Active Reinforcement Learning over MDPs

Qi Yang

Peng Yang

Shengcai Liu

280

05 Aug 2021

Towards Automatic Actor-Critic Solutions to Continuous Control

209

16 Jun 2021

On-Policy Deep Reinforcement Learning for the Average-Reward Criterion

International Conference on Machine Learning (ICML), 2021

Yiming Zhang

George Andriopoulos

OffRL

337

14 Jun 2021

Taylor Expansion of Discount Factors

International Conference on Machine Learning (ICML), 2021

233

11 Jun 2021

Heuristic-Guided Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2021

380

05 Jun 2021

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Adaptive Agents and Multi-Agent Systems (AAMAS), 2020

Rémi Tachet des Combes

536

02 Oct 2020

Forward and inverse reinforcement learning sharing network weights and hyperparameters

E. Uchibe

Kenji Doya

193

17 Aug 2020