ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.09732
  4. Cited By
Making Deep Q-learning methods robust to time discretization
v1v2 (latest)

Making Deep Q-learning methods robust to time discretization

28 January 2019
Corentin Tallec
Léonard Blier
Yann Ollivier
    OODOffRL
ArXiv (abs)PDFHTML

Papers citing "Making Deep Q-learning methods robust to time discretization"

50 / 52 papers shown
Teaching signal synchronization in deep neural networks with prospective neurons
Teaching signal synchronization in deep neural networks with prospective neurons
Nicoas Zucchet
Qianqian Feng
Axel Laborieux
Friedemann Zenke
Walter Senn
João Sacramento
69
0
0
18 Nov 2025
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Chengxiu Hua
Jiawen Gu
Yushun Tang
261
0
0
20 Oct 2025
Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization
Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization
Ziheng Cheng
Xin Guo
Yufei Zhang
OffRL
104
0
0
28 Sep 2025
Continuous-Time Reinforcement Learning for Asset-Liability Management
Continuous-Time Reinforcement Learning for Asset-Liability Management
Yilie Huang
76
0
0
27 Sep 2025
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
Yilie Huang
Xun Yu Zhou
OffRL
184
1
0
01 Jul 2025
A Temporal Difference Method for Stochastic Continuous Dynamics
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai
Naoya Takeishi
Takehisa Yairi
524
0
0
21 May 2025
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
Yanwei Jia
Du Ouyang
Yufei Zhang
302
9
0
13 Mar 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
323
20
0
04 Feb 2025
Action Gaps and Advantages in Continuous-Time Distributional
  Reinforcement Learning
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Harley Wiltzer
Marc G. Bellemare
David Meger
Patrick Shafto
Yash Jhaveri
170
3
0
14 Oct 2024
Exploratory Optimal Stopping: A Singular Control Formulation
Exploratory Optimal Stopping: A Singular Control Formulation
Jodi Dianetti
Giorgio Ferrari
Renyuan Xu
219
11
0
18 Aug 2024
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy
Lijun Bo
Yijie Huang
Xiang Yu
Tingting Zhang
314
4
0
04 Jul 2024
An Idiosyncrasy of Time-discretization in Reinforcement Learning
An Idiosyncrasy of Time-discretization in Reinforcement Learning
Kris De Asis
Richard S. Sutton
219
0
0
21 Jun 2024
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Yi Xiong
Ningyuan Chen
214
2
0
08 Jun 2024
Reinforcement Learning for Jump-Diffusions, with Financial Applications
Reinforcement Learning for Jump-Diffusions, with Financial Applications
Ningyuan Chen
Lingfei Li
X. Zhou
452
2
0
26 May 2024
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic
  Variation Penalty
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty
Yanwei Jia
193
3
0
19 Apr 2024
Approximate Control for Continuous-Time POMDPs
Approximate Control for Continuous-Time POMDPs
Yannick Eich
Bastian Alt
Heinz Koeppl
170
1
0
02 Feb 2024
Continuous Time Continuous Space Homeostatic Reinforcement Learning
  (CTCS-HRRL) : Towards Biological Self-Autonomous Agent
Continuous Time Continuous Space Homeostatic Reinforcement Learning (CTCS-HRRL) : Towards Biological Self-Autonomous Agent
Hugo Laurençon
Yesoda Bhargava
Riddhi Zantye
Charbel-Raphaël Ségerie
J. Lussange
V. Baths
Boris Gutkin
50
1
0
17 Jan 2024
Data-driven rules for multidimensional reflection problems
Data-driven rules for multidimensional reflection problems
Soren Christensen
Asbjorn Holk Thomsen
Lukas Trottner
172
6
0
11 Nov 2023
Efficient Exploration in Continuous-time Model-based Reinforcement
  Learning
Efficient Exploration in Continuous-time Model-based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Lenart Treven
Jonas Hübotter
Bhavya Sukhija
Florian Dorfler
Andreas Krause
263
16
0
30 Oct 2023
Actor-Critic with variable time discretization via sustained actions
Actor-Critic with variable time discretization via sustained actionsInternational Conference on Neural Information Processing (ICONIP), 2023
Jakub Lyskawa
Pawel Wawrzyñski
OffRL
85
1
0
08 Aug 2023
Policy Optimization for Continuous Reinforcement Learning
Policy Optimization for Continuous Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
349
30
0
30 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy
  Optimization
Reducing the Cost of Cycle-Time Tuning for Real-World Policy OptimizationIEEE International Joint Conference on Neural Network (IJCNN), 2023
Homayoon Farrahi
Rupam Mahmood
168
5
0
09 May 2023
Managing Temporal Resolution in Continuous Value Estimation: A
  Fundamental Trade-off
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-offNeural Information Processing Systems (NeurIPS), 2022
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
332
3
0
17 Dec 2022
Dynamic Decision Frequency with Continuous Options
Dynamic Decision Frequency with Continuous OptionsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Amir-Hossein Karimi
Jun Jin
Jun Luo
A. R. Mahmood
Martin Jägersand
Samuele Tosatto
273
10
0
06 Dec 2022
Simultaneously Updating All Persistence Values in Reinforcement Learning
Simultaneously Updating All Persistence Values in Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Luca Sabbioni
Luca Al Daire
L. Bisi
Alberto Maria Metelli
Marcello Restelli
143
3
0
21 Nov 2022
Convergence of policy gradient methods for finite-horizon exploratory
  linear-quadratic control problems
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problemsSIAM Journal of Control and Optimization (SICON), 2022
Michael Giegrich
Christoph Reisinger
Yufei Zhang
322
22
0
01 Nov 2022
Square-root regret bounds for continuous-time episodic Markov decision
  processes
Square-root regret bounds for continuous-time episodic Markov decision processesMathematics of Operations Research (MOR), 2022
Ningyuan Chen
X. Zhou
325
6
0
03 Oct 2022
Offline Reinforcement Learning at Multiple Frequencies
Offline Reinforcement Learning at Multiple FrequenciesConference on Robot Learning (CoRL), 2022
Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
OffRL
216
6
0
26 Jul 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary
  Differential Equations
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential EquationsIEEE Transactions on robotics (TRO), 2022
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
355
3
0
25 Jul 2022
q-Learning in Continuous Time
q-Learning in Continuous TimeJournal of machine learning research (JMLR), 2022
Yanwei Jia
X. Zhou
OffRL
490
95
0
02 Jul 2022
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time
  Reinforcement Learning
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Harley Wiltzer
David Meger
Marc G. Bellemare
182
16
0
24 May 2022
Logarithmic regret bounds for continuous-time average-reward Markov
  decision processes
Logarithmic regret bounds for continuous-time average-reward Markov decision processesSIAM Journal of Control and Optimization (SICON), 2022
Ningyuan Chen
X. Zhou
323
9
0
23 May 2022
Linear convergence of a policy gradient method for some finite horizon
  continuous time control problems
Linear convergence of a policy gradient method for some finite horizon continuous time control problemsSIAM Journal of Control and Optimization (SICON), 2022
C. Reisinger
Wolfgang Stockinger
Yufei Zhang
400
11
0
22 Mar 2022
Temporal Difference Learning with Continuous Time and State in the
  Stochastic Setting
Temporal Difference Learning with Continuous Time and State in the Stochastic Setting
Ziad Kobeissi
Francis R. Bach
OffRL
241
4
0
16 Feb 2022
Policy Gradient and Actor-Critic Learning in Continuous Time and Space:
  Theory and Algorithms
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and AlgorithmsJournal of machine learning research (JMLR), 2021
Yanwei Jia
X. Zhou
OffRL
373
116
0
22 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient
  Methods
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
309
28
0
06 Nov 2021
Continuous-Time Fitted Value Iteration for Robust Policies
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
211
10
0
05 Oct 2021
Continuous Homeostatic Reinforcement Learning for Self-Regulated
  Autonomous Agents
Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Hugo Laurençon
Charbel-Raphaël Ségerie
J. Lussange
Boris Gutkin
157
9
0
14 Sep 2021
A generalized stacked reinforcement learning method for sampled systems
A generalized stacked reinforcement learning method for sampled systemsIEEE Transactions on Automatic Control (IEEE TAC), 2021
Pavel Osinenko
D. Dobriborsci
Grigory Yaremenko
Georgiy Malaniya
OffRL
157
6
0
23 Aug 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
149
7
0
16 Jun 2021
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep
  Reinforcement Learning
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning
Yeonji Kim
Min Chi
110
0
0
06 May 2021
ACERAC: Efficient reinforcement learning in fine time discretization
ACERAC: Efficient reinforcement learning in fine time discretizationIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Jakub Łyskawa
Pawel Wawrzyñski
232
3
0
08 Apr 2021
Continuous-Time Model-Based Reinforcement Learning
Continuous-Time Model-Based Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Çağatay Yıldız
Markus Heinonen
Harri Lähdesmäki
OffRL
234
70
0
09 Feb 2021
State-Dependent Temperature Control for Langevin Diffusions
State-Dependent Temperature Control for Langevin DiffusionsSIAM Journal of Control and Optimization (SICON), 2020
Ningyuan Chen
Z. Xu
X. Zhou
352
32
0
15 Nov 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time
  Systems with Lipschitz Continuous Controls
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous ControlsJournal of machine learning research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
162
41
0
27 Oct 2020
POMDPs in Continuous Time and Discrete Spaces
POMDPs in Continuous Time and Discrete SpacesNeural Information Processing Systems (NeurIPS), 2020
Bastian Alt
M. Schultheis
Heinz Koeppl
297
9
0
02 Oct 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes
  with Neural ODEs
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
194
58
0
29 Jun 2020
Logarithmic regret for episodic continuous-time linear-quadratic
  reinforcement learning over a finite-time horizon
Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon
Matteo Basei
Xin Guo
Anran Hu
Yufei Zhang
263
48
0
27 Jun 2020
Thinking While Moving: Deep Reinforcement Learning with Concurrent
  Control
Thinking While Moving: Deep Reinforcement Learning with Concurrent ControlInternational Conference on Learning Representations (ICLR), 2020
Ted Xiao
Eric Jang
Dmitry Kalashnikov
Sergey Levine
Julian Ibarz
Karol Hausman
Alexander Herzog
357
42
0
13 Apr 2020
Control Frequency Adaptation via Action Persistence in Batch
  Reinforcement Learning
Control Frequency Adaptation via Action Persistence in Batch Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
Alberto Maria Metelli
Flavio Mazzolini
L. Bisi
Luca Sabbioni
Marcello Restelli
149
42
0
17 Feb 2020
12
Next