1901.09732
Cited By
v1
v2 (latest)
Making Deep Q-learning methods robust to time discretization
28 January 2019
Corentin Tallec
Léonard Blier
Yann Ollivier
OOD
OffRL
Papers citing
"Making Deep Q-learning methods robust to time discretization"
50 / 52 papers shown
Teaching signal synchronization in deep neural networks with prospective neurons
Nicolas Zucchet
Qianqian Feng
Axel Laborieux
Friedemann Zenke
Walter Senn
João Sacramento
69
0
0
18 Nov 2025
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Chengxiu Hua
Jiawen Gu
Yushun Tang
261
0
0
20 Oct 2025
Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization
Ziheng Cheng
Xin Guo
Yufei Zhang
OffRL
104
0
0
28 Sep 2025
Continuous-Time Reinforcement Learning for Asset-Liability Management
Yilie Huang
76
0
0
27 Sep 2025
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear-Quadratic Reinforcement Learning Problems
Yilie Huang
Xun Yu Zhou
OffRL
184
1
0
01 Jul 2025
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai
Naoya Takeishi
Takehisa Yairi
524
0
0
21 May 2025
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
Yanwei Jia
Du Ouyang
Yufei Zhang
302
9
0
13 Mar 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
323
20
0
04 Feb 2025
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Harley Wiltzer
Marc G. Bellemare
David Meger
Patrick Shafto
Yash Jhaveri
170
3
0
14 Oct 2024
Exploratory Optimal Stopping: A Singular Control Formulation
Jodi Dianetti
Giorgio Ferrari
Renyuan Xu
219
11
0
18 Aug 2024
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy
Lijun Bo
Yijie Huang
Xiang Yu
Tingting Zhang
314
4
0
04 Jul 2024
An Idiosyncrasy of Time-discretization in Reinforcement Learning
Kris De Asis
Richard S. Sutton
219
0
0
21 Jun 2024
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Yi Xiong
Ningyuan Chen
214
2
0
08 Jun 2024
Reinforcement Learning for Jump-Diffusions, with Financial Applications
Ningyuan Chen
Lingfei Li
X. Zhou
452
2
0
26 May 2024
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty
Yanwei Jia
193
3
0
19 Apr 2024
Approximate Control for Continuous-Time POMDPs
Yannick Eich
Bastian Alt
Heinz Koeppl
170
1
0
02 Feb 2024
Continuous Time Continuous Space Homeostatic Reinforcement Learning (CTCS-HRRL): Towards Biological Self-Autonomous Agent
Hugo Laurençon
Yesoda Bhargava
Riddhi Zantye
Charbel-Raphaël Ségerie
J. Lussange
V. Baths
Boris Gutkin
50
1
0
17 Jan 2024
Data-driven rules for multidimensional reflection problems
Soren Christensen
Asbjorn Holk Thomsen
Lukas Trottner
172
6
0
11 Nov 2023
Efficient Exploration in Continuous-time Model-based Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Lenart Treven
Jonas Hübotter
Bhavya Sukhija
Florian Dorfler
Andreas Krause
263
16
0
30 Oct 2023
Actor-Critic with variable time discretization via sustained actions
International Conference on Neural Information Processing (ICONIP), 2023
Jakub Łyskawa
Paweł Wawrzyński
OffRL
85
1
0
08 Aug 2023
Policy Optimization for Continuous Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
349
30
0
30 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Homayoon Farrahi
Rupam Mahmood
168
5
0
09 May 2023
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Neural Information Processing Systems (NeurIPS), 2022
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
332
3
0
17 Dec 2022
Dynamic Decision Frequency with Continuous Options
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Amir-Hossein Karimi
Jun Jin
Jun Luo
A. R. Mahmood
Martin Jägersand
Samuele Tosatto
273
10
0
06 Dec 2022
Simultaneously Updating All Persistence Values in Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Luca Sabbioni
Luca Al Daire
L. Bisi
Alberto Maria Metelli
Marcello Restelli
143
3
0
21 Nov 2022
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems
SIAM Journal on Control and Optimization (SICON), 2022
Michael Giegrich
Christoph Reisinger
Yufei Zhang
322
22
0
01 Nov 2022
Square-root regret bounds for continuous-time episodic Markov decision processes
Mathematics of Operations Research (MOR), 2022
Ningyuan Chen
X. Zhou
325
6
0
03 Oct 2022
Offline Reinforcement Learning at Multiple Frequencies
Conference on Robot Learning (CoRL), 2022
Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
OffRL
216
6
0
26 Jul 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations
IEEE Transactions on Robotics (TRO), 2022
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
355
3
0
25 Jul 2022
q-Learning in Continuous Time
Journal of Machine Learning Research (JMLR), 2022
Yanwei Jia
X. Zhou
OffRL
490
95
0
02 Jul 2022
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Harley Wiltzer
David Meger
Marc G. Bellemare
182
16
0
24 May 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
SIAM Journal on Control and Optimization (SICON), 2022
Ningyuan Chen
X. Zhou
323
9
0
23 May 2022
Linear convergence of a policy gradient method for some finite horizon continuous time control problems
SIAM Journal on Control and Optimization (SICON), 2022
C. Reisinger
Wolfgang Stockinger
Yufei Zhang
400
11
0
22 Mar 2022
Temporal Difference Learning with Continuous Time and State in the Stochastic Setting
Ziad Kobeissi
Francis R. Bach
OffRL
241
4
0
16 Feb 2022
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Journal of Machine Learning Research (JMLR), 2021
Yanwei Jia
X. Zhou
OffRL
373
116
0
22 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
309
28
0
06 Nov 2021
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
211
10
0
05 Oct 2021
Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Hugo Laurençon
Charbel-Raphaël Ségerie
J. Lussange
Boris Gutkin
157
9
0
14 Sep 2021
A generalized stacked reinforcement learning method for sampled systems
IEEE Transactions on Automatic Control (IEEE TAC), 2021
Pavel Osinenko
D. Dobriborsci
Grigory Yaremenko
Georgiy Malaniya
OffRL
157
6
0
23 Aug 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
149
7
0
16 Jun 2021
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning
Yeonji Kim
Min Chi
110
0
0
06 May 2021
ACERAC: Efficient reinforcement learning in fine time discretization
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Jakub Łyskawa
Paweł Wawrzyński
232
3
0
08 Apr 2021
Continuous-Time Model-Based Reinforcement Learning
International Conference on Machine Learning (ICML), 2021
Çağatay Yıldız
Markus Heinonen
Harri Lähdesmäki
OffRL
234
70
0
09 Feb 2021
State-Dependent Temperature Control for Langevin Diffusions
SIAM Journal on Control and Optimization (SICON), 2020
Ningyuan Chen
Z. Xu
X. Zhou
352
32
0
15 Nov 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Journal of Machine Learning Research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
162
41
0
27 Oct 2020
POMDPs in Continuous Time and Discrete Spaces
Neural Information Processing Systems (NeurIPS), 2020
Bastian Alt
M. Schultheis
Heinz Koeppl
297
9
0
02 Oct 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
194
58
0
29 Jun 2020
Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon
Matteo Basei
Xin Guo
Anran Hu
Yufei Zhang
263
48
0
27 Jun 2020
Thinking While Moving: Deep Reinforcement Learning with Concurrent Control
International Conference on Learning Representations (ICLR), 2020
Ted Xiao
Eric Jang
Dmitry Kalashnikov
Sergey Levine
Julian Ibarz
Karol Hausman
Alexander Herzog
357
42
0
13 Apr 2020
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning
International Conference on Machine Learning (ICML), 2020
Alberto Maria Metelli
Flavio Mazzolini
L. Bisi
Luca Sabbioni
Marcello Restelli
149
42
0
17 Feb 2020