Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.14087
Cited By
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
27 October 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls"
19 / 19 papers shown
Title
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai
Naoya Takeishi
Takehisa Yairi
145
0
0
21 May 2025
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
Yanwei Jia
Du Ouyang
Yufei Zhang
94
4
0
13 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
86
1
0
28 Feb 2025
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
89
14
0
18 Jul 2024
On the stability of Lipschitz continuous control problems and its application to reinforcement learning
Namkyeong Cho
Yeoneung Kim
102
0
0
20 Apr 2024
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty
Yanwei Jia
82
2
0
19 Apr 2024
Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees
Brent A. Wallace
J. Si
104
3
0
18 Jul 2023
Policy Optimization for Continuous Reinforcement Learning
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
80
18
0
30 May 2023
Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs
Amartya Mukherjee
Jun Liu
60
1
0
18 May 2023
Adversarial Path Planning for Optimal Camera Positioning
Gaia Carenini
Alexandre Duplessis
33
0
0
14 Feb 2023
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)
Amartya Mukherjee
Jun Liu
60
11
0
01 Feb 2023
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
82
3
0
17 Dec 2022
Reinforcement Learning with Non-Exponential Discounting
M. Schultheis
Constantin Rothkopf
Heinz Koeppl
59
11
0
27 Sep 2022
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
156
77
0
02 Jul 2022
Optimisation of Structured Neural Controller Based on Continuous-Time Policy Gradient
Namhoon Cho
Hyo-Sang Shin
38
2
0
17 Jan 2022
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
121
85
0
22 Nov 2021
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
65
9
0
05 Oct 2021
Robust Value Iteration for Continuous Control Tasks
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
64
18
0
25 May 2021
Value Iteration in Continuous Actions, States and Time
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
52
37
0
10 May 2021
1