Choquet regularization for reinforcement learningSocial Science Research Network (SSRN), 2022 |
q-Learning in Continuous TimeJournal of machine learning research (JMLR), 2022 |
An Algebraically Converging Stochastic Gradient Descent Algorithm for Global OptimizationCommunications in Mathematical Sciences (Commun. Math. Sci.), 2022 |
Policy Gradient and Actor-Critic Learning in Continuous Time and Space:
Theory and AlgorithmsJournal of machine learning research (JMLR), 2021 |