Parameter-free Reduction of the Estimation Bias in Deep Reinforcement
Learning for Deterministic Policy GradientsNeural Processing Letters (NPL), 2021 |
Jointly Learning Environments and Control Policies with Projected
Stochastic Gradient AscentJournal of Artificial Intelligence Research (JAIR), 2020 |
An Application of Deep Reinforcement Learning to Algorithmic TradingExpert systems with applications (ESWA), 2020 |