Coordinate Ascent for Off-Policy RL with Global Convergence GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 |
The problem with DDPG: understanding failures in deterministic
environments with sparse rewardsInternational Conference on Artificial Neural Networks (ICANN), 2019 |