On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. International Conference on Machine Learning (ICML), 2022
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients. IEEE International Joint Conference on Neural Networks (IJCNN), 2021