Policy Optimization for Continuous Reinforcement Learning. Neural Information Processing Systems (NeurIPS), 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic. International Conference on Machine Learning (ICML), 2023
Geometry and convergence of natural policy gradient methods. Information Geometry (IG), 2022
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems. SIAM Journal on Control and Optimization (SICON), 2022
Linear convergence of a policy gradient method for some finite horizon continuous time control problems. SIAM Journal on Control and Optimization (SICON), 2022