Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic RegulatorInternational Joint Conference on Artificial Intelligence (IJCAI), 2024 |
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2024 |
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic AlgorithmsConference on Uncertainty in Artificial Intelligence (UAI), 2023 |
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement
Learning via Multi-Level Monte Carlo Actor-CriticInternational Conference on Machine Learning (ICML), 2023 |
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural
Network ParametrizationInternational Conference on Machine Learning (ICML), 2022 |