A safe exploration approach to constrained Markov decision processes. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs. Neural Information Processing Systems (NeurIPS), 2023