Probabilistic Inference in Reinforcement Learning Done Right. Neural Information Processing Systems (NeurIPS), 2023
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization. International Conference on Machine Learning (ICML), 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. International Conference on Machine Learning (ICML), 2023
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework. Neural Information Processing Systems (NeurIPS), 2022