Dynamic Regret of Online Markov Decision Processes

Dynamic Regret of Online Markov Decision Processes

26 August 2022

Papers citing "Dynamic Regret of Online Markov Decision Processes"

13 / 13 papers shown

Title
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation Long-Fei Li Yu-Jie Zhang Peng Zhao Zhi-Hua Zhou 92 4 0 17 Jan 2025
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs Long-Fei Li Peng Zhao Zhi-Hua Zhou 39 0 0 05 Nov 2024
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes Sang Bin Moon Abolfazl Hashemi 19 0 0 03 May 2024
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition Long-Fei Li Peng Zhao Zhi-Hua Zhou 28 4 0 07 Mar 2024
Efficient Methods for Non-stationary Online Learning Peng Zhao Yan-Feng Xie Lijun Zhang Zhi-Hua Zhou 33 19 0 16 Sep 2023
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions Tongxin Li Yiheng Lin Shaolei Ren Adam Wierman AAML OffRL 27 6 0 20 Jul 2023
Online Resource Allocation in Episodic Markov Decision Processes Duksang Lee William Overman Dabeen Lee 24 1 0 18 May 2023
Adapting to Continuous Covariate Shift via Online Density Ratio Estimation Yu-Jie Zhang Zhenyu Zhang Peng Zhao Masashi Sugiyama OOD 14 11 0 06 Feb 2023
Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path Liyu Chen Andrea Tirinzoni Matteo Pirotta A. Lazaric 24 3 0 10 Oct 2022
Adapting to Online Label Shift with Provable Guarantees Yong Bai Yu-Jie Zhang Peng Zhao Masashi Sugiyama Zhi-Hua Zhou OOD 19 24 0 05 Jul 2022
Optimal Dynamic Regret in Exp-Concave Online Learning Dheeraj Baby Yu-Xiang Wang 37 43 0 23 Apr 2021
Non-stationary Online Learning with Memory and Non-stochastic Control Peng Zhao Yu-Hu Yan Yu-Xiang Wang Zhi-Hua Zhou 19 47 0 07 Feb 2021
Efficient Learning in Non-Stationary Linear Markov Decision Processes Ahmed Touati Pascal Vincent 32 29 0 24 Oct 2020