ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.05857
  4. Cited By
Variational Regret Bounds for Reinforcement Learning
v1v2v3 (latest)

Variational Regret Bounds for Reinforcement Learning

14 May 2019
Pratik Gajane
R. Ortner
P. Auer
ArXiv (abs)PDFHTML

Papers citing "Variational Regret Bounds for Reinforcement Learning"

23 / 23 papers shown
Title
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
B. Moreno
Margaux Brégère
Pierre Gaillard
Nadia Oudjane
OffRL
87
1
0
30 May 2024
Decision Making in Non-Stationary Environments with Policy-Augmented
  Search
Decision Making in Non-Stationary Environments with Policy-Augmented Search
Ava Pettet
Yunuo Zhang
Baiting Luo
Kyle Wray
Hendrik Baier
Aron Laszka
Abhishek Dubey
Ayan Mukhopadhyay
51
4
0
06 Jan 2024
Restarted Bayesian Online Change-point Detection for Non-Stationary
  Markov Decision Processes
Restarted Bayesian Online Change-point Detection for Non-Stationary Markov Decision Processes
Réda Alami
Mohammed Mahfoud
Eric Moulines
62
3
0
01 Apr 2023
Online Reinforcement Learning in Periodic MDP
Online Reinforcement Learning in Periodic MDP
Ayush Aniket
Arpan Chattopadhyay
53
4
0
16 Mar 2023
Dynamics-Adaptive Continual Reinforcement Learning via Progressive
  Contextualization
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization
Tiantian Zhang
Zichuan Lin
Yuxing Wang
Deheng Ye
Qiang Fu
Wei Yang
Xueqian Wang
Bin Liang
Bo Yuan
Xiu Li
CLL
101
11
0
01 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
101
17
0
26 Aug 2022
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling
  Regularization
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization
P. Liotet
Francesco Vidaich
Alberto Maria Metelli
Marcello Restelli
OffRL
68
8
0
13 Dec 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary
  MDPs
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
117
21
0
18 Oct 2021
Markov Decision Processes with Long-Term Average Constraints
Markov Decision Processes with Long-Term Average Constraints
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
52
6
0
12 Jun 2021
Unsupervised Person Re-identification via Simultaneous Clustering and
  Consistency Learning
Unsupervised Person Re-identification via Simultaneous Clustering and Consistency Learning
Hanne I Oberman
Jiayan Qiu
S. van Buuren
G. Vink
Zhanyu Ma
Jun Guo
57
15
0
01 Apr 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
131
9
0
21 Feb 2021
Robust Policy Gradient against Strong Data Corruption
Robust Policy Gradient against Strong Data Corruption
Xuezhou Zhang
Yiding Chen
Xiaojin Zhu
Wen Sun
AAML
99
39
0
11 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
187
107
0
10 Feb 2021
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Ahmed Touati
Pascal Vincent
108
29
0
24 Oct 2020
A Relearning Approach to Reinforcement Learning for Control of Smart
  Buildings
A Relearning Approach to Reinforcement Learning for Control of Smart Buildings
Avisek Naug
Marcos Quiñones-Grueiro
G. Biswas
CLL
52
11
0
04 Aug 2020
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in
  Metric Spaces
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
O. D. Domingues
Pierre Ménard
Matteo Pirotta
E. Kaufmann
Michal Valko
80
40
0
09 Jul 2020
Dynamic Regret of Policy Optimization in Non-stationary Environments
Dynamic Regret of Policy Optimization in Non-stationary Environments
Yingjie Fei
Zhuoran Yang
Zhaoran Wang
Qiaomin Xie
93
56
0
30 Jun 2020
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial
  Imitation Learning
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning
Lionel Blondé
Pablo Strasser
Alexandros Kalousis
90
22
0
28 Jun 2020
Reinforcement Learning for Non-Stationary Markov Decision Processes: The
  Blessing of (More) Optimism
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
OffRL
98
96
0
24 Jun 2020
Linear Last-iterate Convergence in Constrained Saddle-point Optimization
Linear Last-iterate Convergence in Constrained Saddle-point Optimization
Chen-Yu Wei
Chung-Wei Lee
Mengxiao Zhang
Haipeng Luo
139
11
0
16 Jun 2020
A Survey of Reinforcement Learning Algorithms for Dynamically Varying
  Environments
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments
Sindhu Padakandla
75
155
0
19 May 2020
Contextual Blocking Bandits
Contextual Blocking Bandits
Soumya Basu
Orestis Papadigenopoulos
Constantine Caramanis
Sanjay Shakkottai
83
21
0
06 Mar 2020
Learning and Planning for Time-Varying MDPs Using Maximum Likelihood
  Estimation
Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation
Melkior Ornik
Ufuk Topcu
OOD
41
15
0
29 Nov 2019
1