ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.14389
  4. Cited By
Reinforcement Learning for Non-Stationary Markov Decision Processes: The
  Blessing of (More) Optimism

Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism

24 June 2020
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
    OffRL
ArXivPDFHTML

Papers citing "Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism"

17 / 17 papers shown
Title
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
B. Moreno
Margaux Brégère
Pierre Gaillard
Nadia Oudjane
OffRL
39
0
0
30 May 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement
  Learning
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
34
4
0
26 May 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
37
2
0
25 May 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
39
48
0
06 Oct 2023
Restarted Bayesian Online Change-point Detection for Non-Stationary
  Markov Decision Processes
Restarted Bayesian Online Change-point Detection for Non-Stationary Markov Decision Processes
Réda Alami
Mohammed Mahfoud
Eric Moulines
22
2
0
01 Apr 2023
Online Reinforcement Learning in Periodic MDP
Online Reinforcement Learning in Periodic MDP
Ayush Aniket
Arpan Chattopadhyay
26
2
0
16 Mar 2023
Doubly Inhomogeneous Reinforcement Learning
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
31
2
0
08 Nov 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
31
17
0
26 Aug 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Performative Reinforcement Learning
Performative Reinforcement Learning
Debmalya Mandal
Stelios Triantafyllou
Goran Radanović
33
18
0
30 Jun 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
42
9
0
03 Mar 2022
Rotting Infinitely Many-armed Bandits
Rotting Infinitely Many-armed Bandits
Jung-hun Kim
Milan Vojnović
Se-Young Yun
24
4
0
31 Jan 2022
Recent Advances in Reinforcement Learning in Finance
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
29
167
0
08 Dec 2021
Provably Efficient Black-Box Action Poisoning Attacks Against
  Reinforcement Learning
Provably Efficient Black-Box Action Poisoning Attacks Against Reinforcement Learning
Guanlin Liu
Lifeng Lai
AAML
32
34
0
09 Oct 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1