Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.03069
Cited By
Adaptive Approximate Policy Iteration
8 February 2020
Botao Hao
N. Lazić
Yasin Abbasi-Yadkori
Pooria Joulani
Csaba Szepesvári
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Approximate Policy Iteration"
14 / 14 papers shown
Title
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
43
0
0
18 Jun 2023
Concentration Phenomenon for Random Dynamical Systems: An Operator Theoretic Approach
Muhammad Naeem
Miroslav Pajic
17
1
0
07 Dec 2022
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
Muhammad Naeem
Miroslav Pajic
14
3
0
25 May 2022
Online Learning for Unknown Partially Observable MDPs
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
20
20
0
25 Feb 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
N. Lazić
Dong Yin
Yasin Abbasi-Yadkori
Csaba Szepesvári
OffRL
6
17
0
25 Feb 2021
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
Jiafan He
Dongruo Zhou
Quanquan Gu
95
23
0
17 Feb 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Yue Wu
Dongruo Zhou
Quanquan Gu
11
21
0
15 Feb 2021
Optimization Issues in KL-Constrained Approximate Policy Iteration
N. Lazić
Botao Hao
Yasin Abbasi-Yadkori
Dale Schuurmans
Csaba Szepesvári
19
10
0
11 Feb 2021
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
6
27
0
18 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
15
42
0
02 Aug 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Rahul Jain
11
43
0
23 Jul 2020
Learning Expected Reward for Switched Linear Control Systems: A Non-Asymptotic View
Muhammad Naeem
Miroslav Pajic
6
1
0
15 Jun 2020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
12
7
0
08 Jun 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1