2002.03069
Adaptive Approximate Policy Iteration
8 February 2020
Botao Hao
N. Lazić
Yasin Abbasi-Yadkori
Pooria Joulani
Csaba Szepesvári
Papers citing "Adaptive Approximate Policy Iteration"
13 / 13 papers shown
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
352
0
0
18 Jun 2023
Concentration Phenomenon for Random Dynamical Systems: An Operator Theoretic Approach
Conference on Learning for Dynamics & Control (L4DC), 2022
Muhammad Naeem
Miroslav Pajic
351
1
0
07 Dec 2022
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
Conference on Learning for Dynamics & Control (L4DC), 2022
Muhammad Naeem
Miroslav Pajic
224
3
0
25 May 2022
Online Learning for Unknown Partially Observable MDPs
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
317
24
0
25 Feb 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
International Conference on Machine Learning (ICML), 2021
N. Lazić
Dong Yin
Yasin Abbasi-Yadkori
Csaba Szepesvári
OffRL
151
19
0
25 Feb 2021
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Jiafan He
Dongruo Zhou
Quanquan Gu
284
27
0
17 Feb 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yue Wu
Dongruo Zhou
Quanquan Gu
192
23
0
15 Feb 2021
Optimization Issues in KL-Constrained Approximate Policy Iteration
N. Lazić
Botao Hao
Yasin Abbasi-Yadkori
Dale Schuurmans
Csaba Szepesvári
130
15
0
11 Feb 2021
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
300
39
0
18 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
International Conference on Learning Representations (ICLR), 2020
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
358
52
0
02 Aug 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Rahul Jain
332
53
0
23 Jul 2020
Learning Expected Reward for Switched Linear Control Systems: A Non-Asymptotic View
Muhammad Naeem
Miroslav Pajic
198
1
0
15 Jun 2020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
305
7
0
08 Jun 2020