Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.14642
Cited By
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
28 August 2023
Uri Sherman
Alon Cohen
Tomer Koren
Yishay Mansour
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rate-Optimal Policy Optimization for Linear Markov Decision Processes"
10 / 10 papers shown
Title
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
D. Tiapkin
Evgenii Chzhen
Gilles Stoltz
74
0
0
08 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf B. Cassel
Aviv A. Rosenberg
35
1
0
03 Jul 2024
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf B. Cassel
Haipeng Luo
Aviv A. Rosenberg
Dmitry Sotnikov
OffRL
29
3
0
13 May 2024
Imitation Learning in Discounted Linear MDPs without exploration assumptions
Luca Viano
Stratis Skoulakis
V. Cevher
30
3
0
03 May 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
Yan Dai
Qiwen Cui
S. S. Du
35
1
0
11 Feb 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
Han Zhong
OffRL
68
3
0
28 Dec 2023
Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback
Haolin Liu
Chen-Yu Wei
Julian Zimmert
17
6
0
17 Oct 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker
Yifang Chen
Max Simchowitz
S. Du
Kevin G. Jamieson
71
36
0
07 Dec 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
87
136
0
30 Jan 2021
1