
Provably Efficient Reinforcement Learning with Aggregated States

13 December 2019

Papers citing "Provably Efficient Reinforcement Learning with Aggregated States"


Demystifying Linear MDPs and Novel Dynamics Aggregation Framework

International Conference on Learning Representations (ICLR), 2024

Joongkyu Lee

Min-hwan Oh

213

31 Oct 2024

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions

Physical Review X (PRX), 2023

Nishil Patel

Sebastian Lee

Stefano Sarao Mannelli

Sebastian Goldt

Andrew Saxe

OffRL

431

17 Jun 2023

Exponential Hardness of Reinforcement Learning with Linear Function Approximation

Annual Conference Computational Learning Theory (COLT), 2023

240

25 Feb 2023

Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient

Annual Conference Computational Learning Theory (COLT), 2023

222

19 Jan 2023

Model-Free Reinforcement Learning with the Decision-Estimation Coefficient

Neural Information Processing Systems (NeurIPS), 2022

245

25 Nov 2022

Planning to the Information Horizon of BAMDPs via Epistemic State Abstraction

Neural Information Processing Systems (NeurIPS), 2022

Dilip Arumugam

Satinder Singh

193

30 Oct 2022

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

International Conference on Learning Representations (ICLR), 2022

Quanquan Gu

220

30 Sep 2022

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Management Sciences (MS), 2022

226

10 Jul 2022

On the Complexity of Adversarial Decision Making

Neural Information Processing Systems (NeurIPS), 2022

186

27 Jun 2022

Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

Dilip Arumugam

Benjamin Van Roy

OffRL

254

04 Jun 2022

Finding Safe Zones of policies Markov Decision Processes

Neural Information Processing Systems (NeurIPS), 2022

Lee Cohen

Yishay Mansour

Michal Moshkovitz

236

23 Feb 2022

Computational-Statistical Gaps in Reinforcement Learning

145

11 Feb 2022

Improved Algorithms for Misspecified Linear Markov Decision Processes

192

12 Sep 2021

Regret Minimization Experience Replay in Off-Policy Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2021

176

15 May 2021

Bilinear Classes: A Structural Framework for Provable Generalization in RL

International Conference on Machine Learning (ICML), 2021

503

199

19 Mar 2021

Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Neural Information Processing Systems (NeurIPS), 2021

Kefan Dong

Jiaqi Yang

Tengyu Ma

515

08 Feb 2021

Randomized Value Functions via Posterior State-Abstraction Sampling

Dilip Arumugam

Benjamin Van Roy

OffRL

288

05 Oct 2020

Approximation Benefits of Policy Gradient Methods with Aggregated States

Daniel Russo

350

22 Jul 2020

PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

Neural Information Processing Systems (NeurIPS), 2020

246

119

16 Jul 2020

Provably More Efficient Q-Learning in the One-Sided-Feedback/Full-Feedback Settings

Xiao-Yue Gong

D. Simchi-Levi

30 Jun 2020

Reinforcement Learning with Feedback Graphs

153

07 May 2020