Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

23 July 2020

Chen-Yu Wei

Mehdi Jafarnia-Jahromi

Haipeng Luo

Rahul Jain

Papers citing "Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation"

32 / 32 papers shown

No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes

151

23 Oct 2025

Finite-Time Bounds for Average-Reward Fitted Q-Iteration

Jongmin Lee

Ernest K. Ryu

OffRL

137

20 Oct 2025

Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach

Swetha Ganesh

Vaneet Aggarwal

284

26 May 2025

Influential Bandits: Pulling an Arm May Change the Environment

Ryoma Sato

Shinji Ito

342

11 Apr 2025

Provably Adaptive Average Reward Reinforcement Learning for Metric Spaces

Conference on Uncertainty in Artificial Intelligence (UAI), 2024

Avik Kar

Rahul Singh

259

25 Oct 2024

Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation

Zhishuai Liu

Pan Xu

OOD OffRL

309

23 Feb 2024

Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes

Annual Conference Computational Learning Theory (COLT), 2023

Zihan Zhang

Qiaomin Xie

OffRL

290

28 Jun 2023

Optimistic Planning by Regularized Dynamic Programming

International Conference on Machine Learning (ICML), 2023

Antoine Moulin

Gergely Neu

492

27 Feb 2023

Best of Both Worlds Policy Optimization

International Conference on Machine Learning (ICML), 2023

Christoph Dann

Chen-Yu Wei

Julian Zimmert

266

18 Feb 2023

ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints

International Conference on Machine Learning (ICML), 2023

Akhil Agnihotri

R. Jain

Haipeng Luo

828

02 Feb 2023

Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation

International Conference on Machine Learning (ICML), 2023

Uri Sherman

Tomer Koren

Yishay Mansour

382

30 Jan 2023

Refined Regret for Adversarial MDPs with Linear Function Approximation

International Conference on Machine Learning (ICML), 2023

327

30 Jan 2023

Provable Reset-free Reinforcement Learning by No-Regret Reduction

International Conference on Machine Learning (ICML), 2023

Hoai-An Nguyen

Ching-An Cheng

OffRL

387

06 Jan 2023

Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization

International Conference on Algorithmic Learning Theory (ALT), 2022

Gergely Neu

Nneka Okolo

472

21 Oct 2022

Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs

International Conference on Algorithmic Learning Theory (ALT), 2022

Ian A. Kash

L. Reyzin

Zishun Yu

485

18 May 2022

Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence

International Conference on Machine Learning (ICML), 2022

519

08 Feb 2022

Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints

International Conference on Machine Learning (ICML), 2022

Liyu Chen

R. Jain

Haipeng Luo

325

31 Jan 2022

Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP

International Conference on Machine Learning (ICML), 2021

Liyu Chen

Rahul Jain

Haipeng Luo

231

18 Dec 2021

Adjacency constraint for efficient hierarchical reinforcement learning

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

615

30 Oct 2021

Understanding Domain Randomization for Sim-to-real Transfer

474

164

07 Oct 2021

Efficient Local Planning with Linear Function Approximation

International Conference on Algorithmic Learning Theory (ALT), 2021

421

12 Aug 2021

Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses

Neural Information Processing Systems (NeurIPS), 2021

Haipeng Luo

Chen-Yu Wei

Chung-Wei Lee

327

18 Jul 2021

Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

Conference on Uncertainty in Artificial Intelligence (UAI), 2021

Quanquan Gu

309

22 Jun 2021

Online Learning for Stochastic Shortest Path Model via Posterior Sampling

Mehdi Jafarnia-Jahromi

302

09 Jun 2021

Average-Reward Reinforcement Learning with Trust Region Methods

International Joint Conference on Artificial Intelligence (IJCAI), 2021

254

07 Jun 2021

Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model

Neural Information Processing Systems (NeurIPS), 2021

Bingyan Wang

Yuling Yan

Jianqing Fan

512

28 May 2021

Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting

Neural Information Processing Systems (NeurIPS), 2021

333

17 May 2021

Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation

International Conference on Machine Learning (ICML), 2021

386

04 May 2021

Online Learning for Unknown Partially Observable MDPs

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

Mehdi Jafarnia-Jahromi

Rahul Jain

A. Nayyar

344

25 Feb 2021

Improved Regret Bound and Experience Replay in Regularized Policy Iteration

International Conference on Machine Learning (ICML), 2021

166

25 Feb 2021

Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

Yue Wu

Dongruo Zhou

Quanquan Gu

228

15 Feb 2021

Nonstationary Reinforcement Learning with Linear Function Approximation

433

08 Oct 2020