Provably Efficient Q-Learning with Low Switching Cost

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

541

27 Feb 2024

301

08 Feb 2024

Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation

332

02 Feb 2024

Yixuan Zhang

Qiaomin Xie

349

25 Jan 2024

Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge

Meshal Alharbi

Mardavij Roozbehani

M. Dahleh

337

19 Dec 2023

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

International Conference on Learning Representations (ICLR), 2023

382

02 Oct 2023

Minimax Optimal Q Learning with Nearest Neighbors

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023

Puning Zhao

Lifeng Lai

Near-Optimal Partially Observable Reinforcement Learning with Partial Online State Information

324

03 Aug 2023

Settling the Sample Complexity of Online Reinforcement Learning

Annual Conference Computational Learning Theory (COLT), 2023

884

25 Jul 2023

Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data

Neural Information Processing Systems (NeurIPS), 2023

Ruiqi Zhang

Andrea Zanette

OffRL OnRL

343

10 Jul 2023

Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling

International Conference on Machine Learning (ICML), 2023

270

15 Jun 2023

Ming Shi

Yingbin Liang

Ness B. Shroff

411

14 Jun 2023

The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model

Neural Information Processing Systems (NeurIPS), 2023

501

26 May 2023

Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time

Neural Information Processing Systems (NeurIPS), 2023

Xiang Ji

Gen Li

Human Machine Co-adaption Interface via Cooperation Markov Decision Process System

434

24 May 2023

The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond

International Conference on Machine Learning (ICML), 2023

402

18 May 2023

Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

339

17 May 2023

156

03 May 2023

Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning

Annual Conference Computational Learning Theory (COLT), 2023

372

14 Apr 2023

Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs

Ming Yin

227

24 Feb 2023

Near-Optimal Adversarial Reinforcement Learning with Switching Costs

International Conference on Learning Representations (ICLR), 2023

Ming Shi

Yitao Liang

Ness B. Shroff

180

08 Feb 2023

Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments

International Conference on Machine Learning (ICML), 2023

Runlong Zhou

Zihan Zhang

S. Du

406

31 Jan 2023

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2023

Nikolai Karpov

Qin Zhang

423

26 Jan 2023

Provable Sim-to-real Transfer in Continuous Domain with Partial Observations

International Conference on Learning Representations (ICLR), 2022

364

27 Oct 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Neural Information Processing Systems (NeurIPS), 2022

Abhishek Gupta

267

100

18 Oct 2022

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

260

15 Oct 2022

Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation

International Conference on Learning Representations (ICLR), 2022

345

03 Oct 2022

Byzantine-Robust Online and Offline Distributed Reinforcement Learning

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

404

01 Jun 2022

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Pedro Cisneros-Velarde

468

31 May 2022

The Efficacy of Pessimism in Asynchronous Q-Learning

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

393

14 Mar 2022

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Neural Information Processing Systems (NeurIPS), 2022

Tianhao Wang

272

07 Mar 2022

Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity

International Conference on Machine Learning (ICML), 2022

382

107

28 Feb 2022

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality

International Conference on Learning Representations (ICLR), 2022

355

14 Feb 2022

Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost

International Conference on Machine Learning (ICML), 2022

Ming Yin

Ming Min

A Benchmark for Low-Switching-Cost Reinforcement Learning

302

13 Feb 2022

Improved Regret for Differentially Private Exploration in Linear MDP

International Conference on Machine Learning (ICML), 2022

Dung Daniel Ngo

G. Vietri

Zhiwei Steven Wu

306

02 Feb 2022

185

13 Dec 2021

Towards Instance-Optimal Offline Reinforcement Learning with Pessimism

Ming Yin