Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2002.05138
Cited By

Regret Bounds for Discounted MDPs

v1v2v3 (latest)

Regret Bounds for Discounted MDPs

12 February 2020

ArXiv (abs)PDF HTML

Papers citing "Regret Bounds for Discounted MDPs"

16 / 16 papers shown

Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee

Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee

Ping-Chun Hsieh

119

0

0

14 Aug 2025

Reinforcement Learning from Multi-level and Episodic Human Feedback

Reinforcement Learning from Multi-level and Episodic Human FeedbackConference on Learning for Dynamics & Control (L4DC), 2025

Muhammad Qasim Elahi

Somtochukwu Oguchienti

Maheed H. Ahmed

599

0

0

20 Apr 2025

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from
Shifted-Dynamics Data

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

Kishan Panaganti

308

6

0

06 Nov 2024

A Factored MDP Approach To Moving Target Defense With Dynamic Threat
Modeling and Cost Efficiency

A Factored MDP Approach To Moving Target Defense With Dynamic Threat Modeling and Cost Efficiency

194

0

0

16 Aug 2024

Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs

Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

Dabeen Lee

Ambuj Tewari

471

1

0

23 May 2024

Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs
with Short Burn-In Time

Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In TimeNeural Information Processing Systems (NeurIPS), 2023

433

9

0

24 May 2023

Optimistic Planning by Regularized Dynamic Programming

Optimistic Planning by Regularized Dynamic ProgrammingInternational Conference on Machine Learning (ICML), 2023

492

8

0

27 Feb 2023

No-regret Learning in Repeated First-Price Auctions with Budget
Constraints

No-regret Learning in Repeated First-Price Auctions with Budget Constraints

277

14

0

29 May 2022

Slowly Changing Adversarial Bandit Algorithms are Efficient for
Discounted MDPs

Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPsInternational Conference on Algorithmic Learning Theory (ALT), 2022

476

1

0

18 May 2022

Provably Efficient Kernelized Q-Learning

Provably Efficient Kernelized Q-Learning

371

4

0

21 Apr 2022

Gap-Dependent Bounds for Two-Player Markov Games

Gap-Dependent Bounds for Two-Player Markov Games

Zehao Dou

141

8

0

01 Jul 2021

MADE: Exploration via Maximizing Deviation from Explored Regions

MADE: Exploration via Maximizing Deviation from Explored RegionsNeural Information Processing Systems (NeurIPS), 2021

Tianjun Zhang

Paria Rashidinejad

Joseph E. Gonzalez

Stuart J. Russell

254

50

0

18 Jun 2021

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov
Decision Processes

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision ProcessesAnnual Conference Computational Learning Theory (COLT), 2020

Quanquan Gu

Csaba Szepesvári

340

229

0

15 Dec 2020

Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

Nearly Minimax Optimal Reinforcement Learning for Discounted MDPsNeural Information Processing Systems (NeurIPS), 2020

Quanquan Gu

500

47

0

01 Oct 2020

Provably Efficient Reinforcement Learning for Discounted MDPs with
Feature Mapping

Provably Efficient Reinforcement Learning for Discounted MDPs with Feature MappingInternational Conference on Machine Learning (ICML), 2020

Quanquan Gu

440

143

0

23 Jun 2020

$Q$-learning with Logarithmic Regret

Q

-learning with Logarithmic Regret

416

72

0

16 Jun 2020

Page 1 of 1