ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.13743
  4. Cited By
Optimistic Q-learning for average reward and episodic reinforcement learning

Optimistic Q-learning for average reward and episodic reinforcement learning

18 July 2024
Priyank Agrawal
Shipra Agrawal
ArXivPDFHTML

Papers citing "Optimistic Q-learning for average reward and episodic reinforcement learning"

3 / 3 papers shown
Title
Improved Sample Complexity for Global Convergence of Actor-Critic
  Algorithms
Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms
Navdeep Kumar
Priyank Agrawal
Giorgia Ramponi
Kfir Y. Levy
Shie Mannor
30
0
0
11 Oct 2024
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Victor Boone
Zihan Zhang
32
5
0
03 Jun 2024
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
103
99
0
15 Oct 2019
1