ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.05476
  4. Cited By
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy
  Optimization

Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization

8 February 2024
Talha Bozkus
Urbashi Mitra
    OffRL
ArXivPDFHTML

Papers citing "Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization"

6 / 6 papers shown
Title
Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks
Talha Bozkus
U. Mitra
OffRL
34
0
0
07 Mar 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
42
0
0
23 Feb 2025
Coverage Analysis of Multi-Environment Q-Learning Algorithms for
  Wireless Network Optimization
Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization
Talha Bozkus
Urbashi Mitra
35
2
0
29 Aug 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale
  Wireless Networks
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
Talha Bozkus
Urbashi Mitra
21
4
0
12 Feb 2024
Approximation Benefits of Policy Gradient Methods with Aggregated States
Approximation Benefits of Policy Gradient Methods with Aggregated States
Daniel Russo
38
7
0
22 Jul 2020
Measuring and testing dependence by correlation of distances
Measuring and testing dependence by correlation of distances
G. Székely
Maria L. Rizzo
N. K. Bakirov
175
2,577
0
28 Mar 2008
1