2402.05476
Cited By
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
8 February 2024
Talha Bozkus
Urbashi Mitra
OffRL
Papers citing
"Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization"
6 / 6 papers shown
Title
Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks
Talha Bozkus
U. Mitra
OffRL
34
0
0
07 Mar 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
42
0
0
23 Feb 2025
Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization
Talha Bozkus
Urbashi Mitra
35
2
0
29 Aug 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
Talha Bozkus
Urbashi Mitra
21
4
0
12 Feb 2024
Approximation Benefits of Policy Gradient Methods with Aggregated States
Daniel Russo
38
7
0
22 Jul 2020
Measuring and testing dependence by correlation of distances
G. Székely
Maria L. Rizzo
N. K. Bakirov
175
2,577
0
28 Mar 2008