ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.01644
  4. Cited By
Queueing Network Controls via Deep Reinforcement Learning
v1v2v3v4v5v6v7 (latest)

Queueing Network Controls via Deep Reinforcement Learning

Stochastic Systems (Stoch. Syst.), 2020
31 July 2020
J. Dai
Mark O. Gluzman
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Queueing Network Controls via Deep Reinforcement Learning"

32 / 32 papers shown
Title
Non-iid hypothesis testing: from classical to quantum
Non-iid hypothesis testing: from classical to quantum
Giacomo De Palma
Marco Fanizza
Connor Mowry
Ryan O'Donnell
76
0
0
07 Oct 2025
Optimising Call Centre Operations using Reinforcement Learning: Value Iteration versus Proximal Policy Optimisation
Optimising Call Centre Operations using Reinforcement Learning: Value Iteration versus Proximal Policy Optimisation
Kwong Ho Li
Wathsala Karunarathne
OffRL
82
0
0
24 Jul 2025
QSpark: Towards Reliable Qiskit Code Generation
QSpark: Towards Reliable Qiskit Code Generation
Kiana Kheiri
Aamna Aamir
Andriy Miranskyy
Chen Ding
169
2
0
16 Jul 2025
Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence
Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence
Yidan Wu
Yu Yu
Jianan Zhang
Li Jin
158
0
0
19 Mar 2025
A Novel Switch-Type Policy Network for Resource Allocation Problems: Technical Report
A Novel Switch-Type Policy Network for Resource Allocation Problems: Technical ReportInternational Symposium on Modeling and Optimization in Mobile, Ad-Hoc and Wireless Networks (WiOpt), 2025
Jerrod Wigmore
B. Shrader
E. Modiano
215
0
0
19 Jan 2025
QGym: Scalable Simulation and Benchmarking of Queuing Network
  Controllers
QGym: Scalable Simulation and Benchmarking of Queuing Network ControllersNeural Information Processing Systems (NeurIPS), 2024
Haozhe Chen
Ang Li
Ethan Che
Tianyi Peng
Jing Dong
Hongseok Namkoong
OffRL
117
0
0
08 Oct 2024
Adversarial Network Optimization under Bandit Feedback: Maximizing
  Utility in Non-Stationary Multi-Hop Networks
Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop NetworksProceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2024
Yan Dai
Longbo Huang
122
3
0
29 Aug 2024
Edge AI: A Taxonomy, Systematic Review and Future Directions
Edge AI: A Taxonomy, Systematic Review and Future Directions
S. Gill
Muhammed Golec
Jianmin Hu
Minxian Xu
Junhui Du
...
Kejiang Ye
Prabal Verma
Surendra Kumar
Félix Cuadrado
Steve Uhlig
197
94
0
04 Jul 2024
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Yi Xiong
Ningyuan Chen
163
2
0
08 Jun 2024
Performance of NPG in Countable State-Space Average-Cost RL
Performance of NPG in Countable State-Space Average-Cost RL
Yashaswini Murthy
Isaac Grosof
S. T. Maguluri
R. Srikant
OffRL
168
1
0
30 May 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic
  Queuing Network Optimization: Technical Report
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
164
2
0
05 Apr 2024
Efficient Reinforcement Learning for Routing Jobs in Heterogeneous
  Queueing Systems
Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
Neharika Jali
Guannan Qu
Weina Wang
Gauri Joshi
185
7
0
02 Feb 2024
Constant Stepsize Q-learning: Distributional Convergence, Bias and
  Extrapolation
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
Yixuan Zhang
Qiaomin Xie
227
11
0
25 Jan 2024
Optimal Control of Multiclass Fluid Queueing Networks: A Machine
  Learning Approach
Optimal Control of Multiclass Fluid Queueing Networks: A Machine Learning Approach
Dimitris Bertsimas
Cheolhyeong Kim
63
3
0
23 Jul 2023
Bayesian Learning of Optimal Policies in Markov Decision Processes with
  Countably Infinite State-Space
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-SpaceNeural Information Processing Systems (NeurIPS), 2023
Saghar Adler
V. Subramanian
147
2
0
05 Jun 2023
Learning to Stabilize Online Reinforcement Learning in Unbounded State
  Spaces
Learning to Stabilize Online Reinforcement Learning in Unbounded State SpacesInternational Conference on Machine Learning (ICML), 2023
Brahma S. Pavse
M. Zurek
Yudong Chen
Qiaomin Xie
Josiah P. Hanna
OffRL
278
2
0
02 Jun 2023
Policy Optimization for Continuous Reinforcement Learning
Policy Optimization for Continuous Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
241
28
0
30 May 2023
Sample Efficient Reinforcement Learning in Mixed Systems through
  Augmented Samples and Its Applications to Queueing Networks
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing NetworksNeural Information Processing Systems (NeurIPS), 2023
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
152
10
0
25 May 2023
Online Learning and Optimization for Queues with Unknown Demand Curve and Service Distribution
Online Learning and Optimization for Queues with Unknown Demand Curve and Service Distribution
Xinyun Chen
Yunan Liu
Yunan Liu
97
4
0
06 Mar 2023
Minimax Optimal Estimation of Stability Under Distribution Shift
Minimax Optimal Estimation of Stability Under Distribution ShiftOperational Research (OR), 2022
Hongseok Namkoong
Yuanzhe Ma
Peter Glynn
296
6
0
13 Dec 2022
Offline Estimation of Controlled Markov Chains: Minimaxity and Sample
  Complexity
Offline Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity
Imon Banerjee
Harsha Honnappa
Vinayak A. Rao
OffRL
188
3
0
14 Nov 2022
Hindsight Learning for MDPs with Exogenous Inputs
Hindsight Learning for MDPs with Exogenous InputsInternational Conference on Machine Learning (ICML), 2022
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
191
25
0
13 Jul 2022
Finding Optimal Policy for Queueing Models: New Parameterization
Finding Optimal Policy for Queueing Models: New Parameterization
Trang H. Tran
Lam M. Nguyen
K. Scheinberg
OffRL
114
2
0
21 Jun 2022
Logarithmic regret bounds for continuous-time average-reward Markov
  decision processes
Logarithmic regret bounds for continuous-time average-reward Markov decision processesSIAM Journal of Control and Optimization (SICON), 2022
Ningyuan Chen
X. Zhou
295
9
0
23 May 2022
Learning to Admit Optimally in an $M/M/k/k+N$ Queueing System with Unknown Service Rate
Learning to Admit Optimally in an M/M/k/k+NM/M/k/k+NM/M/k/k+N Queueing System with Unknown Service Rate
Saghar Adler
Mehrdad Moharrami
V. Subramanian
130
1
0
04 Feb 2022
Can machines solve general queueing systems?
Can machines solve general queueing systems?
Eliran Sherzer
Arik Senderovich
Opher Baron
Dmitry Krass
54
3
0
03 Feb 2022
Refined Policy Improvement Bounds for MDPs
Refined Policy Improvement Bounds for MDPs
J. Dai
Mark O. Gluzman
83
3
0
16 Jul 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Average-Reward Reinforcement Learning with Trust Region MethodsInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
145
23
0
07 Jun 2021
Learning and Information in Stochastic Networks and Queues
Learning and Information in Stochastic Networks and Queues
N. Walton
Kuang Xu
177
21
0
18 May 2021
Benchmarking Perturbation-based Saliency Maps for Explaining Atari
  Agents
Benchmarking Perturbation-based Saliency Maps for Explaining Atari AgentsFrontiers in Artificial Intelligence (Front. Artif. Intell.), 2021
Tobias Huber
Benedikt Limmer
Elisabeth André
FAtt
135
18
0
18 Jan 2021
Scalable Deep Reinforcement Learning for Ride-Hailing
Scalable Deep Reinforcement Learning for Ride-HailingIEEE Control Systems Letters (L-CSS), 2020
Jiekun Feng
Mark O. Gluzman
J. Dai
BDL
166
19
0
27 Sep 2020
An online learning approach to dynamic pricing and capacity sizing in
  service systems
An online learning approach to dynamic pricing and capacity sizing in service systemsOperational Research (OR), 2020
Xinyun Chen
Yunan Liu
Yunan Liu
133
25
0
07 Sep 2020
1