Hindsight Learning for MDPs with Exogenous Inputs

13 July 2022
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan

Papers citing "Hindsight Learning for MDPs with Exogenous Inputs"

16 / 16 papers shown
Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
Jiehao Wu
Ziwei Wang
Junjie Sheng
Wenhao Li
Xiangfeng Wang
Jun Luo
19
0
0
15 May 2025
Multi-Agent Reinforcement Learning with Long-Term Performance Objectives for Service Workforce Optimization
Kareem Eissa
Rayal Prasad
Sarith Mohan
Ankur Kapoor
D. Comaniciu
V. Singh
33
0
0
03 Mar 2025
Scalable Reinforcement Learning for Virtual Machine Scheduling
Junjie Sheng
Jiehao Wu
Haochuan Cui
Yiqiu Hu
Wenli Zhou
Lei Zhu
Qian Peng
Wenhao Li
Xiangfeng Wang
OffRL
31
0
0
01 Mar 2025
Zero-shot Generalization in Inventory Management: Train, then Estimate and Decide
Tarkan Temizoz
Christina Imdahl
R. Dijkman
Douniel Lamghari-Idrissi
W. Jaarsveld
19
1
0
01 Nov 2024
Neural Coordination and Capacity Control for Inventory Management
Carson Eisenach
Udaya Ghai
Dhruv Madeka
Kari Torkkola
Dean Phillips Foster
Sham Kakade
13
0
0
24 Sep 2024
On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Christine Herlihy
Jennifer Neville
Tobias Schnabel
Adith Swaminathan
36
3
0
01 Jun 2024
VC Theory for Inventory Policies
Yaqi Xie
Will Ma
Linwei Xin
18
5
0
17 Apr 2024
Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation
Tonghe Zhang
Yu Chen
Longbo Huang
31
0
0
28 Feb 2024
Optimizing Heat Alert Issuance with Reinforcement Learning
Ellen M. Considine
Rachel C. Nethery
G. Wellenius
Francesca Dominici
Mauricio Tec
OffRL
21
0
0
21 Dec 2023
Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Sohrab Andaz
Carson Eisenach
Dhruv Madeka
Kari Torkkola
Randy Jia
Dean Phillips Foster
Sham Kakade
17
2
0
26 Oct 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information
Ming Shi
Yingbin Liang
Ness B. Shroff
26
2
0
14 Jun 2023
Efficient Reinforcement Learning with Impaired Observability: Learning to Act with Delayed and Missing State Observations
Minshuo Chen
Jie Meng
Yunru Bai
Yinyu Ye
H. Vincent Poor
Mengdi Wang
23
0
0
02 Jun 2023
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
24
10
0
25 May 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddhartha Sen
MohammadIman Alizadeh
CLL
OffRL
40
0
0
04 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
21
19
0
31 Jan 2023
Deep Inventory Management
Dhruv Madeka
Kari Torkkola
Carson Eisenach
Anna Luo
Dean Phillips Foster
Sham M. Kakade
BDL
35
15
0
06 Oct 2022