1605.08062
A PAC RL Algorithm for Episodic POMDPs
25 May 2016
Z. Guo
Shayan Doroudi
Emma Brunskill
Papers citing
"A PAC RL Algorithm for Episodic POMDPs"
37 / 37 papers shown
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
International Conference on Learning Representations (ICLR), 2025
Yuheng Zhang
Nan Jiang
OffRL
302
5
0
03 Mar 2025
Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting
Alessio Russo
Alberto Maria Metelli
Marcello Restelli
251
1
0
02 Oct 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
Constantine Caramanis
Yonathan Efroni
OffRL
444
6
0
03 Jun 2024
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang
Nan Jiang
OffRL
318
7
0
22 Feb 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
451
9
0
20 Nov 2023
Posterior Sampling-based Online Learning for Episodic POMDPs
Dengwang Tang
Dongze Ye
Rahul Jain
A. Nayyar
Pierluigi Nuzzo
OffRL
450
1
0
16 Oct 2023
Learning Optimal Admission Control in Partially Observable Queueing Networks
Jonatha Anselmi
B. Gaujal
Louis-Sébastien Rebuffi
176
2
0
04 Aug 2023
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight
International Conference on Learning Representations (ICLR), 2023
Jiacheng Guo
Minshuo Chen
Haiquan Wang
Caiming Xiong
Mengdi Wang
Yu Bai
298
6
0
06 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
International Conference on Learning Representations (ICLR), 2023
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
424
6
0
01 Jul 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
International Conference on Machine Learning (ICML), 2023
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
280
8
0
21 Jun 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
International Conference on Machine Learning (ICML), 2023
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
346
6
0
01 May 2023
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
International Conference on Automated Planning and Scheduling (ICAPS), 2023
Merlijn Krale
T. D. Simão
N. Jansen
OffRL
220
12
0
14 Mar 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
International Conference on Machine Learning (ICML), 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
360
25
0
31 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
International Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
361
24
0
29 Dec 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
213
5
0
05 Oct 2022
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
International Conference on Learning Representations (ICLR), 2022
Fan Chen
Yu Bai
Song Mei
345
25
0
29 Sep 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Neural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
499
25
0
26 Jul 2022
PAC Reinforcement Learning for Predictive State Representations
International Conference on Learning Representations (ICLR), 2022
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
542
46
0
12 Jul 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
International Conference on Machine Learning (ICML), 2022
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
262
9
0
24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Neural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
317
44
0
24 Jun 2022
Learning in Observable POMDPs, without Computationally Intractable Oracles
Neural Information Processing Systems (NeurIPS), 2022
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
274
32
0
07 Jun 2022
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games
Neural Information Processing Systems (NeurIPS), 2022
Qinghua Liu
Csaba Szepesvári
Chi Jin
313
32
0
02 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
International Conference on Learning Representations (ICLR), 2022
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
432
26
0
26 May 2022
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
382
19
0
26 May 2022
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
International Conference on Machine Learning (ICML), 2022
Qi Cai
Zhuoran Yang
Zhaoran Wang
242
17
0
20 Apr 2022
When Is Partially Observable Reinforcement Learning Not Scary?
Annual Conference on Computational Learning Theory (COLT), 2022
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
278
123
0
19 Apr 2022
Provable Reinforcement Learning with a Short-Term Memory
International Conference on Machine Learning (ICML), 2022
Yonathan Efroni
Chi Jin
A. Krishnamurthy
Sobhan Miryoosefi
OffRL
256
45
0
08 Feb 2022
Planning in Observable POMDPs in Quasipolynomial Time
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
311
27
0
12 Jan 2022
Reinforcement Learning in Reward-Mixing MDPs
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
412
19
0
07 Oct 2021
Sublinear Regret for Learning POMDPs
Production and Operations Management (POM), 2021
Yi Xiong
Ningyuan Chen
Xiang Zhou
449
26
0
08 Jul 2021
RL for Latent MDPs: Regret Guarantees and a Lower Bound
Neural Information Processing Systems (NeurIPS), 2021
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
330
88
0
09 Feb 2021
Sequential Transfer in Reinforcement Learning with a Generative Model
Andrea Tirinzoni
Riccardo Poiani
Marcello Restelli
220
26
0
01 Jul 2020
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs
Chi Jin
Sham Kakade
A. Krishnamurthy
Qinghua Liu
416
76
0
22 Jun 2020
Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process
Hyung-Jin Yoon
Donghwan Lee
N. Hovakimyan
230
10
0
17 Sep 2018
On Oracle-Efficient PAC RL with Rich Observations
Neural Information Processing Systems (NeurIPS), 2018
Christoph Dann
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
355
105
0
01 Mar 2018
Reinforcement Learning in Rich-Observation MDPs using Spectral Methods
Kamyar Azizzadenesheli
A. Lazaric
Anima Anandkumar
346
31
0
11 Nov 2016
Reinforcement Learning of POMDPs using Spectral Methods
Kamyar Azizzadenesheli
A. Lazaric
Anima Anandkumar
268
141
0
25 Feb 2016