1301.6718
Cited By
On the Complexity of Policy Iteration
23 January 2013
Yishay Mansour
Satinder Singh
Papers citing "On the Complexity of Policy Iteration"
12 / 12 papers shown
Efficient Computation of Blackwell Optimal Policies using Rational Functions
Dibyangshu Mukherjee
Shivaram Kalyanakrishnan
OffRL
4
0
0
25 Aug 2025
Howard's Policy Iteration is Subexponential for Deterministic Markov Decision Problems with Rewards of Fixed Bit-size and Arbitrary Discount Factor
Dibyangshu Mukherjee
Shivaram Kalyanakrishnan
76
2
0
01 May 2025
Geometric Policy Iteration for Markov Decision Processes
Yue Wu
J. D. Loera
106
3
0
12 Jun 2022
Lower Bounds for Policy Iteration on Multi-action MDPs
Kumar Ashutosh
Sarthak Consul
Bhishma Dedhia
Parthasarathi Khirwadkar
Sahil Shah
Shivaram Kalyanakrishnan
33
3
0
16 Sep 2020
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
Aaron Sidford
Mengdi Wang
Lin F. Yang
Yinyu Ye
138
71
0
29 Aug 2019
The Value Function Polytope in Reinforcement Learning
Robert Dadashi
Adrien Ali Taïga
Nicolas Le Roux
Dale Schuurmans
Marc G. Bellemare
110
47
0
31 Jan 2019
On the Complexity of Value Iteration
N. Balaji
S. Kiefer
Petr Novotný
G. Pérez
M. Shirmohammadi
88
14
0
13 Jul 2018
Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
Aaron Sidford
Mengdi Wang
X. Wu
Yinyu Ye
162
132
0
27 Oct 2017
Primal-Dual π Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems
Mengdi Wang
173
70
0
17 Oct 2017
Lower Bound On the Computational Complexity of Discounted Markov Decision Problems
Yichen Chen
Mengdi Wang
85
18
0
20 May 2017
Improved and Generalized Upper Bounds on the Complexity of Policy Iteration
B. Scherrer
190
77
0
03 Jun 2013
A Learning Theoretic Approach to Energy Harvesting Communication System Optimization
Pol Blasco
Deniz Gündüz
Mischa Dohler
157
269
0
21 Aug 2012