Randomised Bayesian Least-Squares Policy Iteration
Papers citing "Randomised Bayesian Least-Squares Policy Iteration"
1 / 1 papers shown
Worst-Case Regret Bounds for Exploration via Randomized Value FunctionsNeural Information Processing Systems (NeurIPS), 2019 |
