Pessimism for Offline Linear Contextual Bandits using $\ell_p$ Confidence Sets

21 May 2022

Papers citing "Pessimism for Offline Linear Contextual Bandits using $\ell_p$ Confidence Sets"

Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun
OffRL; 13; 5; 0; 05 Feb 2023

Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara, Wen Sun
OffRL; 91; 20; 0; 13 Jul 2021

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine, Aviral Kumar, George Tucker, Justin Fu
OffRL, GP; 329; 1,944; 0; 04 May 2020