A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Journal of Machine Learning Research (JMLR), 2022
Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Annual Conference on Computational Learning Theory (COLT), 2022
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Conference on Uncertainty in Artificial Intelligence (UAI), 2021