Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.10671
Cited By
Pessimism for Offline Linear Contextual Bandits using
ℓ
p
\ell_p
ℓ
p
Confidence Sets
21 May 2022
Gen Li
Cong Ma
Nathan Srebro
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pessimism for Offline Linear Contextual Bandits using $\ell_p$ Confidence Sets"
3 / 3 papers shown
Title
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
13
5
0
05 Feb 2023
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
91
20
0
13 Jul 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,944
0
04 May 2020
1