Posterior Sampling with Delayed Feedback for Reinforcement Learning with
Linear Function ApproximationNeural Information Processing Systems (NeurIPS), 2023 |
Efficient Reinforcement Learning with Impaired Observability: Learning
to Act with Delayed and Missing State ObservationsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023 |