Adaptive Trade-Offs in Off-Policy Learning

16 October 2019

Papers citing "Adaptive Trade-Offs in Off-Policy Learning"

8 / 8 papers shown

Title
Off-policy Distributional Q( $λ$ ): Distributional RL without Importance Sampling Yunhao Tang Mark Rowland Rémi Munos Bernardo Avila-Pires Will Dabney OffRL 10 1 0 08 Feb 2024
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning Brett Daley Martha White Chris Amato Marlos C. Machado OffRL 11 3 0 26 Jan 2023
Grounding Aleatoric Uncertainty for Unsupervised Environment Design Minqi Jiang Michael Dennis Jack Parker-Holder Andrei Lupu Heinrich Küttler Edward Grefenstette Tim Rocktaschel Jakob N. Foerster 43 13 0 11 Jul 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading Zitao Song Xuyang Jin Chenliang Li OffRL AIFin 23 1 0 13 Jun 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems Jack Parker-Holder Raghunandan Rajan Xingyou Song André Biedenkapp Yingjie Miao ... Vu-Linh Nguyen Roberto Calandra Aleksandra Faust Frank Hutter Marius Lindauer AI4CE 33 100 0 11 Jan 2022
Improving the Efficiency of Off-Policy Reinforcement Learning by Accounting for Past Decisions Brett Daley Chris Amato OffRL 16 1 0 23 Dec 2021
On component interactions in two-stage recommender systems Jiri Hron K. Krauth Michael I. Jordan Niki Kilbertus CML LRM 40 31 0 28 Jun 2021
Self-Imitation Learning via Generalized Lower Bound Q-learning Yunhao Tang SSL 30 24 0 12 Jun 2020