Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.07478
Cited By
Adaptive Trade-Offs in Off-Policy Learning
16 October 2019
Mark Rowland
Will Dabney
Rémi Munos
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Trade-Offs in Off-Policy Learning"
8 / 8 papers shown
Title
Off-policy Distributional Q(
λ
λ
λ
): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
10
1
0
08 Feb 2024
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Brett Daley
Martha White
Chris Amato
Marlos C. Machado
OffRL
9
3
0
26 Jan 2023
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
43
13
0
11 Jul 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading
Zitao Song
Xuyang Jin
Chenliang Li
OffRL
AIFin
21
1
0
13 Jun 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Improving the Efficiency of Off-Policy Reinforcement Learning by Accounting for Past Decisions
Brett Daley
Chris Amato
OffRL
16
1
0
23 Dec 2021
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
40
31
0
28 Jun 2021
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
27
24
0
12 Jun 2020
1