Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.02216
Cited By
Composing Entropic Policies using Divergence Correction
5 December 2018
Jonathan J. Hunt
André Barreto
Timothy Lillicrap
N. Heess
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Composing Entropic Policies using Divergence Correction"
2 / 2 papers shown
Title
The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto
Diana Borsa
Shaobo Hou
Gheorghe Comanici
Eser Aygun
...
Daniel Toyama
Jonathan J. Hunt
Shibl Mourad
David Silver
Doina Precup
27
98
0
24 Jun 2021
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
29
60
0
22 Jan 2020
1