ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.02216
  4. Cited By
Composing Entropic Policies using Divergence Correction

Composing Entropic Policies using Divergence Correction

5 December 2018
Jonathan J. Hunt
André Barreto
Timothy Lillicrap
N. Heess
ArXivPDFHTML

Papers citing "Composing Entropic Policies using Divergence Correction"

2 / 2 papers shown
Title
The Option Keyboard: Combining Skills in Reinforcement Learning
The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto
Diana Borsa
Shaobo Hou
Gheorghe Comanici
Eser Aygun
...
Daniel Toyama
Jonathan J. Hunt
Shibl Mourad
David Silver
Doina Precup
27
98
0
24 Jun 2021
Q-Learning in enormous action spaces via amortized approximate
  maximization
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
29
60
0
22 Jan 2020
1