arXiv:2001.08116
Q-Learning in enormous action spaces via amortized approximate maximization
22 January 2020
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
Papers citing
"Q-Learning in enormous action spaces via amortized approximate maximization"
16 / 16 papers shown
Title
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
27
0
0
29 Jan 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
24
1
0
07 Jul 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
50
37
0
02 May 2023
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
35
22
0
22 Oct 2022
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
16
3
0
19 Sep 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
75
43
0
01 Feb 2022
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
J. Luis
Skylar Eiskowitz
Nils Pachler de la Osa
E. Crawley
B. Cameron
13
5
0
11 Dec 2021
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
61
37
0
05 Oct 2021
Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran
Ashlesha Akella
Maxwell Standen
Junae Kim
David Bowman
Toby J. Richer
Chin-Teng Lin
38
38
0
14 Sep 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Reversible Action Design for Combinatorial Optimization with Reinforcement Learning
Fan Yao
Renqin Cai
Hongning Wang
27
11
0
14 Feb 2021
Learning Dexterous Manipulation from Suboptimal Experts
Rae Jeong
Jost Tobias Springenberg
Jackie Kay
Daniel Zheng
Yuxiang Zhou
Alexandre Galashov
N. Heess
F. Nori
OffRL
10
36
0
16 Oct 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
19
73
0
24 Jul 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
22
80
0
20 Jun 2020
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
37
0
28 Jun 2019