Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07461
Cited By
Shallow Updates for Deep Reinforcement Learning
21 May 2017
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Shallow Updates for Deep Reinforcement Learning"
15 / 15 papers shown
Title
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
32
2
0
11 Dec 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
31
2
0
29 May 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
36
14
0
20 Nov 2022
Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind Conditions
Michael O'Connell
Guanya Shi
Xichen Shi
Soon-Jo Chung
31
22
0
02 Mar 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
27
18
0
07 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
Randomized Policy Learning for Continuous State and Action MDPs
Hiteshi Sharma
Rahul Jain
21
1
0
08 Jun 2020
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
Huimin Peng
VLM
OffRL
24
35
0
17 Apr 2020
Distributional Robustness and Regularization in Reinforcement Learning
E. Derman
Shie Mannor
27
44
0
05 Mar 2020
How Transferable are the Representations Learned by Deep Q Agents?
Jacob Tyo
Zachary Chase Lipton
OffRL
9
6
0
24 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
36
30
0
24 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
19
7
0
23 Jan 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
28
596
0
01 Jan 2019
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
310
2,896
0
15 Sep 2016
1