1901.00117
Cited By
An Active Learning Framework for Efficient Robust Policy Search
1 January 2019
Sai Kiran Narayanaswami
N. Sudarsanam
Balaraman Ravindran
Papers citing
"An Active Learning Framework for Efficient Robust Policy Search"
21 / 21 papers shown
BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators
F. Ramos
Rafael Possas
Dieter Fox
29
156
0
04 Jun 2019
Learning Robust Options by Conditional Value at Risk Optimization
Takuya Hiraoka
Takahisa Imagawa
Tatsuya Mori
Takashi Onishi
Yoshimasa Tsuruoka
21
27
0
22 May 2019
Bayesian Policy Optimization for Model Uncertainty
Gilwoo Lee
Brian Hou
Aditya Mandalika
Jeongseok Lee
Sanjiban Choudhury
S. Srinivasa
75
41
0
01 Oct 2018
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
33
450
0
28 Feb 2018
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
78
1,355
0
18 Oct 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
183
18,685
0
20 Jul 2017
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
113
2,948
0
20 Mar 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
67
848
0
08 Mar 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
51
305
0
08 Feb 2017
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
64
757
0
03 Nov 2016
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
Aravind Rajeswaran
Sarvjeet Ghotra
Balaraman Ravindran
Sergey Levine
75
349
0
05 Oct 2016
Progressive Neural Networks
Andrei A. Rusu
Neil C. Rabinowitz
Guillaume Desjardins
Hubert Soyer
J. Kirkpatrick
Koray Kavukcuoglu
Razvan Pascanu
R. Hadsell
CLL
AI4CE
33
2,428
0
15 Jun 2016
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
OffRL
54
594
0
19 Nov 2015
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
37
685
0
19 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
130
13,174
0
09 Sep 2015
Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits
Alexandra Carpentier
A. Lazaric
Mohammad Ghavamzadeh
Rémi Munos
P. Auer
András Antos
17
98
0
16 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
29
3,368
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
212
6,722
0
19 Feb 2015
Optimizing the CVaR via Sampling
Aviv Tamar
Yonatan Glassner
Shie Mannor
35
186
0
15 Apr 2014
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
84
993
0
15 Sep 2012
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
184
2,935
0
28 Feb 2010