ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.01283
  4. Cited By
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
v1v2v3v4 (latest)

EPOpt: Learning Robust Neural Network Policies Using Model Ensembles

International Conference on Learning Representations (ICLR), 2016
5 October 2016
Aravind Rajeswaran
Sarvjeet Ghotra
Balaraman Ravindran
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "EPOpt: Learning Robust Neural Network Policies Using Model Ensembles"

50 / 230 papers shown
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
James Queeney
M. Benosman
OODOffRL
284
13
0
30 Jan 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Train Hard, Fight Easy: Robust Meta Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRLOOD
267
10
0
26 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative
  Reward Co-Training
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
154
1
0
18 Jan 2023
Robust Average-Reward Markov Decision Processes
Robust Average-Reward Markov Decision ProcessesAAAI Conference on Artificial Intelligence (AAAI), 2023
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
248
20
0
02 Jan 2023
Risk-Sensitive Policy with Distributional Reinforcement Learning
Risk-Sensitive Policy with Distributional Reinforcement Learning
Thibaut Théate
D. Ernst
OffRL
287
10
0
30 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a CurriculumInternational Conference on Machine Learning (ICML), 2022
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
388
18
0
24 Dec 2022
Security of Deep Reinforcement Learning for Autonomous Driving: A Survey
Security of Deep Reinforcement Learning for Autonomous Driving: A Survey
Ambra Demontis
Srishti Gupta
Christian Scano
Luca Demetrio
Kathrin Grosse
Hsiao-Ying Lin
Chengfang Fang
Battista Biggio
Fabio Roli
AAML
365
4
0
12 Dec 2022
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based
  Offline Reinforcement Learning
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
416
10
0
30 Nov 2022
Domain Generalization for Robust Model-Based Offline Reinforcement
  Learning
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OODOffRL
185
0
0
27 Nov 2022
Build generally reusable agent-environment interaction models
Build generally reusable agent-environment interaction models
Jun Jin
Hongming Zhang
Jun Luo
133
0
0
13 Nov 2022
ART/ATK: A research platform for assessing and mitigating the
  sim-to-real gap in robotics and autonomous vehicle engineering
ART/ATK: A research platform for assessing and mitigating the sim-to-real gap in robotics and autonomous vehicle engineering
A. Elmquist
Aaron Young
Thomas Hansen
Sriram Ashokkumar
Stefan Caldararu
...
Luning Fang
Henghua Shen
Xiangru Xu
R. Serban
Dan Negrut
237
22
0
09 Nov 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness
  to Model Misspecification
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model MisspecificationNeural Information Processing Systems (NeurIPS), 2022
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
271
15
0
07 Nov 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits
  and Hardness
On the Power of Pre-training for Generalization in RL: Provable Benefits and HardnessInternational Conference on Machine Learning (ICML), 2022
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
273
7
0
19 Oct 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns
  for Cross-Domain Adaptation
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain AdaptationAAAI Conference on Artificial Intelligence (AAAI), 2022
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
272
3
0
24 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust
  Reinforcement Learning
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
337
0
0
23 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
396
51
0
16 Sep 2022
Robust Constrained Reinforcement Learning
Robust Constrained Reinforcement Learning
Yue Wang
Fei Miao
Shaofeng Zou
171
21
0
14 Sep 2022
Distributionally Robust Offline Reinforcement Learning with Linear
  Function Approximation
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Xiaoteng Ma
Zhipeng Liang
Jose H. Blanchet
MingWen Liu
Li Xia
Jiheng Zhang
Qianchuan Zhao
Zhengyuan Zhou
OODOffRL
328
34
0
14 Sep 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free
  Reinforcement Learning
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
146
122
0
16 Aug 2022
Online vs. Offline Adaptive Domain Randomization Benchmark
Online vs. Offline Adaptive Domain Randomization BenchmarkInternational Workshop on Human Friendly Robotics (HFR), 2022
Gabriele Tiboni
Karol Arndt
Giuseppe Averta
Ville Kyrki
Tatiana Tommasi
OffRL
151
6
0
29 Jun 2022
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Haoyi Niu
Sanjay Kariyappa
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRLOnRL
593
69
0
27 Jun 2022
Defending Observation Attacks in Deep Reinforcement Learning via
  Detection and Denoising
Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising
Zikang Xiong
Joe Eappen
He Zhu
Suresh Jagannathan
AAML
121
13
0
14 Jun 2022
A software toolkit and hardware platform for investigating and comparing
  robot autonomy algorithms in simulation and reality
A software toolkit and hardware platform for investigating and comparing robot autonomy algorithms in simulation and reality
A. Elmquist
Aaron Young
Ishaan Mahajan
Kyle Fahey
Abhiraj Dashora
...
Stefan Caldararu
Victor Freire
Xiangru Xu
R. Serban
Dan Negrut
118
16
0
14 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
RORL: Robust Offline Reinforcement Learning via Conservative SmoothingNeural Information Processing Systems (NeurIPS), 2022
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
424
102
0
06 Jun 2022
Provably Sample-Efficient RL with Side Information about Latent Dynamics
Provably Sample-Efficient RL with Side Information about Latent DynamicsNeural Information Processing Systems (NeurIPS), 2022
Yao Liu
Dipendra Kumar Misra
Miroslav Dudík
Robert Schapire
127
2
0
27 May 2022
Data Valuation for Offline Reinforcement Learning
Data Valuation for Offline Reinforcement Learning
Amir Abolfazli
Gregory Palmer
D. Kudenko
OffRL
109
0
0
19 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Policy Gradient Method For Robust Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Yue Wang
Shaofeng Zou
265
93
0
15 May 2022
Efficient Risk-Averse Reinforcement Learning
Efficient Risk-Averse Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
276
51
0
10 May 2022
Control-oriented meta-learning
Control-oriented meta-learning
Spencer M. Richards
Navid Azizan
Jean-Jacques E. Slotine
Marco Pavone
173
34
0
14 Apr 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
Gradient-Based Trajectory Optimization With Learned DynamicsIEEE International Conference on Robotics and Automation (ICRA), 2022
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
286
14
0
09 Apr 2022
Context is Everything: Implicit Identification for Dynamics Adaptation
Context is Everything: Implicit Identification for Dynamics AdaptationIEEE International Conference on Robotics and Automation (ICRA), 2022
Ben Evans
Abitha Thankaraj
Lerrel Pinto
136
29
0
10 Mar 2022
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPsAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Sammie Katt
Hai V. Nguyen
F. Oliehoek
Chris Amato
BDLOffRL
92
2
0
17 Feb 2022
User-Oriented Robust Reinforcement Learning
User-Oriented Robust Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Haoyi You
Beichen Yu
Haiming Jin
Zhaoxing Yang
Jiahui Sun
OffRL
344
1
0
15 Feb 2022
Robust Policy Learning over Multiple Uncertainty Sets
Robust Policy Learning over Multiple Uncertainty SetsInternational Conference on Machine Learning (ICML), 2022
Annie Xie
Shagun Sodhani
Chelsea Finn
Joelle Pineau
Amy Zhang
OODOffRL
315
24
0
14 Feb 2022
Robust Learning from Observation with Model Misspecification
Robust Learning from Observation with Model MisspecificationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Luca Viano
Yu-ting Huang
Parameswaran Kamalaruban
Craig Innes
S. Ramamoorthy
Adrian Weller
OOD
184
11
0
12 Feb 2022
Uncertainty Aware System Identification with Universal Policies
Uncertainty Aware System Identification with Universal PoliciesInternational Conference on Pattern Recognition (ICPR), 2022
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
293
3
0
11 Feb 2022
DROPO: Sim-to-Real Transfer with Offline Domain Randomization
DROPO: Sim-to-Real Transfer with Offline Domain Randomization
Gabriele Tiboni
Karol Arndt
Ville Kyrki
182
36
0
20 Jan 2022
Learning Robust Policy against Disturbance in Transition Dynamics via
  State-Conservative Policy Optimization
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2021
Yufei Kuang
Miao Lu
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
128
26
0
20 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based
  Actor-Critic
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
192
37
0
16 Dec 2021
Unsupervised Reinforcement Learning in Multiple Environments
Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti
Mattia Mancassola
Marcello Restelli
OffRL
155
27
0
16 Dec 2021
Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation
Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation
Volodymyr Kuleshov
Shachi Deshpande
UQCVBDL
749
40
0
14 Dec 2021
On Effective Scheduling of Model-based Reinforcement Learning
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
241
22
0
16 Nov 2021
Robot Learning from Randomized Simulations: A Review
Robot Learning from Randomized Simulations: A ReviewFrontiers in Robotics and AI (Front. Robot. AI), 2021
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
316
111
0
01 Nov 2021
Block Contextual MDPs for Continual Learning
Block Contextual MDPs for Continual Learning
Shagun Sodhani
Franziska Meier
Joelle Pineau
Amy Zhang
CLL
260
33
0
13 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPsInternational Conference on Machine Learning (ICML), 2021
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
414
141
0
11 Oct 2021
Neural Network Verification in Control
Neural Network Verification in Control
M. Everett
AAML
155
19
0
30 Sep 2021
Online Robust Reinforcement Learning with Model Uncertainty
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OODOffRL
381
134
0
29 Sep 2021
Robust Model-based Reinforcement Learning for Autonomous Greenhouse
  Control
Robust Model-based Reinforcement Learning for Autonomous Greenhouse ControlAsian Conference on Machine Learning (ACML), 2021
Wanpeng Zhang
Xiaoyan Cao
Yaowen Yao
Zhicheng An
Xi Xiao
Dijun Luo
OffRL
190
23
0
26 Aug 2021
Safe Learning in Robotics: From Learning-Based Control to Safe
  Reinforcement Learning
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
270
783
0
13 Aug 2021
MBDP: A Model-based Approach to Achieve both Robustness and Sample
  Efficiency via Double Dropout Planning
MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning
Wanpeng Zhang
Xi Xiao
Yaowen Yao
Mingzhe Chen
Dijun Luo
OffRL
163
1
0
03 Aug 2021
Previous
12345
Next