ResearchTrend.AI
Diverse Exploration for Fast and Safe Policy Improvement

22 February 2018
Andrew Cohen
Lei Yu
Robert Wright
    OffRL

Papers citing "Diverse Exploration for Fast and Safe Policy Improvement"

4 / 4 papers shown
Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2019
Thommen George Karimpanal
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
OffRL, OnRL
286
11
0
10 Sep 2019
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies
International Joint Conference on Artificial Intelligence (IJCAI), 2019
M. A. Masood
Finale Doshi-Velez
197
57
0
31 May 2019
Smoothing Policies and Safe Policy Gradients
Machine-mediated learning (ML), 2019
Matteo Papini
Matteo Pirotta
Marcello Restelli
319
36
0
08 May 2019
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
AAAI Conference on Artificial Intelligence (AAAI), 2019
Andrew Cohen
Xingye Qiao
Lei Yu
E. Way
Xiangrong Tong
137
9
0
10 Feb 2019