ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.03574
  4. Cited By
Provably Safe PAC-MDP Exploration Using Analogies
v1v2 (latest)

Provably Safe PAC-MDP Exploration Using Analogies

7 July 2020
Melrose Roderick
Vaishnavh Nagarajan
J. Zico Kolter
ArXiv (abs)PDFHTML

Papers citing "Provably Safe PAC-MDP Exploration Using Analogies"

7 / 7 papers shown
Deterministic Policies for Constrained Reinforcement Learning in
  Polynomial-Time
Deterministic Policies for Constrained Reinforcement Learning in Polynomial-TimeNeural Information Processing Systems (NeurIPS), 2024
Jeremy McMahan
283
3
0
23 May 2024
Long-term Safe Reinforcement Learning with Binary Feedback
Long-term Safe Reinforcement Learning with Binary FeedbackAAAI Conference on Artificial Intelligence (AAAI), 2024
Akifumi Wachi
Wataru Hashimoto
Kazumune Hashimoto
OffRL
415
6
0
08 Jan 2024
Anytime-Constrained Reinforcement Learning
Anytime-Constrained Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jeremy McMahan
Xiaojin Zhu
347
9
0
09 Nov 2023
Safe Sequential Optimization for Switching Environments
Safe Sequential Optimization for Switching Environments
Durgesh Kalwar
S. VineethB.
253
0
0
03 Nov 2023
Provable Safe Reinforcement Learning with Binary Feedback
Provable Safe Reinforcement Learning with Binary FeedbackInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Andrew Bennett
Dipendra Kumar Misra
Nathan Kallus
OffRL
326
8
0
26 Oct 2022
Safe Reinforcement Learning by Imagining the Near Future
Safe Reinforcement Learning by Imagining the Near FutureNeural Information Processing Systems (NeurIPS), 2022
G. Thomas
Yuping Luo
Tengyu Ma
OffRL
242
107
0
15 Feb 2022
Learning Barrier Certificates: Towards Safe Reinforcement Learning with
  Zero Training-time Violations
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo
Tengyu Ma
OffRL
326
50
0
04 Aug 2021
1
Page 1 of 1