ResearchTrend.AI


Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization


25 October 2024
Xiyue Peng
Hengquan Guo
Jiawei Zhang
Dongqing Zou
Ziyu Shao
Honghao Wei
Xin Liu

Papers citing "Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization"

No citing papers are listed.