ResearchTrend.AI
arXiv: 2409.17827

BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text

26 September 2024
Siyan Wang
Bradford Levy

Papers citing "BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text"

1 / 1 papers shown
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization
Xiyue Peng, Hengquan Guo, Jiawei Zhang, Dongqing Zou, Ziyu Shao, Honghao Wei, Xin Liu
25 Oct 2024