Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.00539
Cited By
Distributionally Robust Reinforcement Learning with Human Feedback
1 March 2025
Debmalya Mandal
Paulius Sasnauskas
Goran Radanović
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Distributionally Robust Reinforcement Learning with Human Feedback"
1 / 1 papers shown
Title
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
Kai Ye
Hongyi Zhou
Jin Zhu
Francesco Quinzan
C. Shi
16
0
0
03 Apr 2025
1