2410.08458
Cited By
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
11 October 2024
Abhijnan Nath
Changsoo Jung
Ethan Seefried
Nikhil Krishnaswamy
Papers citing "Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both"
No papers