Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.15843
Cited By
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
22 April 2025
Junshu Pan
Wei Shen
Shulin Huang
Qiji Zhou
Yue Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model"
Title
No papers