Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.10719
Cited By
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
16 April 2024
Shusheng Xu
Wei Fu
Jiaxuan Gao
Wenjie Ye
Weiling Liu
Zhiyu Mei
Guangju Wang
Chao Yu
Yi Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study"
2 / 102 papers shown
Title
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
273
1,561
0
18 Sep 2019
The Woman Worked as a Babysitter: On Biases in Language Generation
Emily Sheng
Kai-Wei Chang
Premkumar Natarajan
Nanyun Peng
204
607
0
03 Sep 2019
Previous
1
2
3