Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.10580
Cited By
PHOENIX: Open-Source Language Adaption for Direct Preference Optimization
19 January 2024
Matthias Uhlig
Sigurd Schacht
Sudarshan Kamath Barkur
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PHOENIX: Open-Source Language Adaption for Direct Preference Optimization"
3 / 3 papers shown
Title
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
Deep Ganguli
Liane Lovitt
John Kernion
Amanda Askell
Yuntao Bai
...
Nicholas Joseph
Sam McCandlish
C. Olah
Jared Kaplan
Jack Clark
218
441
0
23 Aug 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019
1