2409.09603
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison
15 September 2024
Judy Hanwen Shen
Archit Sharma
Jun Qin
Papers citing "Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison"
3 / 3 papers shown
Title
HelpSteer2-Preference: Complementing Ratings with Preferences
Zhilin Wang
Alexander Bukharin
Olivier Delalleau
Daniel Egert
Gerald Shen
Jiaqi Zeng
Oleksii Kuchaiev
Yi Dong
ALM
42
37
0
02 Oct 2024
LESS: Selecting Influential Data for Targeted Instruction Tuning
Mengzhou Xia
Sadhika Malladi
Suchin Gururangan
Sanjeev Arora
Danqi Chen
77
180
0
06 Feb 2024
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld
Iz Beltagy
Pete Walsh
Akshita Bhagia
Rodney Michael Kinney
...
Jesse Dodge
Kyle Lo
Luca Soldaini
Noah A. Smith
Hanna Hajishirzi
OSLM
130
349
0
01 Feb 2024