Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.20370
Cited By
The Perfect Blend: Redefining RLHF with Mixture of Judges
30 September 2024
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
Eric Han
Shaoliang Nie
Chen Zhu
Hejia Zhang
Wenxuan Zhou
Z. Zeng
Yun He
Karishma Mandyam
Arya Talabzadeh
Madian Khabsa
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Perfect Blend: Redefining RLHF with Mixture of Judges"
2 / 2 papers shown
Title
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
49
4
0
29 Jan 2025
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM
LRM
23
0
0
11 Oct 2024
1