arXiv: 2409.20370
The Perfect Blend: Redefining RLHF with Mixture of Judges


30 September 2024
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
Eric Han
Shaoliang Nie
Chen Zhu
Hejia Zhang
Wenxuan Zhou
Z. Zeng
Yun He
Karishma Mandyam
Arya Talabzadeh
Madian Khabsa
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang

Papers citing "The Perfect Blend: Redefining RLHF with Mixture of Judges"

2 / 2 papers shown
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
49
4
0
29 Jan 2025
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM
LRM
23
0
0
11 Oct 2024