Beyond the Binary: Capturing Diverse Preferences With Reward Regularization

5 December 2024
Vishakh Padmakumar
Chuanyang Jin
Hannah Rose Kirk
He He
Papers citing "Beyond the Binary: Capturing Diverse Preferences With Reward Regularization"

Learning to vary: Teaching LMs to reproduce human linguistic variability in next-word prediction
Tobias Groot
Salo Lacunes
Evgenia Ilia
180
0
0
22 Sep 2025
Towards Reward Fairness in RLHF: From a Resource Allocation Perspective
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sheng Ouyang
Yulan Hu
Ge Chen
Qingyang Li
Fuzheng Zhang
Yong Liu
233
5
0
29 May 2025
Improving LLM-as-a-Judge Inference with the Judgment Distribution
Victor Wang
Michael J.Q. Zhang
Eunsol Choi
495
18
0
04 Mar 2025
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
Yijiang River Dong
Tiancheng Hu
Yinhong Liu
Ahmet Üstün
Nigel Collier
315
7
0
26 Feb 2025
Diverse Preference Optimization
Jack Lanchantin
Angelica Chen
Shehzaad Dhuliawala
Ping Yu
Jason Weston
Sainbayar Sukhbaatar
Ilia Kulikov
739
23
0
30 Jan 2025