Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2402.09764
Cited By

Aligning Crowd Feedback via Distributional Preference Reward Modeling

v1v2v3 (latest)

Aligning Crowd Feedback via Distributional Preference Reward Modeling

15 February 2024

Dexun Li

Derrick-Goh-Xin Deik

Ruiming Tang

Yong Liu

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github

Papers citing "Aligning Crowd Feedback via Distributional Preference Reward Modeling"

8 / 8 papers shown

Pluralistic Off-policy Evaluation and Alignment

Pluralistic Off-policy Evaluation and Alignment

221

4

0

15 Sep 2025

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

443

42

0

12 Apr 2025

Improving LLM-as-a-Judge Inference with the Judgment Distribution

Improving LLM-as-a-Judge Inference with the Judgment Distribution

Michael J.Q. Zhang

598

26

0

04 Mar 2025

LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces

LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces

Rashid Mushkani

Hadrien Bertrand

448

11

0

27 Feb 2025

Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment

Disentangling Preference Representation and Text Generation for Efficient Individual Preference AlignmentInternational Conference on Computational Linguistics (COLING), 2024

397

5

0

31 Dec 2024

Geometric-Averaged Preference Optimization for Soft Preference Labels

Geometric-Averaged Preference Optimization for Soft Preference LabelsNeural Information Processing Systems (NeurIPS), 2024

Shixiang Shane Gu

Aleksandra Faust

483

17

0

31 Dec 2024

Alignment of Diffusion Models: Fundamentals, Challenges, and Future

Alignment of Diffusion Models: Fundamentals, Challenges, and Future

623

29

0

11 Sep 2024

Direct Preference Optimization With Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences

Direct Preference Optimization With Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences

Keertana Chidambaram

Karthik Vinay Seetharaman

Vasilis Syrgkanis

477

11

0

23 May 2024

Page 1 of 1