ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.09764
  4. Cited By
Aligning Crowd Feedback via Distributional Preference Reward Modeling
v1v2v3 (latest)

Aligning Crowd Feedback via Distributional Preference Reward Modeling

15 February 2024
Dexun Li
Cong Zhang
Kuicai Dong
Derrick-Goh-Xin Deik
Ruiming Tang
Yong Liu
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github

Papers citing "Aligning Crowd Feedback via Distributional Preference Reward Modeling"

8 / 8 papers shown
Pluralistic Off-policy Evaluation and Alignment
Pluralistic Off-policy Evaluation and Alignment
Chengkai Huang
Junda Wu
Zhouhang Xie
Yu Xia
Rui Wang
Tong Yu
Subrata Mitra
Julian McAuley
L. Yao
OffRL
221
4
0
15 Sep 2025
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future
Jialun Zhong
Wei Shen
Yanzeng Li
Songyang Gao
Hua Lu
Yicheng Chen
Yang Zhang
Wei Zhou
Jinjie Gu
Lei Zou
LRM
443
42
0
12 Apr 2025
Improving LLM-as-a-Judge Inference with the Judgment Distribution
Improving LLM-as-a-Judge Inference with the Judgment Distribution
Victor Wang
Michael J.Q. Zhang
Eunsol Choi
598
26
0
04 Mar 2025
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces
Rashid Mushkani
Shravan Nayak
Hugo Berard
Allison Cohen
Shin Koseki
Hadrien Bertrand
448
11
0
27 Feb 2025
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
Disentangling Preference Representation and Text Generation for Efficient Individual Preference AlignmentInternational Conference on Computational Linguistics (COLING), 2024
Jianfei Zhang
Jun Bai
Yangqiu Song
Yanmeng Wang
Rumei Li
Chenghua Lin
Wenge Rong
397
5
0
31 Dec 2024
Geometric-Averaged Preference Optimization for Soft Preference Labels
Geometric-Averaged Preference Optimization for Soft Preference LabelsNeural Information Processing Systems (NeurIPS), 2024
Hiroki Furuta
Kuang-Huei Lee
Shixiang Shane Gu
Y. Matsuo
Aleksandra Faust
Heiga Zen
Izzeddin Gur
483
17
0
31 Dec 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Bo Han
623
29
0
11 Sep 2024
Direct Preference Optimization With Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences
Direct Preference Optimization With Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences
Keertana Chidambaram
Karthik Vinay Seetharaman
Vasilis Syrgkanis
477
11
0
23 May 2024
1
Page 1 of 1