ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.05328
  4. Cited By
Reward Learning From Preference With Ties

Reward Learning From Preference With Ties

5 October 2024
Jinsong Liu
Dongdong Ge
Ruihao Zhu
ArXiv (abs)PDFHTML

Papers citing "Reward Learning From Preference With Ties"

6 / 6 papers shown
Title
Consecutive Preferential Bayesian Optimization
Consecutive Preferential Bayesian Optimization
Aras Erarslan
Carlos Sevilla Salcedo
Ville Tanskanen
Anni Nisov
Eero Päiväkumpu
Heikki Aisala
Kaisu Honkapää
Arto Klami
Petrus Mikkola
54
0
0
07 Nov 2025
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Xiangyu Meng
Zixian Zhang
Zhenghao Zhang
Junchao Liao
Long Qin
Weizhi Wang
VGen
119
1
0
16 Oct 2025
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge
Guangchen Lan
Sipeng Zhang
Tianle Wang
Yuwei Zhang
Daoan Zhang
Xinpeng Wei
Xiaoman Pan
Hongming Zhang
Dong-Jun Han
Christopher G. Brinton
202
2
0
27 Jul 2025
Improving Video Generation with Human Feedback
Improving Video Generation with Human Feedback
Jie Liu
Gongye Liu
Jiajun Liang
Ziyang Yuan
Xiaokun Liu
...
Fei Yang
Pengfei Wan
Di Zhang
Kun Gai
Yujiu Yang
VGenEGVM
374
92
0
23 Jan 2025
A Statistical Framework for Ranking LLM-Based Chatbots
A Statistical Framework for Ranking LLM-Based ChatbotsInternational Conference on Learning Representations (ICLR), 2024
Siavash Ameli
Siyuan Zhuang
Ion Stoica
Michael W. Mahoney
ELM
175
5
0
24 Dec 2024
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Shang Liu
Yu Pan
Guanting Chen
Xiaocheng Li
266
3
0
19 Nov 2024
1