
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
Papers citing "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment"
13 / 13 papers shown
Title |
|---|
![]() Rethinking Diverse Human Preference Learning through Principal Component AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() LiPO: Listwise Preference Optimization through Learning-to-RankNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |













