ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.03469
  4. Cited By
Rethinking the Role of Proxy Rewards in Language Model Alignment
v1v2 (latest)

Rethinking the Role of Proxy Rewards in Language Model Alignment

2 February 2024
Sungdong Kim
Minjoon Seo
    SyDaALM
ArXiv (abs)PDFHTMLGithub (2★)

Papers citing "Rethinking the Role of Proxy Rewards in Language Model Alignment"

5 / 5 papers shown
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
Shibing Mo
Haoyang Ruan
Kai Wu
Jing Liu
280
0
0
10 Nov 2025
CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation
CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation
Alyssa Unell
Noel C. F. Codella
Sam Preston
Peniel Argaw
Wen-wai Yim
...
Jiachen Li
Shrey Jain
Mu-Hsin Wei
M. Lungren
Hoifung Poon
AI4TS
314
0
0
09 Sep 2025
Towards Reward Fairness in RLHF: From a Resource Allocation Perspective
Towards Reward Fairness in RLHF: From a Resource Allocation PerspectiveAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Sheng Ouyang
Yulan Hu
Ge Chen
Qingyang Li
Fuzheng Zhang
Yong Liu
283
8
0
29 May 2025
MPO: Multilingual Safety Alignment via Reward Gap Optimization
MPO: Multilingual Safety Alignment via Reward Gap OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Weixiang Zhao
Yulin Hu
Yang Deng
Tongtong Wu
Wenxuan Zhang
...
An Zhang
Yanyan Zhao
Bing Qin
Tat-Seng Chua
Ting Liu
393
11
0
22 May 2025
Yi: Open Foundation Models by 01.AI
Yi: Open Foundation Models by 01.AI
01. AI
Alex Young
01.AI Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLMLRM
1.1K
828
0
07 Mar 2024
1
Page 1 of 1