Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.16714
Cited By
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
22 October 2024
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Yaodong Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment"
1 / 1 papers shown
Title
Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach
Jiancong Xiao
Bojian Hou
Zhanliang Wang
Ruochen Jin
Q. Long
Weijie Su
Li Shen
26
0
0
04 May 2025
1