ResearchTrend.AI

arXiv: 2410.16714
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment

22 October 2024
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Yaodong Yang

Papers citing "Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment"

Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach
Jiancong Xiao
Bojian Hou
Zhanliang Wang
Ruochen Jin
Q. Long
Weijie Su
Li Shen
04 May 2025