Robust Preference Optimization via Dynamic Target Margins
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
4 June 2025
Jie Sun
Junkang Wu
Jiancan Wu
Zhibo Zhu
Xingyu Lu
Jun Zhou
Lintao Ma
Xiang Wang

Papers citing "Robust Preference Optimization via Dynamic Target Margins"

3 / 3 papers shown
Failure Modes of Maximum Entropy RLHF
Ömer Veysel Çağatan
Barış Akgün
24 Sep 2025
LIMI: Less is More for Agency
Yang Xiao
Mohan Jiang
Jie Sun
Keyu Li
Jifan Lin
...
Y. Cheng
Wenjie Li
Xiang Wang
Dequan Wang
Pengfei Liu
VLM
22 Sep 2025
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi
Yiran Luo
Agneet Chatterjee
Shamanthak Hegde
Bimsara Pathiraja
Yezhou Yang
Chitta Baral
DiffM
09 Feb 2025