
Title |
|---|
![]() Debiasing Online Preference Learning via Preference Feature PreservationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Robust Preference Optimization via Dynamic Target MarginsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Probability-Consistent Preference Optimization for Enhanced LLM ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Frictional Agent Alignment Framework: Slow Down and Don't Break ThingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() MPO: Multilingual Safety Alignment via Reward Gap OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Cross-Lingual Optimization for Language Transfer in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() SGDPO: Self-Guided Direct Preference Optimization for Language Model AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |