v1v2v3 (latest)

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

5 October 2023

Wanli Ouyang

Yu Qiao

ArXiv (abs)PDF HTML

Papers citing "Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization"

50 / 60 papers shown

Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

137

14 Oct 2025

OrthAlign: Orthogonal Subspace Decomposition for Non-Interfering Multi-Objective Alignment

...

262

29 Sep 2025

MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems

183

26 Sep 2025

Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting

127

14 Sep 2025

FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

136

15 Aug 2025

Data Selection for LLM Alignment Using Fine-Grained Preferences

140

11 Aug 2025

Aligning LLMs on a Budget: Inference-Time Alignment with Heuristic Reward Models

161

07 Aug 2025

Sotopia-RL: Reward Design for Social Intelligence

Bodhisattwa Prasad Majumder

238

05 Aug 2025

PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization

225

22 Jul 2025

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

207

07 Jul 2025

A Framework for Controllable Multi-objective Learning with Annealed Stein Variational Hypernetworks

Minh-Duc Nguyen

Dung D. Le

307

07 Jun 2025

Conformal Arbitrage: Risk-Controlled Balancing of Competing Objectives in Language Models

William Overman

Mohsen Bayati

282

01 Jun 2025

HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

234

01 Jun 2025

Learning Safety Constraints for Large Language Models

Xin Chen

Yarden As

Andreas Krause

202

30 May 2025

Differential Information Distribution: A Bayesian Perspective on Direct Preference Optimization

315

29 May 2025

Multi-objective Large Language Model Alignment with Hierarchical Experts

...

374

27 May 2025

Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO

378

26 May 2025

MOSLIM:Align with diverse preferences in prompts through reward classification

Yu Zhang

Wanli Jiang

Zhengyu Yang

206

24 May 2025

Understanding and Mitigating Overrefusal in LLMs from an Unveiling Perspective of Safety Decision Boundary

331

23 May 2025

Online Iterative Self-Alignment for Radiology Report GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

377

17 May 2025

Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes

396

08 May 2025

PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model

396

06 May 2025

A Survey on Progress in LLM Alignment from the Perspective of Reward Design

413

05 May 2025

Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors

348

27 Apr 2025

ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

384

23 Apr 2025

Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

389

17 Apr 2025

REWARD CONSISTENCY: Improving Multi-Objective Alignment from a Data-Centric Perspective

249

15 Apr 2025

Towards Understanding and Improving Refusal in Compressed Models via Mechanistic Interpretability

Vishnu Kabir Chhabra

Mohammad Mahdi Khalili

AI4CE

272

05 Apr 2025

Natural Language GenerationTheoretical Issues In Natural Language Processing (TINLP), 2018

Emiel van Miltenburg

Chenghua Lin

315

20 Mar 2025

BalancedDPO: Adaptive Multi-Metric Alignment

251

16 Mar 2025

UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality

530

10 Mar 2025

PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation

Yuxuan Liu

260

03 Mar 2025

Robust Multi-Objective Preference Alignment with Online DPOAAAI Conference on Artificial Intelligence (AAAI), 2025

250

01 Mar 2025

The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

287

28 Feb 2025

Societal Alignment Frameworks Can Improve LLM Alignment

...

1.1K

27 Feb 2025

Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems

450

25 Feb 2025

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

605

25 Feb 2025

Drift: Decoding-time Personalized Alignments with Implicit User Preferences

614

20 Feb 2025

Rethinking Diverse Human Preference Learning through Principal Component AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

431

18 Feb 2025

STAIR: Improving Safety Alignment with Introspective Reasoning

425

04 Feb 2025

Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond

477

19 Jan 2025

Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies

279

30 Dec 2024

Orbit: A Framework for Designing and Evaluating Multi-objective RankersInternational Conference on Intelligent User Interfaces (IUI), 2024

324

07 Nov 2024

Comparison-based Active Preference Learning for Multi-dimensional PersonalizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Minhyeon Oh

Seungjoon Lee

Jungseul Ok

345

01 Nov 2024

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

458

31 Oct 2024

L3Ms -- Lagrange Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

1.1K

28 Oct 2024

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision

Yancheng He

Bo Zheng

239

25 Oct 2024

Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization

269

25 Oct 2024

COS-DPO: Conditioned One-Shot Multi-Objective Fine-Tuning FrameworkConference on Uncertainty in Artificial Intelligence (UAI), 2024

343

10 Oct 2024

Inference-Time Language Model Alignment via Integrated Value GuidanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zhixuan Liu

Zhanhui Zhou

Yuanfu Wang

Chao Yang

Yu Qiao

179

26 Sep 2024