v1v2 (latest)

Rethinking Direct Preference Optimization in Diffusion Models

24 May 2025

Papers citing "Rethinking Direct Preference Optimization in Diffusion Models"

33 / 33 papers shown

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse RewardsComputer Vision and Pattern Recognition (CVPR), 2025

595

14 Mar 2025

RainbowPO: A Unified Framework for Combining Improvements in Preference OptimizationInternational Conference on Learning Representations (ICLR), 2024

Sambit Sahu

389

05 Oct 2024

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

323

10 Jun 2024

SimPO: Simple Preference Optimization with a Reference-Free RewardNeural Information Processing Systems (NeurIPS), 2024

Yu Meng

Mengzhou Xia

Danqi Chen

542

785

23 May 2024

Learn Your Reference Model for Real Good Alignment

570

15 Apr 2024

Aligning Diffusion Models by Optimizing Human Utility

Shufan Li

Konstantinos Kallidromitis

Akash Gokul

Yusuke Kato

Kazuki Kozuka

308

06 Apr 2024

ORPO: Monolithic Preference Optimization without Reference ModelConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

702

446

12 Mar 2024

A Dense Reward View on Aligning Text-to-Image Diffusion with Preference

Shentao Yang

Tianqi Chen

Mingyuan Zhou

EGVM

348

13 Feb 2024

KTO: Model Alignment as Prospect Theoretic Optimization

Kawin Ethayarajh

Winnie Xu

Niklas Muennighoff

Dan Jurafsky

Douwe Kiela

840

834

02 Feb 2024

Using Human Feedback to Fine-tune Diffusion Models without Any Reward ModelComputer Vision and Pattern Recognition (CVPR), 2023

493

188

22 Nov 2023

Diffusion Model Alignment Using Direct Preference OptimizationComputer Vision and Pattern Recognition (CVPR), 2023

450

516

21 Nov 2023

A General Theoretical Paradigm to Understand Learning from Human PreferencesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Bilal Piot

Daniele Calandriello

615

845

18 Oct 2023

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

468

207

05 Oct 2023

Directly Fine-Tuning Diffusion Models on Differentiable RewardsInternational Conference on Learning Representations (ICLR), 2023

Amita Gajewar

Paul Vicol

G. Bansal

David J Fleet

272

303

29 Sep 2023

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Keqiang Sun

294

567

15 Jun 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward ModelNeural Information Processing Systems (NeurIPS), 2023

Christopher D. Manning

Chelsea Finn

ALM

906

6,769

29 May 2023

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023

Pieter Abbeel

418

287

25 May 2023

Training Diffusion Models with Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

590

640

22 May 2023

Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image GenerationNeural Information Processing Systems (NeurIPS), 2023

977

703

02 May 2023

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and EditingIEEE International Conference on Computer Vision (ICCV), 2023

Ying Shan

232

680

17 Apr 2023

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image GenerationNeural Information Processing Systems (NeurIPS), 2023

Xiao Liu

Yuxiao Dong

570

745

12 Apr 2023

Efficient Diffusion Training via Min-SNR Weighting StrategyIEEE International Conference on Computer Vision (ICCV), 2023

Jianmin Bao

306

221

16 Mar 2023

Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later

Binxu Wang

John J. Vastola

DiffM

501

04 Mar 2023

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

...

641

984

02 Nov 2022

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

...

725

1,367

22 Jun 2022

Photorealistic Text-to-Image Diffusion Models with Deep Language UnderstandingNeural Information Processing Systems (NeurIPS), 2022

...

Raphael Gontijo-Lopes

David J Fleet

1.2K

7,527

23 May 2022

Perception Prioritized Training of Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022

299

327

01 Apr 2022

Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022

Carroll L. Wainwright

...

2.1K

17,754

04 Mar 2022

High-Resolution Image Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2021

3.0K

21,220

20 Dec 2021

Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021

...

2.0K

41,575

26 Feb 2021

Score-Based Generative Modeling through Stochastic Differential EquationsInternational Conference on Learning Representations (ICLR), 2020

Yang Song

Jascha Narain Sohl-Dickstein

2.2K

8,952

26 Nov 2020

Denoising Diffusion Probabilistic Models

Jonathan Ho

Ajay Jain

Pieter Abbeel

DiffM

5.1K

26,105

19 Jun 2020

Generative Modeling by Estimating Gradients of the Data DistributionNeural Information Processing Systems (NeurIPS), 2019

Yang Song

Stefano Ermon

SyDa DiffM

804

4,884

12 Jul 2019