
Prototype-Guided Diffusion: Visual Conditioning without External Memory

Main: 4 pages, 3 figures, 2 tables; Bibliography: 2 pages
Abstract

Diffusion models achieve state-of-the-art image generation but remain computationally costly due to iterative denoising. Latent-space models such as Stable Diffusion reduce overhead yet lose fine detail, while retrieval-augmented methods improve efficiency but rely on large memory banks, static similarity models, and rigid retrieval infrastructure. We introduce the Prototype Diffusion Model (PDM), which embeds prototype learning into the diffusion process to provide adaptive, memory-free conditioning. Instead of retrieving references, PDM learns compact visual prototypes from clean features via contrastive learning, then aligns noisy representations with semantically relevant patterns during denoising. Experiments demonstrate that PDM sustains high generation quality while lowering computational and storage costs, offering a scalable alternative to retrieval-based conditioning.
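As a rough illustration of the conditioning scheme the abstract describes, the sketch below implements one plausible reading in PyTorch: a bank of learnable prototypes trained with a contrastive (InfoNCE-style) objective on clean features, plus a soft-attention lookup that maps noisy features to a prototype mixture usable as a conditioning signal during denoising. The class name, prototype count, temperature, and hard-assignment pseudo-labeling are all assumptions for illustration, not the paper's actual implementation.

```python
import torch
import torch.nn.functional as F
from torch import nn


class PrototypeBank(nn.Module):
    """Hypothetical prototype conditioning module (a sketch, not PDM's exact design)."""

    def __init__(self, num_prototypes: int = 64, dim: int = 256, temperature: float = 0.1):
        super().__init__()
        # Compact set of learnable visual prototypes, trained end to end.
        self.prototypes = nn.Parameter(torch.randn(num_prototypes, dim))
        self.temperature = temperature

    def contrastive_loss(self, clean_feats: torch.Tensor) -> torch.Tensor:
        """Pull each clean feature toward its nearest prototype, push from the rest.

        clean_feats: (B, dim) features extracted from clean images.
        Hard nearest-prototype assignment as the pseudo-label is an assumption here.
        """
        z = F.normalize(clean_feats, dim=-1)
        p = F.normalize(self.prototypes, dim=-1)
        logits = z @ p.t() / self.temperature        # (B, K) similarity logits
        targets = logits.argmax(dim=-1).detach()     # nearest prototype as pseudo-label
        return F.cross_entropy(logits, targets)      # InfoNCE-style objective

    def condition(self, noisy_feats: torch.Tensor) -> torch.Tensor:
        """Soft-attend noisy features over the prototypes during denoising.

        Returns a (B, dim) prototype mixture that a denoiser could consume,
        e.g. via cross-attention or adaptive normalization.
        """
        z = F.normalize(noisy_feats, dim=-1)
        p = F.normalize(self.prototypes, dim=-1)
        attn = F.softmax(z @ p.t() / self.temperature, dim=-1)  # (B, K) weights
        return attn @ self.prototypes                            # (B, dim) conditioning


# Example: train prototypes on clean features, condition on noisy ones.
bank = PrototypeBank(num_prototypes=64, dim=256)
clean = torch.randn(8, 256)   # stand-in for clean encoder features
noisy = torch.randn(8, 256)   # stand-in for features of a noised latent
loss = bank.contrastive_loss(clean)
cond = bank.condition(noisy)  # fed to the denoiser in place of retrieved references
```

Because the prototypes replace an external memory bank, storage is fixed at K x dim parameters regardless of dataset size, which is the efficiency argument the abstract makes against retrieval-based conditioning.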
