513
v1v2 (latest)

Distilling Two-Timed Flow Models by Separately Matching Initial and Terminal Velocities

Main:8 Pages
10 Figures
Bibliography:3 Pages
6 Tables
Appendix:20 Pages
Abstract

A flow matching model learns a time-dependent vector field vt(x)v_t(x) that generates a probability path {pt}0t1\{ p_t \}_{0 \leq t \leq 1} that interpolates between a well-known noise distribution (p0p_0) and the data distribution (p1p_1). It can be distilled into a two-timed flow model (TTFM) ϕs,x(t)\phi_{s,x}(t) that can transform a sample belonging to the distribution at an initial time ss to another belonging to the distribution at a terminal time tt in one function evaluation. We present a new loss function for TTFM distillation called the \emph{initial/terminal velocity matching} (ITVM) loss that extends the Lagrangian Flow Map Distillation (LFMD) loss proposed by Boffi et al. by adding redundant terms to match the initial velocities at time ss, removing the derivative from the terminal velocity term at time tt, and using a version of the model under training, stabilized by exponential moving averaging (EMA), to compute the target terminal average velocity. Preliminary experiments show that our loss leads to better few-step generation performance on multiple types of datasets and model architectures over baselines.

View on arXiv
Comments on this paper