A flow matching model learns a time-dependent vector field that generates a probability path that interpolates between a well-known noise distribution () and the data distribution (). It can be distilled into a two-timed flow model (TTFM) that can transform a sample belonging to the distribution at an initial time to another belonging to the distribution at a terminal time in one function evaluation. We present a new loss function for TTFM distillation called the \emph{initial/terminal velocity matching} (ITVM) loss that extends the Lagrangian Flow Map Distillation (LFMD) loss proposed by Boffi et al. by adding redundant terms to match the initial velocities at time , removing the derivative from the terminal velocity term at time , and using a version of the model under training, stabilized by exponential moving averaging (EMA), to compute the target terminal average velocity. Preliminary experiments show that our loss leads to better few-step generation performance on multiple types of datasets and model architectures over baselines.
View on arXiv@article{khungurn2025_2505.01169, title={ Distilling Two-Timed Flow Models by Separately Matching Initial and Terminal Velocities }, author={ Pramook Khungurn and Pratch Piyawongwisal and Sira Sriswasdi and Supasorn Suwajanakorn }, journal={arXiv preprint arXiv:2505.01169}, year={ 2025 } }