ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.20292
77
18
v1v2 (latest)

An analytic theory of creativity in convolutional diffusion models

28 December 2024
Mason Kamb
Surya Ganguli
    DiffM
ArXiv (abs)PDFHTML
Abstract

We obtain an analytic, interpretable and predictive theory of creativity in convolutional diffusion models. Indeed, score-matching diffusion models can generate highly original images that lie far from their training data. However, optimal score-matching theory suggests that these models should only be able to produce memorized training examples. To reconcile this theory-experiment gap, we identify two simple inductive biases, locality and equivariance, that: (1) induce a form of combinatorial creativity by preventing optimal score-matching; (2) result in fully analytic, completely mechanistically interpretable, local score (LS) and equivariant local score (ELS) machines that, (3) after calibrating a single time-dependent hyperparameter can quantitatively predict the outputs of trained convolution only diffusion models (like ResNets and UNets) with high accuracy (median r2r^2r2 of 0.95,0.94,0.94,0.960.95, 0.94, 0.94, 0.960.95,0.94,0.94,0.96 for our top model on CIFAR10, FashionMNIST, MNIST, and CelebA). Our model reveals a locally consistent patch mosaic mechanism of creativity, in which diffusion models create exponentially many novel images by mixing and matching different local training set patches at different scales and image locations. Our theory also partially predicts the outputs of pre-trained self-attention enabled UNets (median r2∼0.77r^2 \sim 0.77r2∼0.77 on CIFAR10), revealing an intriguing role for attention in carving out semantic coherence from local patch mosaics.

View on arXiv
@article{kamb2025_2412.20292,
  title={ An analytic theory of creativity in convolutional diffusion models },
  author={ Mason Kamb and Surya Ganguli },
  journal={arXiv preprint arXiv:2412.20292},
  year={ 2025 }
}
Comments on this paper