Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers

Main: 8 Pages
12 Figures
Bibliography: 4 Pages
12 Tables
Appendix: 11 Pages

Abstract

Video diffusion transformers (vDiTs) have made impressive progress in text-to-video generation, but their high computational demands present major challenges for practical deployment. While existing acceleration methods reduce workload at various granularities, they often rely on heuristics, limiting their applicability.
