Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers
- VGen
Main:8 Pages
12 Figures
Bibliography:4 Pages
12 Tables
Appendix:11 Pages
Abstract
Video diffusion transformers (vDiTs) have made impressive progress in text-to-video generation, but their high computational demands present major challenges for practical deployment. While existing acceleration methods reduce workload at various granularities, they often rely on heuristics, limiting their applicability.
View on arXivComments on this paper
