13
0

SketchColour: Channel Concat Guided DiT-based Sketch-to-Colour Pipeline for 2D Animation

Bryan Constantine Sadihin
Michael Hua Wang
Shei Pern Chua
Hang Su
Main:6 Pages
5 Figures
Bibliography:2 Pages
1 Tables
Abstract

The production of high-quality 2D animation is highly labor-intensive process, as animators are currently required to draw and color a large number of frames by hand. We present SketchColour, the first sketch-to-colour pipeline for 2D animation built on a diffusion transformer (DiT) backbone. By replacing the conventional U-Net denoiser with a DiT-style architecture and injecting sketch information via lightweight channel-concatenation adapters accompanied with LoRA finetuning, our method natively integrates conditioning without the parameter and memory bloat of a duplicated ControlNet, greatly reducing parameter count and GPU memory usage. Evaluated on the SAKUGA dataset, SketchColour outperforms previous state-of-the-art video colourization methods across all metrics, despite using only half the training data of competing models. Our approach produces temporally coherent animations with minimal artifacts such as colour bleeding or object deformation. Our code is available at:this https URL.

View on arXiv
@article{sadihin2025_2507.01586,
  title={ SketchColour: Channel Concat Guided DiT-based Sketch-to-Colour Pipeline for 2D Animation },
  author={ Bryan Constantine Sadihin and Michael Hua Wang and Shei Pern Chua and Hang Su },
  journal={arXiv preprint arXiv:2507.01586},
  year={ 2025 }
}
Comments on this paper