Backbone Augmented Training for Adaptations
Adaptations facilitate efficient training of large backbone models, including diffusion models for image generation and transformer-based language models. While various adaptation techniques enhance performance with minimal computational resources, limited adaptation data often leads to challenges in training. To address this, we focus on the enormous amount of backbone data used to pre-train the backbone models. We propose Backbone Augmented Training (BAT), a method that leverages backbone data to augment the adaptation dataset. First, we formulate and prove two mathematical key propositions: one establishes the validity of BAT, while the other identifies a condition under which BAT benefits adaptation. Furthermore, we introduce an advanced data selection scheme that satisfies these propositions and present ALBAT algorithm to implement this approach. ALBAT efficiently enhances adaptation training in both personalization and language generation tasks with scarce data.
View on arXiv@article{park2025_2506.04288, title={ Backbone Augmented Training for Adaptations }, author={ Jae Wan Park and Junhyeok Kim and Youngjun Jun and Hyunah Ko and Seong Jae Hwang }, journal={arXiv preprint arXiv:2506.04288}, year={ 2025 } }