158

Forgetting of task-specific knowledge in model merging-based continual learning

Main:4 Pages
3 Figures
Bibliography:2 Pages
3 Tables
Appendix:7 Pages
Abstract

This paper investigates the linear merging of models in the context of continual learning (CL). Using controlled visual cues in computer vision experiments, we demonstrate that merging largely preserves or enhances shared knowledge, while unshared task-specific knowledge rapidly degrades. We further find that merging models from an incremental training process consistently outperforms merging models trained in parallel.

View on arXiv
Comments on this paper