29
0

Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies

Shalini Sarode
Muhammad Saif Ullah Khan
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
Abstract

We propose ClassroomKD, a novel multi-mentor knowledge distillation framework inspired by classroom environments to enhance knowledge transfer between the student and multiple mentors with different knowledge levels. Unlike traditional methods that rely on fixed mentor-student relationships, our framework dynamically selects and adapts the teaching strategies of diverse mentors based on their effectiveness for each data sample. ClassroomKD comprises two main modules: the Knowledge Filtering (KF) module and the Mentoring module. The KF Module dynamically ranks mentors based on their performance for each input, activating only high-quality mentors to minimize error accumulation and prevent information loss. The Mentoring Module adjusts the distillation strategy by tuning each mentor's influence according to the dynamic performance gap between the student and mentors, effectively modulating the learning pace. Extensive experiments on image classification (CIFAR-100 and ImageNet) and 2D human pose estimation (COCO Keypoints and MPII Human Pose) demonstrate that ClassroomKD outperforms existing knowledge distillation methods for different network architectures. Our results highlight that a dynamic and adaptive approach to mentor selection and guidance leads to more effective knowledge transfer, paving the way for enhanced model performance through distillation.

View on arXiv
@article{sarode2025_2409.20237,
  title={ Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies },
  author={ Shalini Sarode and Muhammad Saif Ullah Khan and Tahira Shehzadi and Didier Stricker and Muhammad Zeshan Afzal },
  journal={arXiv preprint arXiv:2409.20237},
  year={ 2025 }
}
Comments on this paper