
Provable Accelerated Bayesian Optimization with Knowledge Transfer

Main: 12 pages · 3 figures · 1 table · Bibliography: 3 pages · Appendix: 9 pages
Abstract

We study how Bayesian optimization (BO) can be accelerated on a target task with historical knowledge transferred from related source tasks. Existing works on BO with knowledge transfer either do not have theoretical guarantees or achieve the same regret as BO in the non-transfer setting, $\tilde{\mathcal{O}}(\sqrt{T \gamma_f})$, where $T$ is the number of evaluations of the target function and $\gamma_f$ denotes its information gain. In this paper, we propose the DeltaBO algorithm, in which a novel uncertainty-quantification approach is built on the difference function $\delta$ between the source and target functions, which are allowed to belong to different reproducing kernel Hilbert spaces (RKHSs). Under mild assumptions, we prove that the regret of DeltaBO is of order $\tilde{\mathcal{O}}(\sqrt{T(T/N + \gamma_\delta)})$, where $N$ denotes the number of evaluations from source tasks and typically $N \gg T$. In many applications, source and target tasks are similar, which implies that $\gamma_\delta$ can be much smaller than $\gamma_f$. Empirical studies on both real-world hyperparameter tuning tasks and synthetic functions show that DeltaBO outperforms other baseline methods and support our theoretical claims.
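To make the high-level idea concrete, the sketch below illustrates the kind of scheme the abstract describes: rather than placing a surrogate model on the target function $f$ directly, fit a Gaussian-process surrogate to the difference function $\delta(x) = f(x) - f_{\text{source}}(x)$, reuse an estimate of the source function built from its many ($N \gg T$) evaluations, and run a UCB-style loop on the combined prediction. This is only a minimal illustration under assumed choices (RBF kernel, a simple exploration constant, a 1-D toy problem), not the paper's exact DeltaBO algorithm or its uncertainty-quantification construction.

```python
# Minimal sketch (assumptions, not the paper's exact DeltaBO): model the
# difference delta(x) = f_target(x) - f_source(x) with a GP and optimize a
# transfer-aware UCB built from mu_source + posterior of delta.
import numpy as np

def rbf(A, B, ls=0.2):
    """RBF kernel between two sets of 1-D points."""
    d = A.reshape(-1, 1) - B.reshape(1, -1)
    return np.exp(-0.5 * (d / ls) ** 2)

def gp_posterior(X, y, Xq, noise=1e-3):
    """Standard GP posterior mean/std at query points Xq given data (X, y)."""
    K = rbf(X, X) + noise * np.eye(len(X))
    Kq = rbf(X, Xq)
    alpha = np.linalg.solve(K, y)
    mu = Kq.T @ alpha
    var = 1.0 - np.sum(Kq * np.linalg.solve(K, Kq), axis=0)
    return mu, np.sqrt(np.clip(var, 1e-12, None))

# Toy source/target tasks: the target is a small perturbation of the source,
# so the difference function is simple (informally, small gamma_delta).
def f_source(x): return np.sin(3 * x)
def f_target(x): return np.sin(3 * x) + 0.3 * x   # delta(x) = 0.3 * x

grid = np.linspace(0.0, 1.0, 200)

# Step 1: estimate the source function from abundant source data (N >> T).
Xn = np.random.rand(500)
mu_src, _ = gp_posterior(Xn, f_source(Xn), grid)

# Step 2: BO loop on the target; the GP is fit only to the residuals
# y_t - mu_source(x_t), i.e. to observations of the difference function.
X_t, y_t = [np.array([0.5])], [f_target(np.array([0.5]))]
for t in range(15):
    X, y = np.concatenate(X_t), np.concatenate(y_t)
    resid = y - np.interp(X, grid, mu_src)          # observations of delta
    mu_d, sd_d = gp_posterior(X, resid, grid)
    ucb = mu_src + mu_d + 2.0 * sd_d                # transfer-aware UCB
    x_next = grid[np.argmax(ucb)]
    X_t.append(np.array([x_next]))
    y_t.append(f_target(np.array([x_next])))

print("best target value found:", max(np.concatenate(y_t)))
```

Because the surrogate only has to learn the (presumably simpler) difference function, its uncertainty shrinks faster than a GP fit to the target from scratch, which is the intuition behind replacing $\gamma_f$ with $\gamma_\delta$ in the regret bound.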
