A History of Meta-gradient: Gradient Methods for Meta-learning

Abstract
The history of meta-learning methods based on gradient descent is reviewed, focusing primarily on methods that adapt step-size (learning rate) meta-parameters.
View on arXivComments on this paper