Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering

IEEE Transactions on Automatic Control (IEEE TAC), 2021

4 November 2021

Abstract

We investigate the use of extended Kalman filtering to train recurrent neural networks for data-driven nonlinear, possibly adaptive, model-based control design. We show that the approach can be applied to rather arbitrary convex loss functions and regularization terms on the network parameters. We show that the learning method outperforms stochastic gradient descent in a nonlinear system identification benchmark and in training a linear system with binary outputs. We also explore the use of the algorithm in data-driven nonlinear model predictive control and its relation with disturbance models for offset-free tracking.

View on arXiv

Comments on this paper