Deep Q-Networks for Accelerating the Training of Deep Neural Networks

5 June 2016

Jie Fu

Zichuan Lin

Danlu Chen

Miao Liu

Nicholas Leonard

Jiashi Feng

Tat-Seng Chua

AI4CE

ArXiv (abs)PDF HTML

Abstract

We present a method, codenamed QAN, for improving the generalization ability of a deep neural network (DNN). It achieves this by using a deep Q-network (DQN) to learn policies to accelerate the training of the DNN across episodes. The state features of the DQN are learned from the weight statistics of the DNN during training. The reward function of this DQN is designed to learn policies to minimize the training time needed by that DNN. The actions of the DQN correspond to some optimization choices during training. The code can be downloaded from https://github.com/bigaidream-projects/qan

View on arXiv

Comments on this paper