Deep Learning using Rectified Linear Units (ReLU)

Abstract
We introduce the use of rectified linear units (ReLU) as the classification function in a deep neural network (DNN). Conventionally, ReLU is used as an activation function in DNNs, with the Softmax function as their classification function. However, there have been several studies on using a classification function other than Softmax, and this study is an addition to those. We accomplish this by taking the activation of the penultimate layer $h_{n-1}$ in a neural network, then multiplying it by weight parameters $\theta$ to get the raw scores $o_i$. Afterwards, we threshold the raw scores $o_i$ by $0$, i.e. $f(o) = \max(0, o_i)$, where $f(o)$ is the ReLU function. We provide class predictions $\hat{y}$ through the argmax function, i.e. $\arg\max f(o)$.
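As a rough illustration of the scheme described in the abstract, the sketch below shows the final classification step in NumPy: raw scores are computed from the penultimate-layer activation, thresholded with ReLU, and the argmax is taken as the predicted class. The array shapes, variable names, and random initialisation are illustrative assumptions, not the paper's code.

```python
import numpy as np

# Illustrative shapes (assumptions): a batch of 4 examples,
# a penultimate layer of width 64, and 10 output classes.
rng = np.random.default_rng(0)
h_penultimate = rng.standard_normal((4, 64))   # h_{n-1}: penultimate activations
theta = rng.standard_normal((64, 10))          # weight parameters of the last layer

# Raw scores: o = h_{n-1} @ theta
raw_scores = h_penultimate @ theta

# ReLU used as the classification function: f(o) = max(0, o)
relu_scores = np.maximum(0.0, raw_scores)

# Class predictions: y_hat = argmax f(o)
y_hat = relu_scores.argmax(axis=1)
print(y_hat)  # four predicted class indices
```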