Skip to main content

Train a neural network

Training a Neural network means changing its Parameter aka the coefficients of the Matrix that compose the network to make it better at a specific task.

Do to this, we try to reduce the Loss function of the network. This is done by taking the derivative of the Loss function with respect to every Parameter so as it find the value of the parameters that minimises the loss.

Because f(x0+Ο΅)=f(x0)+Ο΅fβ€²(x0)+o(Ο΅)f(x_0 + \epsilon) = f(x_0) + \epsilon f'(x_0) + o(\epsilon) we know that ff gets smaller as Ο΅fβ€²(x0)\epsilon f'(x_0) decreases, so using the sign of fβ€²f', we know how to tweak x0x_0 to reduce the value of ff. In this case, ff is the loss function and x0x_0 is the current value of a parameter and Ο΅\epsilon is a tiny modification made to Ο΅\epsilon.