For the backward phase (figure 7) the neuron $j$ in the output layer calculates the error between its actual output value $o_j$, known from the forward phase, and the expected nominal target value $t_j$:

$$\delta_j = f'(\mathrm{net}_j)\,(t_j - o_j)$$

The error $\delta_j$ is propagated backwards to the previous hidden layer.
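As an illustration, the sketch below computes this output-layer error with NumPy. The sigmoid derivative $f'(\mathrm{net}_j) = o_j(1 - o_j)$ and all variable names and values are assumptions chosen for the example, not taken from the paper.

```python
import numpy as np

# Forward-phase results (illustrative values, not from the paper)
o = np.array([0.73, 0.12])   # actual output values o_j from the forward phase
t = np.array([1.00, 0.00])   # expected nominal target values t_j

# Output-layer error; the factor o * (1 - o) stands for f'(net_j)
# under the assumption of a sigmoid activation function
delta_out = (t - o) * o * (1.0 - o)
```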
The neuron $i$ in a hidden layer calculates an error $\delta_i$ that is propagated backwards again to its previous layer. For this, a column of the weight matrix is used:

$$\delta_i = f'(\mathrm{net}_i) \sum_j w_{ji}\,\delta_j$$
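A minimal sketch of this hidden-layer step follows, again assuming a sigmoid activation. Column $i$ of the weight matrix `W` holds the projective weights of hidden neuron $i$, so `W.T @ delta_out` computes $\sum_j w_{ji}\,\delta_j$ for every hidden neuron at once; the concrete numbers are illustrative.

```python
import numpy as np

delta_out = np.array([0.053, -0.013])    # errors delta_j of the output layer
o_hidden = np.array([0.41, 0.88, 0.07])  # hidden activations o_i
W = np.array([[0.2, -0.5, 0.9],          # weight matrix w_ji
              [0.7,  0.1, -0.3]])        # (receptive j x projective i)

# One column of W per hidden neuron i: sum_j w_ji * delta_j,
# scaled by f'(net_i) = o_i * (1 - o_i) for a sigmoid activation
delta_hidden = (W.T @ delta_out) * o_hidden * (1.0 - o_hidden)
```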

To minimize the error, the weights of the projective edges of neuron $i$ and the bias values in the receptive layer have to be changed. The old values are increased by:

$$\Delta w_{ji} = \eta\,\delta_j\,o_i, \qquad \Delta\theta_j = \eta\,\delta_j$$

$\eta$ is the training rate; its value is chosen empirically.
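The update can be written as a single outer product, as in the sketch below; the training rate `eta = 0.1` and the array shapes are assumed values for illustration only.

```python
import numpy as np

eta = 0.1                                # training rate (assumed value)
delta_out = np.array([0.053, -0.013])    # errors delta_j in the receptive layer
o_hidden = np.array([0.41, 0.88, 0.07])  # outputs o_i of the projective neurons
W = np.zeros((2, 3))                     # weights w_ji of the projective edges
theta = np.zeros(2)                      # bias values of the receptive layer

# Increase the old values by Delta w_ji = eta * delta_j * o_i
# and Delta theta_j = eta * delta_j:
W += eta * np.outer(delta_out, o_hidden)
theta += eta * delta_out
```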
The back-propagation algorithm minimizes the error by the method of gradient descent, where $\eta$ is the length of each step.
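Putting the pieces together, one complete gradient-descent step for a network with a single hidden layer might look as follows. The layer sizes, the sigmoid activation, the input/target data, and the step length are all assumptions made for this sketch, not specifics of the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
W1, theta1 = rng.normal(size=(3, 2)), np.zeros(3)  # input -> hidden (assumed sizes)
W2, theta2 = rng.normal(size=(2, 3)), np.zeros(2)  # hidden -> output
x = np.array([0.5, -0.3])                          # illustrative input
t = np.array([1.0, 0.0])                           # illustrative target
eta = 0.1                                          # step length of the descent

for _ in range(1000):
    # forward phase
    h = sigmoid(W1 @ x + theta1)
    o = sigmoid(W2 @ h + theta2)
    # backward phase: output-layer and hidden-layer errors
    d_out = (t - o) * o * (1.0 - o)
    d_hid = (W2.T @ d_out) * h * (1.0 - h)
    # one gradient-descent step of length eta
    W2 += eta * np.outer(d_out, h); theta2 += eta * d_out
    W1 += eta * np.outer(d_hid, x); theta1 += eta * d_hid
```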