Assume is a weight in one of the MLP networks and we have evaluated the partial derivatives and . The variance is easy to update with a fixed point update rule derived by setting the derivative to zero
By looking at the form of the cost function for Gaussian terms, we can find an approximation for the second derivatives with respect to the means as [34]
There are some minor corrections to these update rules as explained in [34].