Closed issue (frolovsa closed this 1 year ago)
Currently, weight decay penalizes the distance of the weights from zero:
loss = ||y - f(x)|| + lambda * ||w - 0||
For sequential training, we instead want to penalize the departure of the weights from their initial values w0:
loss = ||y - f(x)|| + lambda * ||w - w0||
This can be implemented in the training step by modifying the loss computation. Here are a few examples:
https://stackoverflow.com/questions/65998695/how-to-add-a-l1-or-l2-regularization-to-weights-in-pytorch
https://discuss.pytorch.org/t/how-to-implement-custom-regularization-losses-on-the-weights/2646/3
https://discuss.pytorch.org/t/l1-regularization-for-a-single-matrix/28088
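Following the approach in those links, a minimal sketch of the idea in PyTorch (model, data, and lambda value here are placeholders, not from the issue): snapshot the parameters as w0 before training, then add lambda * ||w - w0||^2 to the loss instead of relying on the optimizer's built-in weight_decay.

```python
import torch
import torch.nn as nn

# Toy model for illustration; any nn.Module works the same way.
model = nn.Linear(4, 1)

# Snapshot the initial weights w0 (detached clones so they stay fixed
# and receive no gradients).
w0 = {name: p.detach().clone() for name, p in model.named_parameters()}

lam = 0.01  # regularization strength (lambda), a placeholder value

def loss_fn(pred, target):
    data_loss = nn.functional.mse_loss(pred, target)
    # Penalize departure from the initial weights: ||w - w0||^2.
    # At initialization this term is exactly zero, since w == w0.
    reg = sum(((p - w0[name]) ** 2).sum()
              for name, p in model.named_parameters())
    return data_loss + lam * reg

x = torch.randn(8, 4)
y = torch.randn(8, 1)
# Note: weight_decay is left at 0 here, since the penalty toward w0
# replaces the usual decay toward zero.
opt = torch.optim.SGD(model.parameters(), lr=0.1)

loss = loss_fn(model(x), y)
opt.zero_grad()
loss.backward()
opt.step()
```

One caveat: if the optimizer's own weight_decay were also enabled, the two penalties would pull the weights toward zero and toward w0 simultaneously, so only the explicit penalty should be used.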