Closed: sarah-allec closed this 2 weeks ago
Thanks for bringing this up! We could implement this as a utility function in neurobayes/utils that monitors the DNN training loss over a set number of epochs and issues a warning when the loss plateaus prematurely. The warning could include a suggestion to try increasing the batch size, adjusting the MAP sigma value, or modifying the learning rate (the first two are the most frequent culprits in my experience).
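A minimal sketch of what such a helper might look like, assuming it is called once per epoch with the running loss history. The name `detect_loss_plateau` and the `patience`/`rel_tol`/`min_epochs` thresholds are illustrative choices, not part of the existing NeuroBayes API:

```python
import warnings
from typing import Sequence


def detect_loss_plateau(
    loss_history: Sequence[float],
    patience: int = 20,
    rel_tol: float = 1e-3,
    min_epochs: int = 50,
) -> bool:
    """Warn if the DNN training loss appears to have plateaued prematurely.

    A plateau is flagged when the relative improvement of the loss over the
    last `patience` epochs falls below `rel_tol` while training is still in
    its early phase (fewer than `min_epochs` epochs completed).
    """
    epoch = len(loss_history)
    # Need enough history to compare, and only warn early in training.
    if epoch < patience or epoch >= min_epochs:
        return False

    old, new = loss_history[-patience], loss_history[-1]
    # Relative improvement over the monitoring window (guard against old == 0).
    rel_improvement = (old - new) / max(abs(old), 1e-12)

    if rel_improvement < rel_tol:
        warnings.warn(
            f"DNN training loss appears to have plateaued after {epoch} epochs "
            f"(relative improvement {rel_improvement:.2e} over the last "
            f"{patience} epochs). Consider increasing the batch size, "
            "adjusting the MAP sigma value, or modifying the learning rate.",
            UserWarning,
        )
        return True
    return False
```

Calling this once per epoch adds negligible overhead; the default thresholds would need tuning against the datasets where the issue was observed.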
Would you be open to submitting a PR with such a utility function?
Yes, will do!
I've encountered one dataset for which mini-batching during the deterministic neural network (DNN) training of a partial Bayesian neural network (PBNN) causes the DNN training loss to stagnate too quickly, resulting in significantly lower performance than that of non-Bayesian machine learning models. Increasing the batch size (or disabling batching altogether) alleviated the issue. It would be helpful if there were a check during DNN training that triggers a warning when the training loss stagnates prematurely, with a suggestion to increase the batch size as a possible remedy.