Closed: asapsmc closed this issue 1 year ago
TensorFlow Addons is transitioning to a minimal maintenance and release mode. New features will not be added to this repository. For more information, please see our public messaging on this decision: TensorFlow Addons Wind Down
Please consider sending feature requests / contributions to other repositories in the TF community with a similar charter to TFA: Keras, Keras-CV, Keras-NLP
System information
Describe the bug
I'm using a 3rd-party pre-trained model (a TCN-based network), which I want to fine-tune on different files. (Thus, I can't provide reproducible code, as the code needs the pre-trained model.)
I'm doing this on an M1 Mac, and due to several installation errors (see) and problems using some advanced optimizers (e.g. Lookahead), I started using the
with tf.device('/cpu:0'):
instruction before interacting with the model (e.g. load, compile and fit); otherwise I'd get errors related to attempting to use the GPU. Recently, I realised that if I use this instruction when fitting, I get completely different results, and I can't figure out why. This is the common code:
In the end, if I do:
I get the following training loss:
On the contrary, if I do:
I get the very different training loss:
Any idea on what may be causing this issue?
Note: I've also tried with the same models, but with the simple Adam optimizer, and this behaviour did not occur.
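To narrow down where the two runs start to differ, it may help to compare the per-epoch training loss from each run (for example, the `history.history['loss']` list returned by Keras `fit`). The snippet below is only a minimal sketch: `first_divergence` is a hypothetical helper, the tolerances are arbitrary assumptions, and the loss values are made up for illustration rather than taken from this issue.

```python
import math


def first_divergence(losses_a, losses_b, rel_tol=1e-3, abs_tol=1e-6):
    """Return the index of the first epoch where two loss curves disagree.

    Hypothetical helper (not part of TensorFlow or TensorFlow Addons).
    Returns None if every compared epoch agrees within the tolerances.
    If the lists differ in length, only the shared prefix is compared.
    """
    for epoch, (a, b) in enumerate(zip(losses_a, losses_b)):
        if not math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol):
            return epoch
    return None


# Made-up numbers purely for illustration; NOT the losses from this issue.
cpu_losses = [0.90, 0.70, 0.55, 0.45]      # run with fit inside tf.device('/cpu:0')
default_losses = [0.90, 0.70, 0.80, 1.20]  # run with fit outside the device scope

print(first_divergence(cpu_losses, default_losses))
```

If the curves already differ at epoch 0, the initial state (weights, seeds, data order) is a more likely suspect than the optimizer; if they agree at first and drift apart later, the optimizer's update step on the other device would be the next thing to check.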