Closed jonbry closed 6 months ago
Any advice on what may be causing the issue? Let me know if there is any information that I can provide to help troubleshoot the issue.
This code is known to work, so it's likely a bad initialization. Some common steps you can take:
@fchollet I am able to reproduce this. I haven't had a chance to dig into the root cause yet, but I can confirm that this is a bug in Keras 3; the same code produces a model that trains just fine w/ TF 2.15 + Keras 2.
Looking into it.
I have fixed a related issue with dataset shuffling. Can you try installing v3.3.3 and checking if your code works with that version?
Thanks! Looks like it's fixed now. I can confirm the model trains fine with Keras v3.3.3
Looks like v3.3.3 fixed the issue. Thanks for all of your help!
By the way, just noticed that github release tagged v3.3.3 has a typo in the title (Kears
vs Keras
): Kears 3.3.3
Maybe this is the reason v3.3.2 is still listed as the "latest release" on the repo landing page?
@t-kalinowski , I just updated the latest release tag in the landing page
The following code from Deep Learning with Python, Second Edition fails to train when using Keras 3.3.2 and TensorFlow 2.16.1 on a Linux machine (Ubuntu 20.04):
The accuracy over 100 epochs hovers around 50%:
The same results were reproduced with different linux machines, regardless whether it was run on the GPU or CPU, as well as using a JAX backend
What is strange about this issue is that trains successfully with the following configurations:
Any advice on what may be causing the issue? Let me know if there is any information that I can provide to help troubleshoot the issue.
Thank you!