Open shashaka opened 3 months ago
@sachinprasadhs When I install the keras from the source as github master branch, I found that this issue was reproduced. Can you check the below colab notebook?
https://colab.sandbox.google.com/gist/sachinprasadhs/e73e2c7428f44ccc0d2ef486bed047c6/20027.ipynb
Hi @shashaka, could we try and get a pared down colab of this issue? Please remove anything non-relevant to Tensorflow and to this reproduction. Please also add keras.config.disable_traceback_filtering()
so we can get a full trace error.
Here is a simplified gist (shows the error)
(With disabled traceback filtering.)
Happens both with GPU and CPU. (It does happen only some times !)
PS: This might be obvious but without the test environment there seems to be no error (gist).
I also updated my gist based on @ghsanti 's one. It seems that this error occurred when slicing the data on data generator.
https://colab.research.google.com/gist/shashaka/71e1e97d1459498c0bcca1fb4fc084d8/20027.ipynb
Thank you @shashaka and @ghsanti, unless this shows up in our own testing environment (internally/github CI) we are unlikely to have the bandwidth to dive deeper into what is happening since this might be environment specific. If you're taking a closer look and find the code pointer responsible we'd be happy to support any PR's. Leaving open for now!
On keras/src/trainers/data_adapters/generator_data_adapter_test.py, I found that there is intermittent test case failed on tensorflow gpu env. This is related in test_basic_flow method on this test case, so, I made test code for this on my local side.
And I got an error as below, many of running was succeeded, however, some are failed.
So, is there anyone can confirm whether this is the bug or not??