Closed gombru closed 2 years ago
Note that the numerical tolerance TF/Keras usually uses is 1e-6, which is larger than the difference observed here. The difference comes from lower-level ops and hardware. You might see a bigger difference if you run it on a GPU.
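To illustrate why results like these are compared with a tolerance rather than with `==`, here is a minimal sketch in plain Python. It is not the Keras implementation: the `dot` helper and the sample values are hypothetical, and reversing the accumulation order only stands in for the different reduction orders that lower-level kernels may choose for different batch sizes or hardware.

```python
import math

def dot(xs, ws):
    """Accumulate the products in the order given (hypothetical helper)."""
    acc = 0.0
    for x, w in zip(xs, ws):
        acc += x * w
    return acc

# Illustrative values only; they are not taken from the original report.
inputs = [0.1, 0.2, 0.3]
weights = [1.0, 1.0, 1.0]

# Same mathematical sum, accumulated in two different orders.
forward = dot(inputs, weights)
backward = dot(inputs[::-1], weights[::-1])

# Exact comparison fails because floating-point addition is not associative.
print(forward == backward)  # False

# Comparison against the 1e-6 tolerance mentioned above succeeds.
print(math.isclose(forward, backward, rel_tol=0.0, abs_tol=1e-6))  # True
```

For array outputs such as the ones a Dense layer returns, the same check is usually written with `numpy.allclose(a, b, atol=1e-6)` instead of an exact equality test.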
I am going to close this issue since this is working as intended and there is nothing we need to address here.
I've noticed that a Dense layer gives different results at inference time depending on whether the inputs are passed as a batch or one at a time, as in this simple example:
Colab reproducing the error.
In some runs the outputs do not match.
This was tested with Keras 2.8.0.