Open phgilde opened 4 years ago
Thank you for reporting this, @phgilde. Are you running this script on Windows or WSL?
@PatriceVignola I'm running this on Windows.
We've implemented the single-step/block-based LSTM/GRU/RNN ops, but these are really better suited to CPU architectures. Models typically use the multi-step cuDNN ops when executing on a GPU device. It's not surprising that there's some more work to do here to make DML perform better with recurrent networks.
@phgilde What GPU are you running this with? You mentioned standard tensorflow and that your config uses Intel HD graphics. Is this training script running on the CPU?
I've had the same issue on an RX 560. In Task Manager, neither the GPU nor the CPU seems to take on any load.
I have the same problem with an AMD 4750U APU; the GPU load doesn't even reach 1-2%.
@onurberkay What does tf.config.list_physical_devices() give you?
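For anyone else checking this, a minimal sketch of how the device list could be printed. The helper name describe_devices is hypothetical, and the fallback to tf.config.experimental.list_physical_devices is an assumption for older TensorFlow builds that may not expose the non-experimental name. The device types you see depend on your install.

```python
def describe_devices():
    """Return (name, device_type) pairs for the devices TensorFlow can see.

    Returns None if TensorFlow is not installed in this environment.
    describe_devices is a hypothetical helper, not part of TensorFlow.
    """
    try:
        import tensorflow as tf
    except ImportError:
        return None

    # Assumption: older builds may only provide the experimental variant.
    list_fn = getattr(tf.config, "list_physical_devices", None)
    if list_fn is None:
        list_fn = tf.config.experimental.list_physical_devices

    return [(device.name, device.device_type) for device in list_fn()]


devices = describe_devices()
if devices is None:
    print("TensorFlow is not installed in this environment")
else:
    for name, device_type in devices:
        print(device_type, name)
```

If only CPU entries show up, the training script is most likely not running on the GPU at all.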
This training loop takes more than a second per epoch using tensorflow-directml but a fraction of a second with standard tensorflow. It actually doesn't work at all (the error is NaN after a couple of iterations), but I already opened another issue for that.
Code:
System: Intel i5-7200U with Intel HD graphics 620