When it creates a tensor, it always puts it on the CPU. Although I can copy it to the GPU, there is still a foreboding warning that changes to that TF_Tensor are not reflected on the actual device. So my tensor claims it was copied to the GPU, but it still performs worse than the CPU. Even Eigen is way faster than this thing.