Open QueensGambit opened 3 years ago
I was able to link the binary to cudnn_cnn_infer64_8.dll
but this didn't seem to help unfortunately.
Also adding certain optimization options such as /O2 (Maximize Speed), /GL (Whole program optimization), /LTCG (Link-time code generation) didn't result in a NPS improvement.
The library
cudnn_cnn_infer64_8.dll
is not used on Windows, butlibcudnn_cnn_infer.so.8
is used on Linux. This seems to make a visible NPS difference.e.g. Ubuntu 18.04:
GPU: RTX 2070 OC
GPU-Utility: 91%
e.g. Windows 10:
GPU: RTX 2070 OC
GPU-Utility: 85%