cuDNN 8 doesn't work on V100 and T4; does work on P100, P4, and K80

AlexeyAB / darknet

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet)

I have tested darknet on Google Cloud machines, each with one of the following GPUs:

V100, T4, P100, P4, K80

cuDNN 7 with CUDA 10.1 works on all of these GPUs (probably the same for CUDA 10.2). cuDNN 8 (8.0.5) with CUDA 10.1, or cuDNN 8 (8.1) with CUDA 11.2, causes an error during validation. The error occurs only on the T4 and V100; the other GPUs work fine. The error is:

cuDNN Error: CUDNN_STATUS_BAD_PARAM in convolutional_kernels.cu : forward_convolutional_layer_gpu()

It seems that there is a bug in cuDNN 8 support that affects specific GPUs regardless of the CUDA version. It could be a bug affecting all Turing and Volta architectures (the T4 is Turing and the V100 is Volta, while the GPUs that work are Pascal or Kepler).
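The hypothesis above can be written down as a small lookup, shown here as a minimal Python sketch. It is not part of darknet: the names `GPU_COMPUTE_CAPABILITY` and `expected_to_fail` are made up for illustration, and the rule it encodes (cuDNN 8 fails on compute capability 7.x) is only the guess from this report, not a confirmed cause.

```python
# Compute capability (major, minor) of the GPUs tested in this report.
GPU_COMPUTE_CAPABILITY = {
    "V100": (7, 0),  # Volta
    "T4": (7, 5),    # Turing
    "P100": (6, 0),  # Pascal
    "P4": (6, 1),    # Pascal
    "K80": (3, 7),   # Kepler
}


def expected_to_fail(gpu: str, cudnn_major: int) -> bool:
    """Hypothesis only: cuDNN 8 fails on Volta/Turing (compute capability 7.x)."""
    major, _minor = GPU_COMPUTE_CAPABILITY[gpu]
    return cudnn_major == 8 and major == 7


if __name__ == "__main__":
    # Print the matrix of GPU / cuDNN combinations under this hypothesis.
    for gpu in GPU_COMPUTE_CAPABILITY:
        for cudnn_major in (7, 8):
            status = "error" if expected_to_fail(gpu, cudnn_major) else "ok"
            print(f"{gpu:5s} cuDNN {cudnn_major}: {status}")
```

Under this guess, only the V100 and T4 rows with cuDNN 8 are marked as failing, which matches what I observed. Whether other Volta or Turing cards behave the same way is untested.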

cuDNN 8 doesn't work on V100 and T4; does work on P100, P4, and K80 #7647