Open alvin9964 opened 4 years ago
Can you reproduce this problem?
I tried several combinations of CUDA and cuDNN versions. Here are the results (CUDA - cuDNN):
10.1 - 7.6.1: same error
10.1 - 7.6.2: OK
10.1 - 7.6.3: OK
10.1 - 7.6.4: OK
10.1 - 7.6.5: OK
10.2 - 7.6.5: same error
I think the workaround for me is to avoid the cuDNN versions that cause the error and stick with CUDA 10.1 for now.
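In case it helps anyone else pinning versions, here is a minimal Java sketch that encodes only the combinations tested above and looks one up before starting the app. The class name `CudnnComboCheck` and the method `reportedOk` are made up for illustration and are not part of DL4J. The version strings have to be supplied by hand, since this does not detect the installed CUDA or cuDNN.

```java
import java.util.Map;
import java.util.Optional;

public class CudnnComboCheck {
    // Results reported in this thread only (key is "CUDA/cuDNN").
    // true = worked, false = CUDNN_STATUS_BAD_PARAM in BatchNormalization.
    private static final Map<String, Boolean> REPORTED = Map.of(
            "10.1/7.6.1", false,
            "10.1/7.6.2", true,
            "10.1/7.6.3", true,
            "10.1/7.6.4", true,
            "10.1/7.6.5", true,
            "10.2/7.6.5", false);

    // Returns the reported result, or empty if the combination was never tested here.
    public static Optional<Boolean> reportedOk(String cuda, String cudnn) {
        return Optional.ofNullable(REPORTED.get(cuda + "/" + cudnn));
    }

    public static void main(String[] args) {
        // Hypothetical usage: pass the versions you installed as arguments.
        String cuda = args.length > 0 ? args[0] : "10.2";
        String cudnn = args.length > 1 ? args[1] : "7.6.5";
        Optional<Boolean> result = reportedOk(cuda, cudnn);
        if (!result.isPresent()) {
            System.out.println("CUDA " + cuda + " + cuDNN " + cudnn + ": not tested in this thread");
        } else if (result.get()) {
            System.out.println("CUDA " + cuda + " + cuDNN " + cudnn + ": reported OK");
        } else {
            System.out.println("CUDA " + cuda + " + cuDNN " + cudnn + ": reported CUDNN_STATUS_BAD_PARAM");
        }
    }
}
```

An untested combination returns an empty `Optional` rather than `false`, so "unknown" is not confused with "known to fail".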
2020-09-09 14:01:37 WARN BatchNormalization:382 - CuDNN BatchNormalization forward pass execution failed - falling back on built-in implementation
java.lang.RuntimeException: cuDNN status = 3: CUDNN_STATUS_BAD_PARAM
    at org.deeplearning4j.cuda.BaseCudnnHelper.checkCudnn(BaseCudnnHelper.java:48)
    at org.deeplearning4j.cuda.normalization.CudnnBatchNormalizationHelper.preOutput(CudnnBatchNormalizationHelper.java:320)
    at org.deeplearning4j.nn.layers.normalization.BatchNormalization.preOutput(BatchNormalization.java:462)
    at org.deeplearning4j.nn.layers.normalization.BatchNormalization.activate(BatchNormalization.java:404)
    at org.deeplearning4j.nn.graph.vertex.impl.LayerVertex.doForward(LayerVertex.java:111)
    at org.deeplearning4j.nn.graph.ComputationGraph.outputOfLayersDetached(ComputationGraph.java:2380)
    at org.deeplearning4j.nn.graph.ComputationGraph.output(ComputationGraph.java:1741)
    at org.deeplearning4j.nn.graph.ComputationGraph.output(ComputationGraph.java:1697)
    at org.deeplearning4j.nn.graph.ComputationGraph.output(ComputationGraph.java:1627)
    at SocialDistanceCheckerVideo.getPredictedObjects(SocialDistanceCheckerVideo.java:150)
    at SocialDistanceCheckerVideo.main(SocialDistanceCheckerVideo.java:76)
POM file as below:
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
CUDA version 10.2, cuDNN 7.6.5