Closed bertsky closed 2 years ago
Too bad: this currently yields `CUDA_ERROR_SYSTEM_DRIVER_MISMATCH` in TensorFlow. I should have checked earlier (in core-cuda)...
It seems that the choice of `nvidia/cuda:11.3.1-cudnn8-runtime-ubuntu18.04` as base image now requires at least nvidia-driver-470 on the host system. I have 440 and 465 on the systems available to me, and neither of them can run the image. That means we are making a sacrifice here: to be able to support the newest TensorFlow/CUDA as well, we are forcing all host systems to get a newer driver. (It just might be that upgrading the driver is easier than upgrading CUDA. But it's still quite inconvenient.)
If you have the Nvidia repo source, you can just update `cuda-drivers-470`, which will take care of all dependencies. (But a fresh installation might work, too.)
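To see up front whether a host is affected, one could compare the installed driver's major version against the minimum. This is only a minimal sketch: it assumes the threshold of 470 observed above is right, that `nvidia-smi` is on the `PATH`, and the function names are made up for illustration.

```python
import subprocess

# Assumption: minimum major version taken from the observation above.
MIN_DRIVER_MAJOR = 470


def driver_major(version: str) -> int:
    """Return the major component of a driver version string like '465.19.01'."""
    return int(version.strip().split(".")[0])


def driver_is_sufficient(version: str, minimum: int = MIN_DRIVER_MAJOR) -> bool:
    """True if the driver's major version is at least the required minimum."""
    return driver_major(version) >= minimum


def query_host_driver() -> str:
    """Ask nvidia-smi for the driver version of the first GPU."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        check=True, capture_output=True, text=True,
    ).stdout
    return out.splitlines()[0].strip()
```

On a GPU host, `driver_is_sufficient(query_host_driver())` would then tell you whether the driver needs upgrading before the image can be used.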
Anyway, this does work (based on a locally built `ocrd/core-cuda` from https://github.com/OCR-D/core/pull/704).
for venv in /usr/local/sub-venv/headless-tf*; do . $venv/bin/activate && python -c "import tensorflow as tf; print(tf.test.is_gpu_available())"; done
This yields `True` 3x.
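The same check could also be scripted so that each venv's result is reported separately instead of just counting `True`s. A rough sketch, assuming each venv has its interpreter at `bin/python`; the helper names `run_check` and `check_all` are hypothetical, and the snippet is the same `tf.test.is_gpu_available()` call as in the shell loop (marked deprecated in TensorFlow 2, but used here because it works the same way across the TF venvs).

```python
import glob
import subprocess

# Same call as in the shell loop above.
SNIPPET = "import tensorflow as tf; print(tf.test.is_gpu_available())"


def run_check(python_exe: str, snippet: str = SNIPPET) -> str:
    """Run the snippet with the given interpreter; return its last stdout line."""
    result = subprocess.run(
        [python_exe, "-c", snippet], capture_output=True, text=True
    )
    lines = result.stdout.strip().splitlines()
    # TensorFlow may log before printing, so only the last line is the answer.
    return lines[-1] if result.returncode == 0 and lines else "FAILED"


def check_all(pattern: str = "/usr/local/sub-venv/headless-tf*") -> dict:
    """Map each matching venv directory to the result of the GPU check."""
    return {
        venv: run_check(f"{venv}/bin/python")
        for venv in sorted(glob.glob(pattern))
    }
```

Calling `check_all()` inside the container should give one entry per venv, so a venv that fails to import TensorFlow shows up as `FAILED` rather than silently ending the loop.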
Conflicting files: `core`
How are you supposed to keep PRs alive which involve subrepos then? I guess I'll have to update https://github.com/OCR-D/core/pull/704 each time core master changes, and then in turn update here.
So to sum up, we have two drawbacks here:
- […] `-cuda` variants becomes even larger (for `ocrd:core-cuda` it's already 12 GB)
- […]

But
- […] `ocrd/all:maximum-cuda` build),
- […] `make cuda-ubuntu` or merely `make cuda-ldconfig` as fixup),

I'd say let's merge!
Implements #263