jamesdolezal / slideflow

Deep learning library for digital pathology, with both Tensorflow and PyTorch support.
https://slideflow.dev
GNU General Public License v3.0
230 stars 38 forks source link

[BUG] Following slideflow documentation for running the containers leads to error. #387

Open Nicolai-98 opened 2 days ago

Nicolai-98 commented 2 days ago

Description

While following https://slideflow.dev/installation/#run-a-docker-container for the torch backend container I encountered a problem with starting the container.

To Reproduce

docker pull jamesdolezal/slideflow:latest-torch docker run -it --shm-size=2g --gpus all jamesdolezal/slideflow:latest-torch Output: latest-torch: Pulling from jamesdolezal/slideflow cf06a7c31611: Pull complete 41acec2bfcb9: Pull complete f2531a2e2fb3: Pull complete 491f1d30a6d5: Pull complete b47137a77b34: Pull complete 20f3d07a7d65: Pull complete 2861d6217cee: Pull complete 0ce2375d834b: Pull complete c994d0811ab5: Pull complete d40823bdf444: Pull complete 843f2129020c: Pull complete 3523444eb9cd: Pull complete f832a00b3ac0: Pull complete Digest: sha256:7ec6a62bee473387fa110e569b38e756f9be836090c50fc120bca636758e0b9c Status: Downloaded newer image for jamesdolezal/slideflow:latest-torch docker.io/jamesdolezal/slideflow:latest-torch docker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy' nvidia-container-cli: initialization error: load library failed: libnvidia-ml.so.1: cannot open shared object file: no such file or directory: unknown. ERRO[0000] error waiting for container:

Expected behavior

The container should start without errors.

Environment:

Additional context

I also tried this on a windows machine with wsl2 and got: docker: Error response from daemon: could not select device driver "" with capabilities: [gpu]
After running: docker run -it --shm-size=2g --gpus all jamesdolezal/slideflow:latest-torch