TRI-ML / dgp

ML Dataset Governance Policy for Autonomous Vehicle Datasets
https://tri-ml.github.io/dgp/
MIT License
93 stars 63 forks source link

`build-docker` GH Actions workflow fails on master #106

Closed tk-woven closed 2 years ago

tk-woven commented 2 years ago

Description

The build-docker workflow fails on master with

#4 [internal] load metadata for docker.io/nvidia/cuda:11.1-devel-ubuntu18.04
#4 ERROR: docker.io/nvidia/cuda:11.1-devel-ubuntu18.04: not found
------
 > [internal] load metadata for docker.io/nvidia/cuda:11.1-devel-ubuntu18.04:
------
error: failed to solve: rpc error: code = Unknown desc = failed to solve with frontend dockerfile.v0: failed to create LLB definition: docker.io/nvidia/cuda:11.1-devel-ubuntu18.04: not found
Error: buildx failed with: error: failed to solve: rpc error: code = Unknown desc = failed to solve with frontend dockerfile.v0: failed to create LLB definition: docker.io/nvidia/cuda:11.1-devel-ubuntu18.04: not found

Reproduction

Run the build-docker workflow on master manually or trigger it via a merge to master.

Fix

Update the base image in our Dockerfile.

Formerly:

FROM nvidia/cuda:11.1-devel-ubuntu18.04

I don't see this entry in the docker registry. I suspect it was renamed or replaced with alternatives.

Fix:

FROM nvidia/cuda:11.1.1-devel-ubuntu18.04

The new image is here.

tk-woven commented 2 years ago

For those who can see it, here is an example of a failed run.

tk-woven commented 2 years ago

Fixed in #107 .