iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.
http://iree.dev/
Apache License 2.0
2.57k stars 574 forks source link

Update nvidia docker image. This was missed last time. #17661

Closed jpienaar closed 3 months ago

ScottTodd commented 3 months ago

The postsubmit test_nvidia_a100 job started failing after this was merged: https://github.com/iree-org/iree/actions/runs/9493590493/job/26189543324#step:8:60

The persistent runner is out of disk space: docker: failed to register layer: write /var/cuda-repo-ubuntu2004-12-2-local/nsight-systems-2023.2.3_2023.2.3.1001-1_amd64.deb: no space left on device.

Ideas:

ScottTodd commented 3 months ago
  • Check the docker image size before and after this change

12.3 GB both before and after