coreweave / ml-containers

MIT License
19 stars 3 forks source link

build(torch): Error on timeouts while fetching packages #61

Closed Eta0 closed 5 months ago

Eta0 commented 5 months ago

Error on apt-get update timeouts

Sometimes, apt would semi-silently fail to fetch packages while adding the ppa:ubuntu-toolchain-r/test package repository, emitting a warning, but also a successful exit code. This change checks for that warning and fails the build, since it is inappropriate to continue if those packages could not be properly updated.

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.0.1-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.1.1-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda11.8.0-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.0.1-ubuntu22.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.1.1-ubuntu22.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.2.2-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-base-cuda12.2.2-ubuntu22.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda11.8.0-ubuntu20.04-nccl2.16.5-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.0.1-ubuntu20.04-nccl2.19.3-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.0.1-ubuntu22.04-nccl2.18.5-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.1.1-ubuntu20.04-nccl2.18.3-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.1.1-ubuntu22.04-nccl2.18.3-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.2.2-ubuntu20.04-nccl2.19.3-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810723 Image: ghcr.io/coreweave/ml-containers/torch:es-error-on-timeout-9842f8d-nccl-cuda12.2.2-ubuntu22.04-nccl2.19.3-1-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.1.1-ubuntu20.04-nccl2.18.3-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda11.8.0-ubuntu20.04-nccl2.16.5-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.0.1-ubuntu20.04-nccl2.19.3-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.0.1-ubuntu22.04-nccl2.18.5-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.1.1-ubuntu22.04-nccl2.18.3-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.2.2-ubuntu20.04-nccl2.19.3-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-nccl-24032021-cuda12.2.2-ubuntu22.04-nccl2.19.3-1-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda11.8.0-ubuntu20.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.1.1-ubuntu20.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.0.1-ubuntu22.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.0.1-ubuntu20.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.1.1-ubuntu22.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.2.2-ubuntu20.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810759 Image: ghcr.io/coreweave/ml-containers/nightly-torch:es-error-on-timeout-9842f8d-base-24032021-cuda12.2.2-ubuntu22.04-torch2.4.0a0-vision0.19.0a0-audio2.2.0a0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch-extras:es-error-on-timeout-9842f8d-base-cuda12.0.1-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch-extras:es-error-on-timeout-9842f8d-base-cuda12.1.1-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0

github-actions[bot] commented 5 months ago

@Eta0 Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/8365810730 Image: ghcr.io/coreweave/ml-containers/torch-extras:es-error-on-timeout-9842f8d-base-cuda11.8.0-ubuntu20.04-torch2.2.0-vision0.17.0-audio2.2.0