Checkbox is a testing framework used to validate device compatibility with Ubuntu Linux. It’s the testing tool developed for the purposes of the Ubuntu Certification program.
Tests have been added that check setting up DSS to use CUDA with NVIDIA GPUs, followed by some simple tests that verify the PyTorch and TensorFlow notebooks can actually use the GPUs.
resource.pxu has been added, and the checks for whether GPUs from Intel or NVIDIA are available have been moved there and turned into a resource. These checks used to be test jobs; as a resource, they allow the respective tests to be skipped when the relevant GPUs are not available.
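The gating described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the contents of resource.pxu: the `GpuResource` record, the `TEST_GROUPS` names and the `plan` helper are all hypothetical, and the real jobs express their requirements in Checkbox's own job definitions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuResource:
    """Hypothetical stand-in for what a GPU resource job would report."""
    has_intel: bool
    has_nvidia: bool


# Hypothetical test groups, each paired with the condition it requires.
TEST_GROUPS = {
    "cpu": lambda res: True,  # CPU tests have no GPU requirement
    "intel_gpu": lambda res: res.has_intel,
    "nvidia_gpu": lambda res: res.has_nvidia,
}


def plan(res: GpuResource) -> dict[str, str]:
    """Map each test group to 'run' or 'skip' for the given resource."""
    return {
        name: "run" if required(res) else "skip"
        for name, required in TEST_GROUPS.items()
    }


if __name__ == "__main__":
    # A machine with only NVIDIA GPUs: the Intel GPU group is skipped.
    print(plan(GpuResource(has_intel=False, has_nvidia=True)))
```

Because the availability check is data rather than a test job, a missing GPU leads to a skip instead of a failure.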
Tests have been added that run only on the CPU. These tests run on all machines irrespective of the available GPUs, to ensure that the basic DSS functionality of creating and running notebooks with PyTorch and TensorFlow continues to work.
The command that triggers the tests from the checkbox-dss snap produced with the provider has been renamed from validate-intel-gpu to validate-with-gpu. The snap's sources and the README have been updated accordingly. This is a backwards-incompatible change to the snap, so its version has been bumped from 2.0 to 3.0.
The GitHub workflow for running DSS has been refactored so that it needs only a single job definition, which is used for all the values from the test matrix.
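As a rough illustration of what "a single job definition for all matrix values" means, the sketch below expands a matrix into its combinations and applies one job function to each. The matrix keys and values (`dss_channel`, `queue`, `queue-a`, and so on) are hypothetical placeholders, not the ones used in the actual workflow.

```python
from itertools import product


def expand_matrix(matrix: dict[str, list[str]]) -> list[dict[str, str]]:
    """Return one parameter set per combination of matrix values."""
    keys = list(matrix)
    return [
        dict(zip(keys, combo))
        for combo in product(*(matrix[key] for key in keys))
    ]


def job(params: dict[str, str]) -> str:
    """The single job definition, parameterised by the matrix values."""
    return f"validate DSS from {params['dss_channel']} on {params['queue']}"


if __name__ == "__main__":
    # Hypothetical matrix; the real workflow's keys and values may differ.
    matrix = {
        "dss_channel": ["latest/stable", "latest/edge"],
        "queue": ["queue-a", "queue-b"],
    }
    for params in expand_matrix(matrix):
        print(job(params))
```

Adding a new target machine then only means adding a value to the matrix, not writing another job.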
An NVIDIA DGX machine has been added as a target for running the tests. It represents a machine that has no Intel GPUs, only NVIDIA GPUs.
These DSS validations need to be run on machines from Testflinger. Provisioning has unfortunately been flaky in the days around the creation of this PR. See a run of the relevant workflow here: any failures that are reported happened during the provisioning phase on Testflinger, not in the tests themselves.
The success of this job means that the CPU tests as well as tests for both Intel and NVIDIA GPUs passed, since this machine has both GPUs available.
The success of this job on NVIDIA DGX means that the Intel GPU tests were successfully skipped, and only the CPU and NVIDIA GPU tests were executed, since this machine only has NVIDIA GPUs.
We can try to run the workflow again when the "weather" at Testflinger has calmed down, and I am confident that the tests will pass. The set of machines on which these validations need to run is due to be revised soon anyway, based on the spec KF114, and we could then have a more stable set of machines available to carry out these tests.
Documentation
No changes to Checkbox's documentation.