sky-uk / kfp-operator

https://sky-uk.github.io/kfp-operator
BSD 3-Clause "New" or "Revised" License
15 stars 2 forks source link

Reduce Quickstart Pipeline image size #347

Closed aidandunlop closed 3 months ago

aidandunlop commented 3 months ago

Closes #346 The quickstart pipeline uses the TFX image as it's base, which is ~9GB and includes several extra tools which aren't needed by the quickstart pipeline. This PR uses a slim python image as the base instead, and installs the tfx python package and other dependencies directly which will reduce the image size dramatically.

Image size when using base TFX image:

Image size after using tfx python package on slim python:

We encountered a recent breaking change in the setuptools dependency indirectly imported by TFX which was causing CsvExampleGen to fail (see main card). This was fixed by pinning the dependency version.

Tasks

aidandunlop commented 3 months ago

OK to test