kedro-org / kedro-starters

Templates for your Kedro projects.
Apache License 2.0
64 stars 59 forks source link

Create a databricks-iris starter that enables packaged deployment on Databricks #129

Closed jmholzer closed 1 year ago

jmholzer commented 1 year ago

Motivation and Context

The guide on deploying packaged projects to Databricks proposed in https://github.com/kedro-org/kedro/pull/2595 uses the databricks-iris starter. This PR adds this starter. The databricks-iris starter is a duplicate of the pyspark-iris starter with a few changes.

This PR has a large diff because it is a brand new starter, only the following files have been changed from pyspark-iris:

How has this been tested?

Manually on Databricks in conjunction with the new guide.

Checklist

astrojuanlu commented 1 year ago

To test this:

kedro new --starter git+https://github.com/kedro-org/kedro-starters.git --directory databricks-iris --checkout feat/modify-pyspark-iris-databricks-packaged-deployment
jmholzer commented 1 year ago

To test this:

kedro new --starter git+https://github.com/kedro-org/kedro-starters.git --directory databricks-iris --checkout feat/modify-pyspark-iris-databricks-packaged-deployment

Thanks for figuring this out @astrojuanlu!