Rebuild Kedro/Databricks workflow recommendations

yetudada commented 1 year ago

Description

We concluded a research item on how Kedro is being used on Databricks (#2105). This task makes a recommendation to improve our Deployment to a Databricks cluster documentation.

Context

We will work on a Kedro-Databricks plugin at a later stage but first we'll overhaul the documentation because there was an insight about how much our users rely on it to get their work done. At this point in time, we'll recommend use of dbx and Databricks Repos as a way to use Kedro on Databricks.

Possible Implementation

Our Deployment to a Databricks cluster documentation needs quite a bit of help in the following ways:

High-priority
- Include an introduction about why you would choose to use Kedro on Databricks
- Recommend a workflow for syncing the latest version of their code written in an IDE to the Databricks workspace; we should recommend Databricks Repos and dbx sync as the way to do this
- Recommend a workflow for running their pipelines on Databricks; we should recommend use of the iPython extension (used through a Databricks notebook) or use of dbx deploy
- Recommend a workflow for visualising their pipeline through a Databricks notebook (this section is written, it just needs to be made more prominent)
- Additionally, please walk users through being able to configure dbx and Databricks Repos so that they can use this functionality
Medium-priority
- Provide recommendations specific to Azure; our documentation is heavily based on AWS

jmholzer commented 1 year ago

This parent issue needs to be broken down further:

Define a new workflow with Databricks repos, dbx and kedro
Document our new workflow, make changes to existing documentation
Document recommendations for use of Azure databricks (medium priority)

astrojuanlu commented 1 year ago

I guess only the Azure databricks part is missing?

merelcht commented 1 year ago

All subtasks have now been completed. The remaining work is blog posts and has been removed to kedro-devrel.

kedro-org / kedro