kreuzwerker / kreuzlaker

11 stars 2 forks source link

Add a Notebook solution #1

Open fabdy opened 1 year ago

fabdy commented 1 year ago

For data science, adding a notebook solutions is a nice way to prototype transformations or build one off analyses.

Tasks:

Figure out how to build a notebook environment (likely some hints in https://aws.amazon.com/blogs/startups/a-data-lake-as-code-featuring-chembl-and-opentargets/) Implement it in cdk If AWS has a Jupyter service: Add group for this and add example user to it Setup jupyter service in CDK and give that group access to it

DoD:

The example user (or a different user separate form the deployment user) can open a notebook and access the data lake via some python or sql code