azavea / noaa-hydro-data

NOAA Phase 2 Hydrological Data Processing
11 stars 3 forks source link

Setup Kubernetes/Dask/Jupyter locally #7

Closed lewfish closed 2 years ago

lewfish commented 2 years ago

We need a way to run Dask with many cores on AWS. There are a few routes we could take:

lewfish commented 2 years ago

As a first step, we should get JupyterHub and then DaskHub running on Minikube locally using Helm.

lewfish commented 2 years ago

Setup Instructions

This describes how to install DaskHub which combines JupyterHub (a multi-user Jupyter notebook server) with Dask Gateway on Kubernetes.

Install DaskHub locally

Notebook Configuration

Make a new Jupyter notebook and run the following code.

from dask_gateway import Gateway, GatewayCluster
cluster = GatewayCluster()
gateway = Gateway()
gateway.list_clusters()

client = cluster.get_client()
client