informatics-lab / our-pangeo

A Helm chart for deploying a custom flavour of the Pangeo stack

Create CLI to simplify local Pangeo #8

Open jacobtomlinson opened 6 years ago

jacobtomlinson commented 6 years ago

You can run all of Pangeo locally using minikube. However, you may wish to run part of it locally and part of it in the cloud. For example, you may want your notebook to run locally while keeping the ability to create dask-kubernetes clusters in the cloud.

Requirements for running locally:

We could write a CLI application that checks for these dependencies, installs or configures any that are missing, and then starts the container.
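
The dependency check at the heart of such a CLI could look roughly like the sketch below. The issue does not list the actual requirements, so the tool names in `ASSUMED_DEPENDENCIES` are placeholders, and `find_missing` and `main` are hypothetical names. Installing missing tools and starting the container are left out.

```python
import shutil

# Hypothetical dependency list: the issue does not enumerate the requirements,
# so these tool names are assumptions for illustration only.
ASSUMED_DEPENDENCIES = ["docker", "kubectl", "helm"]


def find_missing(tools, which=shutil.which):
    """Return the tools that cannot be found on PATH, preserving order."""
    return [tool for tool in tools if which(tool) is None]


def main():
    """Report missing tools; return a shell-style exit code."""
    missing = find_missing(ASSUMED_DEPENDENCIES)
    if missing:
        print("Missing dependencies: " + ", ".join(missing))
        return 1
    # A real CLI would start the notebook container at this point.
    print("All dependencies found.")
    return 0
```

Calling `main()` from a script entry point would give a simple pass/fail check. The `which` parameter exists only so the lookup can be swapped out in tests.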

tam203 commented 6 years ago

What I want... A way to use the tools I know and love but 'leverage' the power of the cloud when it's there.

So:

Some Ideas:

jacobtomlinson commented 6 years ago

I don't think using Docker precludes you from using your own tools, but it does complicate things. In theory, you could connect the VS Code debugger and shell to your container.

My preference would be to keep code local and data remote (via FUSE). The story we keep discussing is "How would I use Pangeo on a train heading for a tunnel?". I like the idea that I could work on a notebook which would execute dask jobs in the cloud to touch the data. That way, when I enter the tunnel I can continue coding, and I only have to wait until I'm out of the tunnel to compute the dask graph.
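
As a rough illustration of the "wait until the tunnel is over" part, here is a standard-library sketch that holds back a computation until a remote scheduler is reachable over TCP. It does not use the dask API: `scheduler_reachable` and `run_when_online` are hypothetical helpers, and the `submit` callable stands in for whatever would actually trigger the compute.

```python
import socket
import time


def scheduler_reachable(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable networks.
        return False


def run_when_online(submit, host, port, poll_seconds=5.0,
                    max_attempts=None, sleep=time.sleep):
    """Call submit() once the scheduler is reachable, polling until then.

    Returns submit()'s result, or None if max_attempts polls all failed.
    """
    attempts = 0
    while max_attempts is None or attempts < max_attempts:
        if scheduler_reachable(host, port):
            return submit()
        attempts += 1
        sleep(poll_seconds)
    return None
```

In a notebook, `submit` could be a function that computes an already-built dask graph, with `host` and `port` pointing at the remote scheduler (Dask's distributed scheduler listens on port 8786 by default). A TCP check only shows that the port is open, not that the scheduler is healthy.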

VPNs are interesting, but pushing the dask scheduler into the cluster may cover most of the requirements for this one.