nlesc-sherlock / emma

Ansible playbook to create a cluster with GlusterFS, Docker, Spark and JupyterHub services
Apache License 2.0

Add dask role #117

Open sverhoeven opened 5 years ago

sverhoeven commented 5 years ago

Big data handling in Python is typically done with either Spark or Dask (dask.pydata.org/). Currently emma provides Spark but has no Dask support.

Let's create a role which installs a Dask cluster. The role should also install some geospatial and plotting libraries, such as xarray, geopandas, geoviews and datashader, for use in example notebooks.
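As a starting point for discussion, a minimal sketch of what the role's tasks could look like. Everything project-specific here is an assumption, not existing emma code: the `roles/dask/tasks/main.yml` path, the `dask_scheduler_host` variable, the `/usr/local/bin/dask-scheduler` install location, and the choice of pip plus systemd. Port 8786 is the Dask scheduler default.

```yaml
# roles/dask/tasks/main.yml (hypothetical path; not part of emma yet)

- name: Install Dask and the libraries used by the example notebooks
  ansible.builtin.pip:
    name:
      - "dask[complete]"   # pulls in dask.distributed as well
      - xarray
      - geopandas
      - geoviews
      - datashader
    state: present

# dask_scheduler_host is a hypothetical variable naming the node
# that should run the scheduler.
- name: Write a systemd unit for the Dask scheduler
  ansible.builtin.copy:
    dest: /etc/systemd/system/dask-scheduler.service
    mode: "0644"
    content: |
      [Unit]
      Description=Dask scheduler
      After=network.target

      [Service]
      # Assumed install location of the dask-scheduler executable
      ExecStart=/usr/local/bin/dask-scheduler --port 8786
      Restart=on-failure

      [Install]
      WantedBy=multi-user.target
  when: inventory_hostname == dask_scheduler_host

- name: Enable and start the Dask scheduler
  ansible.builtin.systemd:
    name: dask-scheduler
    state: started
    enabled: true
    daemon_reload: true
  when: inventory_hostname == dask_scheduler_host
```

The worker nodes would need a similar unit that runs `dask-worker` pointed at the scheduler address (for example `tcp://<scheduler-host>:8786`). Whether to install through pip or conda, and whether to reuse the Python environment already used by JupyterHub, is still open.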