Closed mrocklin closed 8 years ago
Binder requires we host the data on google storage?
You can spin up a cluster with binder? (wow)
I suggest that we don't yet worry about the cluster but just do this on a single machine. They're beefy
At the moment binder is hosted on GCE, so GS is free.
This is live, but there are a few probelms:
In[3]
stating the size is 4.6GB when it is 2.7GB. Some evidence for not using comments cc @danielfrg type
takes around 8 minutes to complete. This is the first real compute
. Options are:
With raw json, the same computation above takes 5 minutes. Which is still too long.
The binder machines shrunk down quite a bit recently. They used to be really powerful.
This blogpost http://continuum.io/blog/dask-distributed-cluster by @cowlicks might make a fun live notebook with binder.
This work would include the following: