Open jorisvandenbossche opened 3 years ago
Posting my initial brainstorm list of possible topics here:
dask-geopandas
itself:
spatialpandas
is another library also having a dask implementation: what can we learn from them? Ways to collaborate / share code?Thanks for starting this!
I would leave GPU out of the discussion for now. The situation there is very different at the moment and it would probably require its own introduction and discussion topics, not necessarily linked to dask.
I would like to spend a reasonable amount of time on spatial partitioning and overlapping computations because figuring out this bit properly is key in my eyes. It is not straightforward task at all because one approach needs to be used for postcode zones (contiguous compact polygons) and another one for, say, linestring trajectories.
Agree on IO. I guess that PostGIS links will be more important in dask-geopandas than they're in geopandas.
We can touch visualisation while talking about spatialpandas, since that is used as a direct interface to datashader. (As a side note, it may be useful to work out dask-based conversion between dask-geopandas and spatialpandas geometries.)
I updated the top post with a summary of what we discussed yesterday (and to be completed if people confirm)
Should we maybe switch use cases and my bit on partitioning and indexing? That way I can try to summarise them and open the floor for the main discussion in which we can reflect on real-life use cases along the way.
edit: I switched it above
Sounds good. I am only wondering if we then should also move spatialpandas to just before your talk (since it will mainly touch on the spatial partitioning / hilbert curve for repartitioning) ? Although on the other hand it also fits after my dask-geopandas explanation.
I'd leave it where it is to cover the existing packages first.
During the Dask Summit, we have a 2-hour workshop scheduled about scaling geospatial vector data on Thursday May 20th at 11-13:00 UTC (https://summit.dask.org/schedule/presentation/22/scaling-geospatial-vector-data/)
We can use this issue to further gather ideas and discuss the exact content of the workshop.
Workshop abstract:
More detailed agenda:
cc @martinfleis @jsignell