Open BSchilperoort opened 7 months ago
This post by @rabernat actually goes through the entire workflow, in an efficient manner: https://discourse.pangeo.io/t/conservative-region-aggregation-with-xarray-geopandas-and-sparse/2715
Instead of geopandas, we can make use of shapely and pyshp.
While it is not really regridding, aggregation to a shapefile (polygons) should also be possible as an extension to the current capabilities. This can aid, e.g., hydrologists in aggregation of precipitation to a basin.
PyShp can read shapefiles and is implemented in pure python. Shapely has binary wheels on PyPI so it does not require compilation or conda.
In a quick test I have managed to get a 2D array with overlap fractions between a polygon from a shapefile and a grid.
shapely.box
This 2D array can be converted into weights to transform the gridded data into mean values. The 2D array can be (very) sparse.
Additional optimization can occur by making off part of the grid that is out of bounds of the shapefile polygon. This will speed up computation significantly.
Quick demo here: https://gist.github.com/BSchilperoort/db955c4183b3721493875688d7417026