Open bernardpazio opened 2 years ago
Looks like the failures are not related to my changes.
FYI, this problem seems to occur only with dask<2022.8 and pandas<1.5.
In this case, I suggest we recommend that people update their environment, and close this PR.
I have come across an issue with spatially partitioning data where the distance between geometries, as calculated by hilbert_distance, exceeds the range of a 32-bit integer. The meta for the returned series is set to 32-bit, which causes dask to cast down to 32-bit when setting the index. The cast overflows and leaves the resulting divisions out of order. Here is a minimal example to show the issue.
Setting the dtype correctly to int64 before passing to dask fixes this issue.
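As a rough illustration of the overflow itself (not the dask reproduction from this report), the sketch below simulates a downcast to a signed 32-bit integer using only the standard library. The distance values and the `cast_to_int32` helper are hypothetical, chosen only so that some values exceed 2**31 - 1. It shows how an ordered sequence stops being ordered once the large values wrap around.

```python
import ctypes


def cast_to_int32(value: int) -> int:
    """Simulate a C-style downcast to a signed 32-bit integer (wraps on overflow)."""
    return ctypes.c_int32(value).value


# Hypothetical, already-sorted distances; the last two do not fit in 32 bits.
distances = [10, 2_000_000_000, 2_500_000_000, 4_000_000_000]

# What an implicit cast down to int32 would produce for each value.
wrapped = [cast_to_int32(d) for d in distances]

print(wrapped)
print("still sorted:", wrapped == sorted(wrapped))
```

Values above 2**31 - 1 wrap to negative numbers, so boundaries derived from the wrapped values are no longer monotonically increasing. Keeping the values as 64-bit integers avoids the wraparound.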
It should be a simple fix to handle these cases, so I've gone ahead and created a branch for your review.