geopandas / dask-geopandas

Parallel GeoPandas with Dask
https://dask-geopandas.readthedocs.io/
BSD 3-Clause "New" or "Revised" License
486 stars 45 forks source link

Question regarding parallelism over many seperate GeoSeries #259

Closed kurt-rhee closed 11 months ago

kurt-rhee commented 11 months ago

Hello,

Thank you for your awesome project. I have a question where I have around 10,000 separate geoseries containing polygons. For each of these geoseries I would like to perform a separate unary_union so that at the end I have 10,000 polygons one for each geoseries.

For example: if geoseries_0 has three polygons A, B, C, and geoseries_1 has polygons D, E... I would like to return the unary union of ABC and the unary union of DE.

Is it possible to do this in a parallel fashion? or is the parallelism only within a geoseries.

Thank you!

TomAugspurger commented 11 months ago

I don’t think that will be parallel. dask-geopandas, like dask.dataframe, parallelizes along the index so it well suited for long and not too wide data frames.

You might be able to use dask.delayed or dask.bag on a sequence of regular geopandas.GeoSeries to do your unary unions in parallel.

On Oct 5, 2023, at 3:50 PM, Kurt Rhee @.***> wrote:

Hello,

Thank you for your awesome project. I have a question where I have around 10,000 separate geoseries containing polygons. For each of these geoseries I would like to perform a separate unary_union so that at the end I have 10,000 polygons one for each geoseries.

For example: if geoseries_0 has three polygons A, B, C, and geoseries_1 has polygons D, E... I would like to return the unary union of ABC and the unary union of DE.

Is it possible to do this in a parallel fashion? or is the parallelism only within a geoseries.

Thank you!

— Reply to this email directly, view it on GitHub https://github.com/geopandas/dask-geopandas/issues/259 or unsubscribe https://github.com/notifications/unsubscribe-auth/AAKAOIST67BYZMSSXNCIU3LX54MQNBFKMF2HI4TJMJ2XIZLTSOBKK5TBNR2WLJDUOJ2WLJDOMFWWLO3UNBZGKYLEL5YGC4TUNFRWS4DBNZ2F6YLDORUXM2LUPGBKK5TBNR2WLJLJONZXKZNENZQW2ZNLORUHEZLBMRPXI6LQMWBKK5TBNR2WLJDUOJ2WLJDOMFWWLLTXMF2GG2C7MFRXI2LWNF2HTLDTOVRGUZLDORPXI6LQMWSUS43TOVS2M5DPOBUWG44SQKSHI6LQMWVHEZLQN5ZWS5DPOJ42K5TBNR2WLKJSGQYDGNRTGY2TLAVEOR4XAZNFNFZXG5LFUV3GC3DVMWVDCOJSHEYDGNBRGIY2O5DSNFTWOZLSUZRXEZLBORSQ. You are receiving this email because you are subscribed to this thread.

Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

kurt-rhee commented 11 months ago

Understood thank you!