holoviz / spatialpandas

Pandas extension arrays for spatial/geometric operations
BSD 2-Clause "Simplified" License

Add temp_format argument to pack_partitions_to_parquet #22

Closed jonmmease closed 4 years ago

jonmmease commented 4 years ago

This PR adds a new argument, temp_format, to DaskGeoDataFrame.pack_partitions_to_parquet. It may be set to a format string containing a {partition} replacement field. If provided, the string is formatted with the output partition number to generate the temporary directory path for that partition.

For example, temp_format="/tmp/spatial/part-{partition}" would create one temporary directory per output partition, with {partition} replaced by that partition's number.

The temp_format string may also contain a {uuid} replacement field. If provided, this field is replaced by a randomly generated UUID string, which makes it possible to reuse the same temp_format string in multiple simultaneous jobs without conflict.
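
As a rough illustration of how the two replacement fields could expand, here is a minimal sketch using plain str.format. The helper name format_temp_dirs is hypothetical and is not part of spatialpandas. The zero-based partition numbering and the choice of a single UUID shared by all partitions of a job are assumptions made for the example, not confirmed behavior of this PR.

```python
import uuid


def format_temp_dirs(temp_format, npartitions):
    """Hypothetical helper: expand a temp_format string for each partition.

    Assumptions (not taken from the spatialpandas source):
    - partitions are numbered from 0
    - one UUID is generated per job and shared by all of its partitions
    """
    job_uuid = str(uuid.uuid4())
    # str.format ignores keyword arguments that the template does not use,
    # so a template without {uuid} still formats cleanly.
    return [
        temp_format.format(partition=p, uuid=job_uuid)
        for p in range(npartitions)
    ]


# Template with only {partition}
print(format_temp_dirs("/tmp/spatial/part-{partition}", 3))

# Template with {uuid} as well: two jobs get disjoint directory sets
job_a = format_temp_dirs("/tmp/spatial/{uuid}/part-{partition}", 2)
job_b = format_temp_dirs("/tmp/spatial/{uuid}/part-{partition}", 2)
print(set(job_a).isdisjoint(job_b))
```

Under these assumptions, the first call yields paths ending in part-0, part-1, and part-2, and the second pair of calls shows why a {uuid} field lets concurrent jobs share one template without colliding.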