planetlabs / gpq

Utility for working with GeoParquet
https://planetlabs.github.io/gpq/
Apache License 2.0
135 stars 7 forks source link

Feature suggestion: extract command #100

Open jtmiclat opened 8 months ago

jtmiclat commented 8 months ago

Hi! A processing I found useful when using geoparquet files is creating subsets of data with either using bbox or excluding/selecting columns.

rough suggested implementation

gpq extract -bbox=120,10.1,121.4,11 -geom_col=geometry -exclude_cols=value,label source.geoparquet target.geoparquet

I can work on the implementation of this in the upcoming weeks but would like to know if others would find this useful!

tschaub commented 8 months ago

Hi @jtmiclat - I the idea of an extract command. Minor, but the CLI flags will end up dash-delimited (e.g. --exclude-cols - saving wear/tear on that shift key).

cholmes commented 8 months ago

I'd definitely find it useful! Especially if it could work on remote files, which I think should be possible with #98