cholmes closed this issue 2 months ago
Hi! This seems like a duplicate of https://github.com/duckdb/duckdb/issues/13914, which was fixed yesterday. A bugfix release is scheduled for Monday.
Oh great! I'll try it out.
Just tried the nightly on OS X (v1.0.1-dev4917 45787e5f9f) and I see the same problem, though maybe the fix wasn't in yet? Or maybe the nightlies aren't on 1.1 yet? Happy to try anything out, or to wait for Monday.
Hmm, that seems like an old nightly build? The fix should be included in the 1.1.1 nightlies, e.g. v1.1.1-dev238 f0f78913a5
(installed from https://duckdb.org/docs/installation/?version=main&environment=cli&platform=macos&download_method=direct)
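A quick way to catch the stale-build situation above is to compare the reported version string against the release series that should carry the fix. The sketch below is only an illustration: the helper names `parse_duckdb_version` and `is_at_least` are hypothetical, and it assumes the `vX.Y.Z-devN <hash>` shape seen in this thread. It checks the release series only, not whether a particular commit is included. The version string itself can be read with `PRAGMA version;` or `SELECT version();`.

```python
import re


def parse_duckdb_version(version: str) -> tuple:
    """Parse a string like 'v1.1.1-dev238 f0f78913a5' into (major, minor, patch, dev).

    Hypothetical helper: assumes the version format seen in this thread.
    A trailing commit hash, if present, is ignored.
    """
    token = version.strip().split()[0]
    match = re.fullmatch(r"v?(\d+)\.(\d+)\.(\d+)(?:-dev(\d+))?", token)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    major, minor, patch, dev = match.groups()
    return int(major), int(minor), int(patch), int(dev or 0)


def is_at_least(version: str, minimum=(1, 1, 1)) -> bool:
    """Return True if the build belongs to the `minimum` release series or later.

    This compares only major.minor.patch; it cannot tell whether a given
    dev build already contains a specific fix.
    """
    return parse_duckdb_version(version)[:3] >= tuple(minimum)


if __name__ == "__main__":
    # The two builds mentioned in this thread.
    print(is_at_least("v1.0.1-dev4917 45787e5f9f"))
    print(is_at_least("v1.1.1-dev238 f0f78913a5"))
```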
Yeah, I was surprised to get v1.0.1 when I clicked download. I'm pretty sure I clicked a link just like the one you shared, but I just did it again and got v1.1.1, so it must have been user error or some weird quirk. Will try it out today. Thanks!
Worked great! Thanks!
I've been working with a ~10 million row dataset, trying to write it out as GeoParquet, and DuckDB has been hard-crashing repeatedly. I think I've narrowed it down to the step where it writes the Parquet file, and different compression codecs behave differently:
zstd fails:
snappy fails:
brotli works, but takes 10+ minutes. gzip worked perfectly the first time, in what I thought was under a minute. Scratch that: when I ran gzip a second time to measure it (it was actually more like 10 seconds), it failed halfway through:
And subsequent attempts didn't get anywhere:
If I exclude the geometry, everything works and takes 5-6 seconds:
Also tested uncompressed (failed):
And lz4 almost works: it is super fast up to 99%, then fails:
I also tried the GDAL outputs, but none of them work, since this is STAC GeoParquet and has a lot of nested structs:
The schema is:
I'm on OS X on an M2, with DuckDB 1.1. Happy to share more info if needed.
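For anyone trying to reproduce the codec-by-codec comparison described above, a minimal sketch like the following could generate one `COPY ... TO` statement per compression codec. It is not the exact set of commands from the original report: the table name `items`, the output prefix `out`, and the column name `geometry` are placeholders, and the helper `build_copy_statements` is hypothetical. It only builds SQL strings, so it runs without DuckDB installed.

```python
# Codecs mentioned in the report above.
CODECS = ["uncompressed", "snappy", "gzip", "zstd", "brotli", "lz4"]


def build_copy_statements(source, out_prefix, codecs=CODECS, exclude_columns=()):
    """Build one DuckDB COPY statement per Parquet compression codec.

    `source` is a table or view name (placeholder; not from the report).
    `exclude_columns` drops columns via SELECT * EXCLUDE (...), e.g. to
    test whether the write succeeds without the geometry.
    """
    select_list = "*"
    if exclude_columns:
        select_list = "* EXCLUDE (" + ", ".join(exclude_columns) + ")"

    statements = []
    for codec in codecs:
        out_path = f"{out_prefix}_{codec}.parquet"
        statements.append(
            f"COPY (SELECT {select_list} FROM {source}) "
            f"TO '{out_path}' (FORMAT PARQUET, COMPRESSION '{codec}');"
        )
    return statements


if __name__ == "__main__":
    # Full write, one statement per codec.
    for stmt in build_copy_statements("items", "out"):
        print(stmt)
    # Same write with the (assumed) geometry column left out.
    for stmt in build_copy_statements("items", "out_nogeom", ["zstd"], ["geometry"]):
        print(stmt)
```

Running each generated statement in a fresh DuckDB process keeps a hard crash on one codec from affecting the next attempt, which may matter given that later attempts stalled after the gzip failure.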