Closed mrocklin closed 6 months ago
I did some digging into this. A couple of observations
So, we're running on relatively large files already and are merging those partitions way too aggressively.
To this I also had to sent n_workers to 6 or something to generate scale 100. Didn't look further into it as I had other goals in mind at the time.
Edit, that's also with 1 thread per worker.
Ah sorry, forget my earlier comment. I was referring to the actual run of the TPCH data. The failures during generation originate from running duckdb on multiple threads which triggers a segfault. just running with one thread does the trick. This is fixed with https://github.com/coiled/benchmarks/pull/1490
Not an urgent issue, but somewhat concerning
This is after running installation as in #1493 on a MacBookPro