coiled / benchmarks

BSD 3-Clause "New" or "Revised" License

Make TPC-H data publicly available #1517

Open jaychia opened 2 months ago

jaychia commented 2 months ago

Hi, I am trying to access the TPC-H benchmarking data but am running into some permission issues:

[image]

Am I accessing the right data, or could the data be made public please?

mrocklin commented 2 months ago

Ah, I thought that this was in coiled-datasets-rp, the requester-pays bucket.

@fjetter, who I think manages a lot of this, is on PTO this week. @hendrikmakait or @phofl, thoughts? (Also OK to wait until Florian gets back.)

fjetter commented 1 month ago

I triggered a copy of the data to our requester pays bucket. The data will be available under

coiled-datasets-rp/tpc-h/snappy/scale-*/

with the scales 1, 10, 100, 1k, 10k

At the time of writing, the 1k and 10k scales are still in progress but should be available shortly.

I will amend our README once this is through.
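For anyone who wants to try the new location, here is a minimal sketch of building the per-scale prefixes and listing one of them. It rests on a few assumptions that are not confirmed in this thread: that the `*` in `scale-*` is replaced literally by the labels 1, 10, 100, 1k, 10k; that `s3fs` is installed; and that AWS credentials are configured. The helper names `scale_prefix` and `list_scale` are made up for illustration. Because this is a requester-pays bucket, the AWS account making the request is billed for requests and data transfer.

```python
# Scale labels as listed in the comment above.
SCALES = ("1", "10", "100", "1k", "10k")

# Base location from the comment above; the "s3://" scheme is added here.
BASE = "s3://coiled-datasets-rp/tpc-h/snappy"


def scale_prefix(scale):
    """Return the S3 prefix for one scale (hypothetical helper).

    Assumes the scale label substitutes literally for the `*` in `scale-*`.
    """
    scale = str(scale)
    if scale not in SCALES:
        raise ValueError(f"unknown scale {scale!r}; expected one of {SCALES}")
    return f"{BASE}/scale-{scale}/"


def list_scale(scale):
    """List the objects under one scale prefix (hypothetical helper).

    Needs the third-party `s3fs` package and valid AWS credentials.
    `requester_pays=True` tells S3 that the caller accepts the charges.
    """
    import s3fs  # imported lazily so the path helper works without it

    fs = s3fs.S3FileSystem(requester_pays=True)
    return fs.ls(scale_prefix(scale))


if __name__ == "__main__":
    # Only prints the prefixes; call list_scale("1") to actually hit S3.
    for s in SCALES:
        print(scale_prefix(s))
```

The layout below each `scale-*/` prefix is not described here, so listing the prefix first is a safer starting point than guessing file or table names.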