h2oai / db-benchmark

reproducible benchmark of database-like ops
https://h2oai.github.io/db-benchmark
Mozilla Public License 2.0

new tasks: sort and groupby/join #203

Open jangorecki opened 3 years ago

jangorecki commented 3 years ago

We currently have pre-sorted data cases in the groupby and join tasks. The problem is that solutions cannot fully exploit the ordering, because they are not told that the data is sorted. We could add another task in which the data is sorted explicitly at the beginning of the script, so that the queries that follow can rely on the known sort order.
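To illustrate the idea, here is a minimal sketch in plain Python. It is not taken from the benchmark scripts: the toy rows and the function names `sum_by_key_hash` and `sum_by_key_sorted` are made up for this example. It shows an explicit sort step up front, followed by a grouped sum that relies on the data being sorted, next to an order-agnostic version for comparison.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical toy rows of (key, value); not the benchmark's actual data.
rows = [("b", 2), ("a", 1), ("c", 5), ("a", 3), ("b", 4)]


def sum_by_key_hash(rows):
    """Order-agnostic grouped sum: accumulates totals in a hash table."""
    totals = {}
    for key, value in rows:
        totals[key] = totals.get(key, 0) + value
    return totals


def sum_by_key_sorted(sorted_rows):
    """Grouped sum that assumes the input is already sorted by key.

    Equal keys are adjacent, so each group is one contiguous run and
    a single streaming pass is enough.
    """
    return {
        key: sum(value for _, value in run)
        for key, run in groupby(sorted_rows, key=itemgetter(0))
    }


# Step 1: the explicit sort, done once at the beginning of the script.
sorted_rows = sorted(rows, key=itemgetter(0))

# Step 2: later queries are allowed to rely on the sort order.
result_sorted = sum_by_key_sorted(sorted_rows)

# Reference result from the version that knows nothing about ordering.
result_hash = sum_by_key_hash(rows)
```

Both versions return the same totals. The difference is that the second one is only correct because the sort was done explicitly beforehand, which is the kind of knowledge the proposed task would give to each solution.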