prestodb / presto

The official home of the Presto distributed SQL query engine for big data
http://prestodb.io
Apache License 2.0

Make the Java TPCDS connector generate maintenance data sets #23135

Open yingsu00 opened 5 months ago

yingsu00 commented 5 months ago

The current TPCDS connector generates only the base tables. It also needs to generate the maintenance data sets, which dsdgen produces with its --update option.

First, we need to find out how to add the new tables for the maintenance data. Then we need to select the right rows for the TINY scale factor (SF), since dsdgen cannot generate anything smaller than 1 GB. To do that, we need to see how the rows for the TINY SF were chosen for the base tables and apply the same strategy to the maintenance tables.
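To make the row-selection step concrete, here is a minimal sketch of one possible strategy. It is not taken from the connector: it assumes TINY corresponds to scale factor 0.01 and that TINY rows are simply a deterministic prefix of the 1 GB output, scaled linearly. The class `TinyRowSelection` and its methods are hypothetical names used only for illustration; the actual strategy used for the base tables still has to be confirmed.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Presto or dsdgen.
public final class TinyRowSelection {
    // Assumption: TINY is treated as scale factor 0.01 relative to SF1 (1 GB).
    static final double TINY_SCALE_FACTOR = 0.01;

    private TinyRowSelection() {}

    // Scale an SF1 row count down linearly, keeping at least one row
    // for any non-empty table so the TINY table is never empty.
    static long tinyRowCount(long sf1RowCount) {
        if (sf1RowCount <= 0) {
            return 0;
        }
        return Math.max(1L, Math.round(sf1RowCount * TINY_SCALE_FACTOR));
    }

    // Keep the first tinyRowCount(...) rows, so the selection is
    // deterministic across runs (assumed strategy, see lead-in).
    static <T> List<T> selectTinyRows(List<T> sf1Rows) {
        int keep = (int) tinyRowCount(sf1Rows.size());
        return new ArrayList<>(sf1Rows.subList(0, keep));
    }
}
```

If the base tables turn out to use a different rule (for example, per-table row counts that do not scale linearly), only `tinyRowCount` would need to change; the prefix selection would stay the same.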

yingsu00 commented 5 months ago

cc @ethanyzhang @pramodsatya @pdabre12