The TPCDS specification and dsdgen have been updated several times in the last a few years, but Presto TPCDS connector hasn't been updated for a long time. There are several changes we noticed recently:
The row count error margin was removed. It used to allow 0.01% difference in the row counts in spec v2.1.0, but the margin was removed in 2019. Now all row counts should exactly match the latest dsdgen.
The old dsdgen generates different column order than the spec for the modification data sets. This bug should have been fixed in 2021.
Expected Behavior or Use Case
The Presto TPCDS connector generates exactly the same data as in the latest spec and dsdgen
The TPCDS specification and dsdgen have been updated several times in the last a few years, but Presto TPCDS connector hasn't been updated for a long time. There are several changes we noticed recently:
Expected Behavior or Use Case
The Presto TPCDS connector generates exactly the same data as in the latest spec and dsdgen
Presto Component, Service, or Connector
The Presto Java based TPCDS connector
Possible Implementation
Example Screenshots (if appropriate):
Context