gregrahn / tpcds-kit

TPC-DS benchmark kit with some modifications/fixes
320 stars 205 forks source link

How can I generate data which can fit answer_sets? #60

Open RinChanNOWWW opened 1 year ago

RinChanNOWWW commented 1 year ago

The data generation is random, but I found there is an answer_sets dir in this repo. So how can I check the answer?

manucorujo commented 1 year ago

You need to generate the executable queries using query templates with the qualification substitution parameters as defined in Appendix B. I would like dsqgen to have an option for this, but section 4.3.2 of the specification says this:

Comment: The substitution parameter values for the qualification queries are provided in Appendix B: They must be manually inserted into the query templates.

So I understand that's why they don't provide the option. Nevertheless, I think it would be a very interesting and useful option, although the results might not be official.