gregrahn / tpcds-kit

TPC-DS benchmark kit with some modifications/fixes
317 stars 200 forks source link

Where is the query type classification for TPC-DS queries? #15

Closed krdeepakio closed 8 years ago

krdeepakio commented 9 years ago

The queries are categorized into four types and few examples are there.

Reporting Class 41 Ad Hoc Class 59 Iterative Class 4 Data Mining Class 23

I did not find the type of each query anywhere. One hint is that store_sales queries are ad-hoc and catalog_sales are reporting. There is no other data point to classify the queries.

dleybz commented 8 years ago

Is there any update on this? Also interested.

gregrahn commented 8 years ago

Where did these categories come from?

dleybz commented 8 years ago

TPC-DS mentions a categorization multiple times. For insance, in Section 4.1 of this document "Reporting", "Ad Hoc", "Iterative", and "Data Mining" classes are referred to but not defined.

gregrahn commented 8 years ago

Thanks @dleybz. There is a reference to these also in the spec in section 1.4.1 but no mapping of query name/number to query class that I see. Bummer.

dleybz commented 8 years ago

Very odd that they seem to have a consistently named and even defined schema, but don't provide this mapping anywhere as far as I can tell.

leonardBang commented 5 years ago

Did you finally find the mapping? I met this issue too.

tharsha18 commented 4 years ago

Here is what i have been using for years

TPCDS Query classification: discovery/Interactive (ad-hoc) - q19, q42, q52, q55, q63, q68, q73, q98

BI Reporting - q3, q7, q27, q43, q53, q89

Analytics - q34, q46, q59, q79, q96, q48