Open QiuMM opened 4 years ago
Yes, there were some bugs in previous versions of quantiles aggregator. I don't have a list of GitHub issues and pull requests, and it is a bit difficult now to point out exactly what versions had what bug. For instance, https://github.com/apache/incubator-druid/pull/7320 Unfortunately, when quantiles sketch was fixed, a bug in Theta sketch was introduced. So I would recommend upgrading to the latest version of Druid (0.16.0-incubating).
@AlexanderSaydakov thanks, and is it enough if I upgrade the data sketch extension only rather than the whole Druid.
I am not sure which version of the extension would be compatible with which version of Druid. It always is built as a part of the whole Druid package.
Okay, I'll have a try, thanks @AlexanderSaydakov
We used the DataSketches to compute quantiles and got very weird query results.
Affected Version
0.12.2
Description
Metrics spec at ingestion time:
My query:
The query result:
As we can see, the value of 0.5-quantile is 1185.0, so there must be nearly half of the
cm_value
greater than or equal to 1185.0. However, if we multiply 1185 and 2111 (half of the count) , we found the result is 2501535 which is much greater than the sum of cm_value 667109. Impossible! this should not be happen. We have loaded the same data into hive, and queried hive we got the result:@AlexanderSaydakov is there any bug of DataSketches Quantiles Sketch or I used it in a wrong way?