mozilla / participation-metrics-org

Participation metrics planning repository

Git Panel: "Number of Authors" and "Names of Authors table" mismatch #96

hmitsch closed 7 years ago

hmitsch commented 7 years ago

Dashboard URL: https://analytics.mozilla.community:443/goto/28df4a5431ee6e9a9c57a2a2e8e506b6

Artifacts

image

Issue

The number of authors differs between the "Number of Authors" widget and the "Names of Authors" table. Strange, right?

jgbarah commented 7 years ago

That's because, for performance reasons, the table is limited to showing the top 20 authors. I've raised the limit to 200, so you can now see that there are many more authors. We avoid tables with long lists because they mean more traffic to and from the ElasticSearch server, and more time for the panel to load.
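As a rough illustration of the row limit described above: in the Elasticsearch query DSL, a terms aggregation takes a `size` parameter that caps the number of buckets returned. The sketch below only builds the request body and sends nothing to a server. The field name `author_name` and the helper `authors_table_query` are assumptions made for illustration, not the dashboard's actual configuration.

```python
import json


def authors_table_query(field="author_name", size=20):
    """Build a request body for a "top authors" table.

    `field` is a hypothetical field name; `size` caps the number of
    buckets (table rows) the terms aggregation returns.
    """
    return {
        "size": 0,  # skip raw hits, return aggregation results only
        "aggs": {
            "authors": {
                "terms": {"field": field, "size": size},
            }
        },
    }


# Raising the cap from 20 to 200 rows, as in the comment above.
print(json.dumps(authors_table_query(size=200), indent=2))
```

A larger `size` means more buckets computed and shipped back per request, which is the traffic and load-time cost mentioned above.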

In any case, you will notice that the table now has 62 rows, while the metrics widget shows 64. That's because, strange as it may seem, ElasticSearch calculates unique counts only approximately, and the number in the metrics widget is a unique count. The precision of the approximation can be increased, but again at the cost of performance (more delay in executing the query).
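For reference, unique counts in Elasticsearch are normally requested with the cardinality aggregation, which accepts an optional `precision_threshold` parameter for trading resources against accuracy. The sketch below builds such a request body without contacting a server. The field name `author_name`, the helper `unique_authors_query`, and the threshold value are assumptions for illustration; whether the metrics widget is configured this way is not confirmed here.

```python
import json


def unique_authors_query(field="author_name", precision_threshold=None):
    """Build a request body for an approximate unique-author count.

    `field` is a hypothetical field name. When `precision_threshold`
    is given, it is passed through to the cardinality aggregation;
    otherwise the server-side default applies.
    """
    cardinality = {"field": field}
    if precision_threshold is not None:
        cardinality["precision_threshold"] = precision_threshold
    return {
        "size": 0,  # skip raw hits, return aggregation results only
        "aggs": {"unique_authors": {"cardinality": cardinality}},
    }


# Default precision versus an explicitly raised threshold (value is illustrative).
print(json.dumps(unique_authors_query(), indent=2))
print(json.dumps(unique_authors_query(precision_threshold=10000), indent=2))
```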

For most uses this is not really an issue, and the "right" number is the number of rows in the table. But I know it is annoying. If you want, we can switch to a more precise approximation and check whether performance is still good enough. But I would do that only after we have improved the server side (the performance of the AWS ElasticSearch instance).

hmitsch commented 7 years ago

Abandoned.