iter8-tools / iter8

Kubernetes release optimizer
https://iter8.tools
Apache License 2.0
254 stars 34 forks source link

Number of buckets in histogram #1525

Open Alan-Cha opened 1 year ago

Alan-Cha commented 1 year ago

Currently, the number of buckets in the histogram is an arbitrary constant. This leads to issues. As of now, there are too many buckets which means that we cannot fit all the bucket labels without covering each other.

Additionally, if we have multiple versions, then there will be less space for each bucket. The number of the buckets can be determined by the number of versions. For example, the product of the number of buckets and versions can be made to be below a predetermined threshold.

Furthermore, should the user have the ability to control the number of buckets in the histogram? Some distributions of data might be better represented with a certain number of buckets. For example, if there are too many buckets, the data may look sparse.