skuschel / generatorpipeline

Parallelize your data-processing pipelines with just a decorator.
GNU General Public License v3.0

Better histograms with equiprobable cells #19

Closed r-radloff closed 1 year ago

r-radloff commented 1 year ago

When choosing the width of histogram cells, equiprobable cells are preferable to equal-size cells because they give a better representation of the underlying distribution (see paragraph 6 of the paper by Jain and Chlamtac).

A histogram with $b$ equiprobable cells makes use of $(b - 1)$ quantiles, namely those at probabilities $1/b, 2/b, \dots, (b - 1)/b$. We could estimate the needed quantiles using the $P^2$ algorithm (Issue #18).
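
To illustrate the idea, here is a minimal sketch that builds equiprobable cell edges from the $(b - 1)$ interior quantiles. It is not the $P^2$ algorithm: it assumes all samples are available in memory and uses NumPy's batch `np.quantile` as a stand-in for a streaming estimator. The function name `equiprobable_edges` is hypothetical and not part of generatorpipeline.

```python
import numpy as np


def equiprobable_edges(samples, b):
    """Return b + 1 cell edges so that each cell holds roughly 1/b of the samples.

    Hypothetical helper for illustration; uses exact batch quantiles,
    not a streaming estimate such as P^2.
    """
    samples = np.asarray(samples, dtype=float)
    # Probabilities of the b - 1 interior quantiles: 1/b, 2/b, ..., (b - 1)/b.
    probs = np.arange(1, b) / b
    interior = np.quantile(samples, probs)
    # The outermost edges are the observed minimum and maximum.
    return np.concatenate(([samples.min()], interior, [samples.max()]))


# Example with a skewed distribution, where equal-size cells would leave
# most of the right-hand cells nearly empty.
rng = np.random.default_rng(0)
data = rng.exponential(size=10_000)

edges = equiprobable_edges(data, b=5)
counts, _ = np.histogram(data, bins=edges)
```

With these edges the cells are narrow where the data is dense and wide in the tail, and `counts` comes out nearly uniform across the five cells. A streaming variant would replace the `np.quantile` call with one running quantile estimate per interior probability.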

skuschel commented 1 year ago

Use the CDFEstimator class, which was added in PR #30.