OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
https://pipelinedp.io/
Apache License 2.0
270 stars, 75 forks

Sum per partition histogram #512

Closed dvadym closed 7 months ago

dvadym commented 7 months ago

Computes a histogram of the sum per partition (i.e. of the non-DP sum per partition). The computation is similar to linf_sum_contributions_histogram: the range between the min and max sums is split into 10^4 bins, and the count, sum, and max are computed for each bin.
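
For readers who want to see the binning idea in isolation, here is a minimal pure-Python sketch. It is not the code from this PR and does not use PipelineDP's API: `HistogramBin`, `sum_per_partition_histogram`, and the `(partition_key, value)` input shape are hypothetical names chosen for illustration. Placing the maximum sum in the last bin and returning only non-empty bins are also assumptions.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, Iterable, List, Tuple


@dataclass
class HistogramBin:
    """Hypothetical container for the per-bin statistics."""
    lower: float                    # inclusive lower edge of the bin
    count: int = 0                  # number of partitions whose sum falls in the bin
    total: float = 0.0              # sum of the partition sums in the bin
    maximum: float = float("-inf")  # largest partition sum in the bin


def sum_per_partition_histogram(
        rows: Iterable[Tuple[Hashable, float]],
        num_bins: int = 10**4) -> List[HistogramBin]:
    """Bins the non-DP per-partition sums into equal-width bins over [min, max]."""
    # Step 1: non-DP sum of values for each partition.
    sums: Dict[Hashable, float] = {}
    for partition_key, value in rows:
        sums[partition_key] = sums.get(partition_key, 0.0) + value
    if not sums:
        return []

    # Step 2: split the range between the min and max sums into num_bins bins.
    lo, hi = min(sums.values()), max(sums.values())
    width = (hi - lo) / num_bins

    # Step 3: accumulate count/sum/max per bin, keeping only non-empty bins.
    bins: Dict[int, HistogramBin] = {}
    for s in sums.values():
        if width == 0:
            index = 0  # all partition sums are equal
        else:
            # Assumption: the maximum sum is placed in the last bin.
            index = min(int((s - lo) / width), num_bins - 1)
        if index not in bins:
            bins[index] = HistogramBin(lower=lo + index * width)
        b = bins[index]
        b.count += 1
        b.total += s
        b.maximum = max(b.maximum, s)
    return [bins[i] for i in sorted(bins)]
```

Called on a small input such as `[("a", 1.0), ("a", 2.0), ("b", 3.0), ("c", 10.0)]`, the sketch first reduces the rows to one sum per partition and then bins those sums, so the histogram describes partitions rather than individual contributions.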