OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
https://pipelinedp.io/
Apache License 2.0
270 stars 75 forks source link

Fix setting partition selection field in utility analysis result #511

Closed dvadym closed 7 months ago

dvadym commented 7 months ago

Partition selection strategy (which is enum of TruncatedGeometric, LaplaceThresholding and GaussianThresholding) is a part of utility analysis parameters. So it should be part of the output as well. But before this PR it was set incorectly, namely configuration_index is used for finding what is partition selection for a fixed configuration of parameters. But configuration_index was not yet available in that moment. This PR fixes that by moving setting partition_selection_strategy later in the pipeline, when configuration_index is available