OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark and Apache Beam.
https://pipelinedp.io/
Apache License 2.0

Refactor analysis output metrics, part 2: per-partition utility metrics to cross-partition #409

Closed dvadym closed 1 year ago

dvadym commented 1 year ago

This refactoring introduces new output dataclasses for cross-partition utility metrics. It consists of the following parts:

  1. Introducing the new dataclasses, PR
  2. Converting the per-partition utility metric for a single partition to a cross-partition utility metric (this PR)
  3. Combining cross-partition utility metrics with new combiners in cross_partition_combiner.py (next PRs)
  4. Updating perform_utility_analysis to use the new combiners.
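
For readers unfamiliar with the pattern, steps 2 and 3 can be sketched roughly as below. This is only an illustration under assumptions: the class names, fields, and functions (`PerPartitionMetrics`, `CrossPartitionMetrics`, `to_cross_partition`, `merge`) are hypothetical and are not the dataclasses or combiners introduced in PipelineDP. The idea is that one partition's metrics are first lifted into a cross-partition value that covers exactly one partition, so that later values can be merged with a simple associative operation.

```python
from dataclasses import dataclass


@dataclass
class PerPartitionMetrics:
    """Hypothetical per-partition utility metrics (not PipelineDP's class)."""
    raw_value: float       # non-private aggregate for this partition
    error_mean: float      # expected error introduced by the DP mechanism
    error_variance: float  # variance of that error


@dataclass
class CrossPartitionMetrics:
    """Hypothetical cross-partition utility metrics (not PipelineDP's class)."""
    num_partitions: int
    total_abs_error: float
    total_error_variance: float

    def mean_abs_error(self) -> float:
        # Average absolute expected error over all covered partitions.
        return self.total_abs_error / self.num_partitions


def to_cross_partition(m: PerPartitionMetrics) -> CrossPartitionMetrics:
    """Step 2: lift one partition's metrics to a cross-partition value."""
    return CrossPartitionMetrics(
        num_partitions=1,
        total_abs_error=abs(m.error_mean),
        total_error_variance=m.error_variance,
    )


def merge(a: CrossPartitionMetrics,
          b: CrossPartitionMetrics) -> CrossPartitionMetrics:
    """Step 3: combine two cross-partition values by summing their fields."""
    return CrossPartitionMetrics(
        num_partitions=a.num_partitions + b.num_partitions,
        total_abs_error=a.total_abs_error + b.total_abs_error,
        total_error_variance=a.total_error_variance + b.total_error_variance,
    )
```

Because `merge` only sums fields, it is associative and commutative, which is what a combiner on Beam or Spark generally needs in order to aggregate partial results in any order.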

dvadym commented 1 year ago

Thanks for the review!