OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
https://pipelinedp.io/
Apache License 2.0
270 stars 75 forks source link

Compute public partition summary #465

Closed dvadym closed 1 year ago

dvadym commented 1 year ago

Public partition summary, namely:

class PublicPartitionsSummary:
    num_dataset_public_partitions: int  # in dataset and public partitions
    num_dataset_non_public_partitions: int # in dataset and not in public partitions
    num_empty_public_partitions: int # not in dataset and in public partitions