PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
Added a notebook that runs PipelineDP on AWS Glue with SparkRDDBackend as the backend.
I've included brief setup instructions at the beginning of the notebook, with links to help set everything up on AWS.
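For reviewers unfamiliar with the Spark backend, here is a minimal sketch of how PipelineDP is typically wired to `SparkRDDBackend`. It is not the notebook's code: the row layout `(user_id, day, amount)`, the helper names, and the contribution bounds are all hypothetical, and the constructor call assumes a PipelineDP version in which `SparkRDDBackend` takes a `SparkContext`.

```python
# Hypothetical row layout for illustration: (user_id, day, amount).

def privacy_id_of(row):
    """Privacy unit: the user whose contributions are bounded."""
    return row[0]


def partition_of(row):
    """Partition key: the day the record belongs to."""
    return row[1]


def value_of(row):
    """Value aggregated by SUM."""
    return row[2]


def dp_count_and_sum(sc, rows, total_epsilon=1.0, total_delta=1e-6):
    """Compute DP COUNT and SUM per day on a Spark RDD (illustrative bounds)."""
    # Imported lazily so the extractors above can be used without PipelineDP.
    import pipeline_dp

    # Assumption: this PipelineDP version's SparkRDDBackend accepts a SparkContext.
    backend = pipeline_dp.SparkRDDBackend(sc)
    accountant = pipeline_dp.NaiveBudgetAccountant(
        total_epsilon=total_epsilon, total_delta=total_delta)
    engine = pipeline_dp.DPEngine(accountant, backend)

    params = pipeline_dp.AggregateParams(
        noise_kind=pipeline_dp.NoiseKind.LAPLACE,
        metrics=[pipeline_dp.Metrics.COUNT, pipeline_dp.Metrics.SUM],
        max_partitions_contributed=3,       # hypothetical bound
        max_contributions_per_partition=2,  # hypothetical bound
        min_value=0,
        max_value=60,
    )
    extractors = pipeline_dp.DataExtractors(
        privacy_id_extractor=privacy_id_of,
        partition_extractor=partition_of,
        value_extractor=value_of,
    )

    result = engine.aggregate(sc.parallelize(rows), params, extractors)
    # Budgets must be computed before the lazy result is materialized.
    accountant.compute_budgets()
    return result.collect()
```

In a Glue notebook session the context would usually come from `SparkContext.getOrCreate()`, and `dp_count_and_sum(sc, rows)` would then return the noisy per-day metrics for partitions that survive private partition selection.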
How has this been tested?
Created a job on AWS Glue and executed the notebook.
Here's a screenshot of the results to show it worked: