Added notebook with example for spark on AWS glue

OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.

Apache License 2.0

275 stars 77 forks source link

Description

Added notebook that can be used on AWS Glue for PipelineDP with SparkRDDBackend as backend. I've included simple instructions at the beginning of the notebook with links to help set everything up on AWS.

How has this been tested?

Created job on AWS Glue and executed notebook.
Here's a screenshot of the results to show it worked:

Checklist

[x] I have followed the Contribution Guidelines and Code of Conduct
[x] I have commented my code following the OpenMined Styleguide
[x] I have labeled this PR with the relevant Type labels
[x] My changes are covered by tests

OpenMined / PipelineDP

Added notebook with example for spark on AWS glue #408

Description

How has this been tested?

Checklist