OpenMined / PipelineDP

PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
https://pipelinedp.io/
Apache License 2.0
270 stars 75 forks source link

DataFrame API: QueryBuilder #493

Closed dvadym closed 9 months ago

dvadym commented 9 months ago

This PR implements QueryBuilder class, which implements builder pattern for DataFrame queries. Query will run DP aggregation (it will be implemented in the following PR). QueryBuilder and Query are agnostic to the type of DataFrames (Pandas, Spark, Beam).