HK3-Lab-Team / pytrousse

PyTrousse collects into one toolbox a set of data wrangling procedures tailored for composing reproducible analytics pipelines.
Apache License 2.0
0 stars 1 forks source link

FeatureOperation: AnonymizeDataset #59

Open alessiamarcolini opened 4 years ago

alessiamarcolini commented 4 years ago

[3e] Takes private columns list (that will be removed from Dataset and used to compute the unique ID), path to store private info file, “private_cols_to _keep” (columns to keep both in private and public df).

Returns the anonymized Dataset and saves the private info dataset.