HK3-Lab-Team / pytrousse

PyTrousse collects into one toolbox a set of data wrangling procedures tailored for composing reproducible analytics pipelines.
Apache License 2.0
0 stars 1 forks source link

Added classes for synthetic test data creation #69

Closed lorenz-gorini closed 4 years ago

lorenz-gorini commented 4 years ago

Added a new framework for synthetic data creation that can be used for testing the pytrousse FeatureOperation application.

The idea is to have a tool for simulating raw data starting from clean ones. Since some ReverseFeatureOperation cannot be perfectly reversed by applying pytrousse FeatureOperation, the simulated raw data must be created along with the data that the user will get after that the related FeatureOperation is applied.

alessiamarcolini commented 4 years ago

@lorenz-gorini please be aware that the code is in both files now