Closed ViktorKaz closed 6 years ago
This is now implemented. The datasets are saved in raw format in the HDF5 database. Data pre-processing is performed by the orchestrator class before training based on metadata attached to the estimator class.
Discussion point: Should the metadata about the pre-processing be attached to the estimator, the dataset or both?
Data pre-processing and transformation should be moved to the estimator module and performed before training. Currently this is done when the datasets are saved in the HDF5 database.