Updated the AlexNet tutorial with the new code for dataset handling.
Thoughts on data handling for the upgrade:
Two sources of data need to be handled: (a) the target model's train and test data, and (b) the population/aux data for the attack.
The target model's data can be a subset of the population data, or only partly overlap with it.
So we need some way of removing the overlapping data from the population data. The current approach hashes the string version of each train data point and removes any population data point with a matching hash, but it is buggy.
A better way would be to let the user specify the overlapping data point indices, if there are any.
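A minimal sketch of the index-based approach, assuming the population data is an indexable sequence and the user passes the positions of the overlapping points. The function name remove_overlap and its parameters are hypothetical and are not part of the existing code.

```python
def remove_overlap(population, overlap_indices):
    """Return the population data without the user-specified points.

    population: sequence of data points (a plain list in this sketch).
    overlap_indices: positions in `population` that also occur in the
        target model's data (hypothetical parameter name).
    """
    overlap = set(overlap_indices)
    # Reject indices that do not point into the population data.
    bad = sorted(i for i in overlap if not 0 <= i < len(population))
    if bad:
        raise IndexError(f"overlap indices out of range: {bad}")
    # Keep every point whose position was not marked as overlapping.
    return [x for i, x in enumerate(population) if i not in overlap]


population = ["p0", "p1", "p2", "p3", "p4"]
cleaned = remove_overlap(population, [1, 3])
print(cleaned)  # ['p0', 'p2', 'p4']
```

Passing an empty list leaves the population data unchanged, which covers the case where there is no overlap.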
Both TargetDataset and PopulationDataset can inherit from a parent Dataset class, which will have the actual tf-datasets code for creating and using a dataset.
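A rough sketch of that class layout, under stated assumptions: plain Python lists stand in for the tf-datasets code that the parent class would actually hold, and the constructor arguments and the subset and without methods are hypothetical names chosen for illustration only.

```python
class Dataset:
    """Parent class; the real version would wrap the tf-datasets code."""

    def __init__(self, features, labels):
        if len(features) != len(labels):
            raise ValueError("features and labels must have the same length")
        self.features = list(features)
        self.labels = list(labels)

    def __len__(self):
        return len(self.features)

    def subset(self, indices):
        """Return a new dataset of the same class restricted to `indices`."""
        return type(self)(
            [self.features[i] for i in indices],
            [self.labels[i] for i in indices],
        )


class TargetDataset(Dataset):
    """Train or test data of the target model."""


class PopulationDataset(Dataset):
    """Population/aux data for the attack."""

    def without(self, overlap_indices):
        """Drop the points the user marked as overlapping with the target data."""
        overlap = set(overlap_indices)
        keep = [i for i in range(len(self)) if i not in overlap]
        return self.subset(keep)


# Toy usage: the target train data is the first half of the population data.
population = PopulationDataset([[0.0], [1.0], [2.0], [3.0]], [0, 1, 0, 1])
target_train = TargetDataset(population.features[:2], population.labels[:2])
attack_pool = population.without([0, 1])
print(len(target_train), len(attack_pool))  # 2 2
```

Keeping the storage and slicing logic in the parent class means the two subclasses only differ in the operations specific to their role, such as removing overlap from the population data.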
Example workflow: