Is your feature request related to a problem? Please describe.
Eliminate all information related to the training set so that models trained with proprietary data can be released
Additional context
We are working on this in the dev branch. The latest commit ensures the anonymization process works when run at the end of model training, and new molecules can be predicted, both test sets (with associated data, hence, reports are created) as well as de novo molecules.
We only need to add a flag for the cli to run the anonymization at the end of the training. Currently, I simply run
from zairachem.finish.finisher import Anonymizer
path = "path_to_model"
an = Anonymizer(path=path)
an.run()
Dev will be merged once the changes are incorporated
Is your feature request related to a problem? Please describe. Eliminate all information related to the training set so that models trained with proprietary data can be released
Additional context We are working on this in the
dev
branch. The latest commit ensures the anonymization process works when run at the end of model training, and new molecules can be predicted, both test sets (with associated data, hence, reports are created) as well as de novo molecules.We only need to add a flag for the cli to run the anonymization at the end of the training. Currently, I simply run
Dev will be merged once the changes are incorporated