Open lewisacidic opened 8 years ago
The converter module should be refactored to offer more flexibility.
It might be nice to be able to add features after the package is generated.
This could be done by allowing a Converter to set itself up from a HDF5 file, rather than making it from scratch.
e.g. with preexisting data.h5, extra features and splits could be added as:
data.h5
conv = Converter(..., output_path='data.h5') conv.features += skchem.descriptors.MorganFingerprinter() conv.splits += pd.Series([True, False ...])
This would probably be easiest once we have unique string representations for featurizers.
The string representation of features in #35 would be helpful for this.
The converter module should be refactored to offer more flexibility.
It might be nice to be able to add features after the package is generated.
This could be done by allowing a Converter to set itself up from a HDF5 file, rather than making it from scratch.
e.g. with preexisting
data.h5
, extra features and splits could be added as:This would probably be easiest once we have unique string representations for featurizers.