Closed konstin closed 4 years ago
I've added HDF5 support, which allows embedding larger-than-RAM datasets, and made the get_embeddings
function a generator so I can do some postprocessing in my own code.
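For readers skimming the thread, here is a rough sketch of what streaming from a generator into HDF5 can look like. It is not the code from this PR: the `get_embeddings` signature, the `write_embeddings` helper, the random placeholder values, and the one-dataset-per-protein layout are all assumptions made for illustration, using `h5py` and NumPy.

```python
import os
import tempfile

import h5py
import numpy as np


def get_embeddings(sequences, dim=8):
    """Hypothetical stand-in: yield (identifier, per-residue embedding) lazily."""
    rng = np.random.default_rng(0)
    for identifier, sequence in sequences.items():
        # A real embedder would run a model here; random values are placeholders.
        yield identifier, rng.random((len(sequence), dim), dtype=np.float32)


def write_embeddings(sequences, path):
    """Stream embeddings to disk so only one protein is held in memory at a time."""
    with h5py.File(path, "w") as handle:
        for identifier, embedding in get_embeddings(sequences):
            # One dataset per protein, so differing sequence lengths are fine.
            handle.create_dataset(identifier, data=embedding)


sequences = {"protein_a": "MKTAYIAK", "protein_b": "MSE"}
path = os.path.join(tempfile.mkdtemp(), "embeddings.h5")
write_embeddings(sequences, path)

# Read back only the shapes to confirm what was stored.
with h5py.File(path, "r") as handle:
    shapes = {name: handle[name].shape for name in handle}
```

Because the generator yields one protein at a time, the caller can also postprocess each embedding (for example, reduce it per protein) before writing it.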
Again: it would be good to have this in https://github.com/sacdallago/bio_embeddings rather than here.
@konstin
If you don't sum up per protein, using a NumPy array will fail because the proteins have different lengths.
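A small illustration of the point, assuming per-residue embeddings of shape (sequence_length, embedding_dim); the array names and sizes are made up for the example:

```python
import numpy as np

# Hypothetical per-residue embeddings for two proteins of different lengths.
short = np.ones((3, 4))  # 3 residues, 4 embedding dimensions
long = np.ones((5, 4))   # 5 residues, 4 embedding dimensions

# Stacking the raw per-residue arrays fails because their shapes differ.
try:
    np.stack([short, long])
    stacked_ok = True
except ValueError:
    stacked_ok = False

# Summing over the residue axis gives one fixed-size vector per protein,
# and those vectors stack into a regular (n_proteins, embedding_dim) array.
per_protein = np.stack([short.sum(axis=0), long.sum(axis=0)])
```

Storing each protein as its own HDF5 dataset is another way around the ragged shapes if the per-residue values need to be kept.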