bio_embeddings.utilities.exceptions.MD5ClashException: There is at least one MD5 hash clash.
This most likely indicates there are multiple identical sequences in your FASTA file.
MD5 hashes are used to remap sequence identifiers from the input FASTA.
This error exists to prevent wasting resources (computing the same embedding twice).
There's a (very) low probability of this indicating a real MD5 clash.
I think the user should be able to choose this behavior in the YAML config file.
If I have just a few redundant sequences, it is very difficult to track down and record them.
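As a possible workaround until such an option exists, here is a minimal sketch for listing and removing redundant sequences before running the pipeline. It uses only the Python standard library. The helper names (`read_fasta`, `find_duplicates`, `deduplicate`) are hypothetical and not part of bio_embeddings, and it assumes the clash comes from byte-identical sequence strings; bio_embeddings may hash sequences slightly differently internally.

```python
import hashlib
from collections import defaultdict


def read_fasta(text):
    """Yield (identifier, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def sequence_md5(sequence):
    """Return the MD5 hex digest of a sequence string (assumed hashing scheme)."""
    return hashlib.md5(sequence.encode("utf-8")).hexdigest()


def find_duplicates(records):
    """Map each MD5 digest shared by several records to their identifiers."""
    groups = defaultdict(list)
    for identifier, sequence in records:
        groups[sequence_md5(sequence)].append(identifier)
    return {digest: ids for digest, ids in groups.items() if len(ids) > 1}


def deduplicate(records):
    """Keep only the first record seen for each distinct sequence."""
    seen, unique = set(), []
    for identifier, sequence in records:
        digest = sequence_md5(sequence)
        if digest not in seen:
            seen.add(digest)
            unique.append((identifier, sequence))
    return unique


# Small made-up example: seq1 and seq2 carry the same sequence.
example = """>seq1
MKTAYIAK
>seq2
MKTAYIAK
>seq3
GSHMLEDP
"""
records = list(read_fasta(example))
duplicates = find_duplicates(records)   # identifiers grouped by shared digest
unique_records = deduplicate(records)   # first occurrence of each sequence
```

The `duplicates` mapping gives a record of which identifiers share a sequence, and `unique_records` can be written back out as a new FASTA file to pass to the pipeline.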
bio_embeddings raises an exception