Open tomsercu opened 2 years ago
http://www0.cs.ucl.ac.uk/staff/d.jones/crcnote.pdf known problem with alphafold/uniprot crc64 :
I have found ~500 sequences pairs in AF universe with the same crc64, but different only in two positions with step 8: looks like very rare event.
Best decision will be switch from crc64 to md5 in all 3D-S databases simultaneously. But I don't think we have such level of cooperation/synchronization between interested parties. crc64 - is the best choice for now.
Discussed in https://github.com/facebookresearch/esm/discussions/340