Closed zas97 closed 1 year ago
The split100.csv file contains duplicate structures with different uniprot_id but same ec_number. Is that expected?
Do you mean the same amino acid sequence?
yes
This is not expected. All of our data is downloaded from UniProt, whose database might itself contain duplicate entries.
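For anyone who wants to check this on their own copy, here is a minimal sketch of one way to list identical sequences that appear under more than one ID. It is not the project's own tooling. The `uniprot_id` and `ec_number` names come from the question above; the `sequence` column name, the comma delimiter, and the sample rows are assumptions made for illustration, so adjust them to match the real file.

```python
import csv
import io
from collections import defaultdict


def find_duplicate_sequences(rows, seq_col="sequence", id_col="uniprot_id"):
    """Group row IDs by identical sequence; keep groups with more than one ID.

    seq_col is a hypothetical column name, not confirmed by the thread.
    """
    ids_by_seq = defaultdict(list)
    for row in rows:
        ids_by_seq[row[seq_col]].append(row[id_col])
    return {seq: ids for seq, ids in ids_by_seq.items() if len(ids) > 1}


# Tiny made-up sample standing in for split100.csv (not real UniProt data).
sample = io.StringIO(
    "uniprot_id,ec_number,sequence\n"
    "ID_A,1.1.1.1,MKTAYIAK\n"
    "ID_B,1.1.1.1,MKTAYIAK\n"
    "ID_C,2.7.11.1,MSLLQ\n"
)
duplicates = find_duplicate_sequences(csv.DictReader(sample))
for seq, ids in duplicates.items():
    print(ids, seq)
```

To run it on the actual file, pass `csv.DictReader(open(path, newline=""), delimiter=...)` instead of the sample, with the delimiter and column names set to whatever the file really uses.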