rejuve-bio / biocypher-kg

MIT License

Error in aux_files/string_ensembl_uniprot_map.pkl #4

Open sauloapp opened 4 days ago

sauloapp commented 4 days ago

The string_ensembl_uniprot_map.pkl dict maps Ensembl IDs to UniProt record names, but it should map them to UniProt accessions instead. Thanks!

Habush commented 4 days ago

Hi Saulo,

The reason we're using Ensembl IDs as keys is that STRING uses Ensembl IDs for proteins. Please check the files in the download section.

sauloapp commented 4 days ago

Hmm... maybe I was not clear enough. The string_ensembl_uniprot_map.pkl maps, for instance, ENSP00000000233 to ARF5. ARF5 is the name of the UniProt record, but it should map ENSP00000000233 to P84085, which is the UniProt accession.
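
To make the difference concrete, here is a rough sketch of how the values in the pickle could be checked. It assumes the file unpickles to a plain dict with string values, which this thread implies but does not confirm. The regular expression is the accession pattern published in the UniProt documentation; the helper names `load_map` and `non_accession_entries` are hypothetical and not part of the repository.

```python
import pickle
import re

# UniProt accession pattern, as given in the UniProt documentation.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def load_map(path):
    """Load the pickled mapping (assumed to be a plain dict)."""
    with open(path, "rb") as handle:
        return pickle.load(handle)


def non_accession_entries(mapping):
    """Return the entries whose value does not look like a UniProt accession."""
    return {
        key: value
        for key, value in mapping.items()
        if not (isinstance(value, str) and ACCESSION_RE.fullmatch(value))
    }


# The two cases from this issue: the current value and the expected one.
current = {"ENSP00000000233": "ARF5"}
expected = {"ENSP00000000233": "P84085"}

print(non_accession_entries(current))   # the ARF5 entry is flagged
print(non_accession_entries(expected))  # nothing is flagged
```

Running `non_accession_entries(load_map("aux_files/string_ensembl_uniprot_map.pkl"))` against the real file should then list every Ensembl ID that still maps to a name rather than an accession, provided the dict layout assumed above holds.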

Habush commented 4 days ago

I see. In that case, it should be fixed. @dawit-melka Please look at this.