clulab / bioresources

Data resources from the biomedical domain
Apache License 2.0

Merge HGNC synonyms into UniProt resource file #56

Closed: bgyori closed this 3 years ago

bgyori commented 3 years ago

This PR integrates human gene/protein synonyms from HGNC into the UniProt resource file (uniprot-proteins.tsv.gz) so that they are treated as "equal". This fixes #55.

Specific changes are:

This also requires changes in Reach, which I am working on and will submit as a PR there.
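For illustration only, the kind of merge described above could be sketched as follows. This is not the code in this PR: the two-column (name, UniProt ID) row layout, the function names `merge_synonyms`, `write_tsv_gz`, and `read_tsv_gz`, and the sample identifiers are all hypothetical, and the real layout of uniprot-proteins.tsv.gz may differ.

```python
import csv
import gzip
import io


def merge_synonyms(uniprot_rows, hgnc_synonyms):
    """Return uniprot_rows plus any HGNC synonyms not already present.

    uniprot_rows: list of (name, uniprot_id) tuples (assumed layout).
    hgnc_synonyms: dict mapping uniprot_id -> list of synonym strings.
    """
    seen = set(uniprot_rows)
    merged = list(uniprot_rows)
    # Sort by ID so the appended rows come out in a deterministic order.
    for uniprot_id, synonyms in sorted(hgnc_synonyms.items()):
        for synonym in synonyms:
            row = (synonym, uniprot_id)
            if row not in seen:  # skip synonyms UniProt already lists
                seen.add(row)
                merged.append(row)
    return merged


def write_tsv_gz(rows):
    """Serialize rows as gzip-compressed TSV and return the bytes."""
    buffer = io.BytesIO()
    with gzip.open(buffer, "wt", encoding="utf-8", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        writer.writerows(rows)
    return buffer.getvalue()


def read_tsv_gz(data):
    """Parse gzip-compressed TSV bytes back into a list of tuples."""
    with gzip.open(io.BytesIO(data), "rt", encoding="utf-8", newline="") as handle:
        return [tuple(row) for row in csv.reader(handle, delimiter="\t")]


# Placeholder data, not real gene names or UniProt accessions.
rows = [("GeneA", "P00001"), ("GeneB", "P00002")]
hgnc = {"P00001": ["GeneA", "GA1"], "P00002": ["GB-alt"]}
merged = merge_synonyms(rows, hgnc)
```

In this sketch, existing rows keep their order and only new (synonym, ID) pairs are appended, so an HGNC synonym that UniProt already lists does not produce a duplicate row.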

MihaiSurdeanu commented 3 years ago

LGTM!

@kwalcock or @enoriega : if tests pass, can you please merge?