This PR integrates human gene/protein synonyms from HGNC into the UniProt resource file (uniprot-proteins.tsv.gz) so that they are treated as "equal". This fixes #55.
Specific changes are:
Migrate code from update_hgnc_genes.py into update_uniprot_proteins.py
Remove hgnc from ner_kb.config
Remove kb/hgnc.tsv.gz
Update extended uniprot-proteins.tsv.gz and ner/Gene_or_gene_product.tsv.gz
This requires changes in Reach as well which I am working on and will PR there.
This PR integrates human gene/protein synonyms from HGNC into the UniProt resource file (
uniprot-proteins.tsv.gz
) so that they are treated as "equal". This fixes #55.Specific changes are:
This requires changes in Reach as well which I am working on and will PR there.