NCATSTranslator / Text-Mining-Provider-Roadmap

Roadmap and issue tracking for the NCATS Translator Text Mining Provider
MIT License

Release new KG that uses human UniProt identifiers for nodes instead of species non-specific Protein Ontology identifiers #88

Open bill-baumgartner opened 3 years ago

bill-baumgartner commented 3 years ago

The aim of this proposal is to improve integration of the Text Mining Provider's KGs with the rest of the Translator ecosystem, for the reasons described in https://github.com/NCATSTranslator/Text-Mining-Provider-Roadmap/issues/81. This is not an ideal solution, because the gene/protein mentions identified in text are not necessarily human. For the near term, however, it appears to be acceptable, as discussed during the mini-hackathon on 6/17 and noted in https://github.com/NCATSTranslator/minihackathons/issues/14.

bill-baumgartner commented 3 years ago

Open question regarding protein/gene conflation: Should this KG also include HGNC identifiers so that genes can be linked in the text-mined assertions as well? Or should there be a separate KG that uses HGNC identifiers for its nodes?