monarch-initiative / mondo-ingest

Coordinating the mondo-ingest with external sources
https://monarch-initiative.github.io/mondo-ingest/

Create an EFO ingest #18

Open matentzn opened 2 years ago

matentzn commented 2 years ago

See https://github.com/monarch-initiative/mondo/issues/4589

matentzn commented 1 year ago

Rather than following the original plan, we should sync with EFO the same way we sync with every other source.

@zoependlington created a basic analysis of the disease branch here: https://docs.google.com/spreadsheets/d/1AS0U-EfpZXlSV6Y1CA2_Omu95N7Q1Rua6W7_biP0REU/edit#gid=525768314

It seems to me that if we

  1. Discard the MONDO ids (already mapped, naturally)
  2. Ignore the ORDO ids (already mapped, even if deprecated)

then only about 964 unmapped disease terms are left, which we should run through our alignment pipeline. There is no need to curate the missing terms, but this would at least give us another resource alignment to list for the paper, with more than 93% coverage.
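As a rough illustration of the two filtering steps above, here is a minimal Python sketch. It assumes the disease-branch terms are available as CURIE strings and that the prefixes are spelled `MONDO` and `Orphanet`/`ORDO`; the function names and the example ids are hypothetical and not part of the mondo-ingest pipeline.

```python
from typing import Iterable, List

# Assumed prefix spellings (compared case-insensitively); adjust to match the real data.
ALREADY_MAPPED_PREFIXES = {"MONDO", "ORDO", "ORPHANET"}


def prefix_of(curie: str) -> str:
    """Return the upper-cased prefix of a CURIE such as 'EFO:0000094'."""
    return curie.split(":", 1)[0].strip().upper()


def unmapped_terms(term_ids: Iterable[str]) -> List[str]:
    """Drop MONDO and ORDO ids and keep the rest, preserving input order."""
    return [t for t in term_ids if prefix_of(t) not in ALREADY_MAPPED_PREFIXES]


# Made-up example ids, for illustration only.
example_ids = ["MONDO:0000001", "Orphanet:558", "EFO:0000094", "EFO:0000095"]
remaining = unmapped_terms(example_ids)
print(len(remaining), "terms left for the alignment pipeline")
```

The list returned by `unmapped_terms` would be the set of terms to feed into the alignment pipeline; its length against the full disease-branch count gives the share still unmapped.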