monarch-initiative / dipper

Data Ingestion Pipeline for Monarch
https://dipper.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
56 stars 26 forks source link

Orphanet #934

Closed TomConlin closed 3 years ago

TomConlin commented 4 years ago

Has updated their (xml) schema. the ingest will need to be adapted to it.

new schema: en_product6.pdf

old schema: en_product6_2018.pdf

TomConlin commented 4 years ago

Most important diff is they no longer provide Orpha identifiers for genes (which is good). It does mean we loose or must accommodate five entries without any associated genes .

They now include Cryogenic locations for genes, but I would rather we pulled locations from an primary source, say UCSC or Ensembl.

TomConlin commented 4 years ago

Have the ingest running on therir new schema need to address unit tests and resolve what to do with new features

TomConlin commented 4 years ago

937