monarch-initiative / dipper

Data Ingestion Pipeline for Monarch
https://dipper.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
56 stars 26 forks source link

Flybase ingest incorrectly parses allele to phenotype lines without an anatomical or GO term #910

Closed kshefchek closed 3 years ago

kshefchek commented 4 years ago

For example given the line:

FBal0038994 dendritic spine & lobular plate tangential neuron, with @FBal0220362:Scer\GAL4<up>DB331</up>@   derived_pheno_manifest  FBrf0206747 Characterization of dendritic spines in the Drosophila central nervous system.  19160442

We generate the incorrect triple

FlyBase:FBal0038994 RO:0002200 OBO:FBal0220362PHENOTYPE

This is from: https://github.com/monarch-initiative/dipper/blob/299af4e32/dipper/sources/FlyBase.py#L301

We should get rid of this entire chunk of code given #909