monarch-initiative / oncoexporter

Cancer data to GA4GH phenopacket
https://monarch-initiative.github.io/oncoexporter
MIT License
6 stars 1 forks source link

Errors in GDC age_at_diagnosis #89

Open msierk opened 5 months ago

msierk commented 5 months ago

I’ve discovered a problem with the onset data in GDC. The age_at_diagnosis is supposed to be “Age at the time of diagnosis expressed in number of days since birth.” (see here). However many entries are listed in days, e.g. here. Under diagnosis it says Age At Diagnosis is 66 days. Days to birth is -24,386 (66.8 years). There’s a bunch that are this way. I think we can put in a check that if the onset is much smaller than the age we can assume that the value is in years instead of days.