Open vojtechhuser opened 6 years ago
it is as convention #15 in person but not in METADATA
related issue is here https://github.com/OHDSI/Themis/issues/9
@vojtechhuser There are many reasons a person may be deleted from the CDM. How do we capture all the reasons in a standardized format within the METADATA table?
I see two sides. Documenting the size of the deletion and documenting the reasons.
If count of deleted = 0 - that is a good fact to know. It make me trust the data more. So this issue is JUST about the count. not the reason.
Ok, do you have plans to add in reason for deletion? Just curious
From my perspective: If I see a count of deleted = 0 for EHR data, it makes me trust the data less. Every EHR dataset I have seen has impossible data. Persons with birth year in the 1860's, birth dates after death dates, etc.
@vojtechhuser Do you you want to sponsor this issue? Or do you want to close this issue?
It is encouraged that when a CDM ETL deletes persons from the data for one reason or another that that information be tracked in the METADATA table. For example, if an ETL deletes persons when they are missing data, then the METADATA table should capture the count of persons deleted for this specific rule. If no persons are deleted between the raw and CDM this should also be captured in the METADATA table.
Add ACHILLES HEEL rule that checks patient loss is documented in the METADATA table.
Asking @alondhe if this is a best way to document this.