OHDSI / dbt-synthea

[Under development] A dbt ETL project to convert a Synthea synthetic data set into the OMOP CDM
https://ohdsi.github.io/dbt-synthea/
Apache License 2.0
16 stars 6 forks source link

Location table is orphaned #82

Closed katy-sadowski closed 1 month ago

katy-sadowski commented 1 month ago

It doesn't connect to any other tables right now, but it should actually have FKs in PERSON and PROVIDER. We should figure out how to link back each person/provider to their location in this table. As part of this, locations should also be de-duped.

lawrenceadams commented 1 month ago

Scoping

Effects the care_site and person table: at present all locations data comes from seeds/synthea/patients.csv and trickles into location from stg_synthea__patients. An interesting oversight from the original SyntheaOMOP project as this violates the CDM and doesn't even bother to dedup. Nice catch @katy-sadowski !

Given Synthea doesn't normalise locations at all I'd imagine we will have to hoover up all locations synthea generates for patients and care sites and then use the various aspects of the address as a natural key to join onto