OHDSI / Themis

Repository for OMOP CDM conventions as defined by THEMIS. These can be reference lists of concepts, pieces of standardized code for data generation or quality certification, and debates.
Apache License 2.0

Duplicate Providers? #161

Closed: ericaVoss closed this issue 2 months ago

ericaVoss commented 2 months ago

Duplicate Providers?

CDM or THEMIS convention?

THEMIS

Table or Field level?

TABLE

Is this a general convention?

PROVIDER table

Summary of issues

Summary of answer

This is the THEMIS rule: Multiple Provider Specialities #43

In the table, each provider should appear only once; duplication is not allowed. If a provider specializes in more than one area, record their primary or most frequently practiced specialty.
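As a rough illustration of the "most frequently practiced specialty" part of the rule, here is a minimal Python sketch. The source rows, the field names, and the alphabetical tie-break are all assumptions made for the example; they are not part of the THEMIS convention, and a real ETL would need its own tie-breaking rule.

```python
from collections import Counter, defaultdict

# Hypothetical source rows: (provider_source_value, specialty_source_value).
source_rows = [
    ("P001", "Cardiology"),
    ("P001", "Cardiology"),
    ("P001", "Internal Medicine"),
    ("P002", "Pediatrics"),
]


def pick_most_frequent_specialty(rows):
    """Return a dict with exactly one specialty per provider.

    The specialty recorded most often wins. Ties are broken alphabetically
    (an assumption of this sketch) so the result is deterministic.
    """
    counts = defaultdict(Counter)
    for provider, specialty in rows:
        counts[provider][specialty] += 1

    result = {}
    for provider, counter in counts.items():
        # Highest count first; alphabetical order decides ties.
        best = min(counter.items(), key=lambda kv: (-kv[1], kv[0]))
        result[provider] = best[0]
    return result


providers = pick_most_frequent_specialty(source_rows)
```

Each key of `providers` would then map to a single PROVIDER row, so no provider appears twice.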

Related links

Other comments/notes

jiawei-qian commented 2 months ago

@ericaVoss, I also noticed issue https://github.com/OHDSI/Themis/issues/43 about “Multiple Provider Specialties”. It's important to remove duplication.

I recently saw an OHDSI Forum post in which Christian explains how to choose the appropriate specialty. If a provider has multiple specialties in different areas (e.g., pediatrics and adult), try to exclude rare outliers and find the common denominator. For reference: https://forums.ohdsi.org/t/question-about-provider-table/1795/7

(From my understanding, Christian means finding the common ancestor of these specialties in the Provider domain of the ATHENA hierarchy. ETL developers sometimes lack medical background, and the source data may not show which specialty is primary or most frequent. For example, if there are only 2 source records for the same provider, each with a different specialty, it may be hard to choose between them. We can use this rule as a fallback when it is hard to choose one specialty from several, especially when they are in different areas.)
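The common-ancestor idea above could be sketched in Python as follows. This is only an illustration: the rows are shaped like the CDM's CONCEPT_ANCESTOR table (ancestor, descendant, minimum levels of separation), but the hierarchy is made up, names stand in for concept IDs, and the function name `closest_common_ancestor` is hypothetical.

```python
# Toy rows shaped like CONCEPT_ANCESTOR:
# (ancestor, descendant, min_levels_of_separation).
# The hierarchy is invented for this example; names stand in for concept IDs.
ancestor_rows = [
    ("Physician", "Physician", 0),
    ("Physician", "Cardiology", 1),
    ("Physician", "Dermatology", 1),
    ("Physician", "Pediatric Cardiology", 2),
    ("Physician", "Interventional Cardiology", 2),
    ("Cardiology", "Cardiology", 0),
    ("Cardiology", "Pediatric Cardiology", 1),
    ("Cardiology", "Interventional Cardiology", 1),
    ("Dermatology", "Dermatology", 0),
    ("Pediatric Cardiology", "Pediatric Cardiology", 0),
    ("Interventional Cardiology", "Interventional Cardiology", 0),
]


def closest_common_ancestor(specialties, rows):
    """Return the shared ancestor nearest to all given specialties.

    Returns None when the specialties have no ancestor in common.
    "Nearest" means the smallest total number of levels of separation;
    ties are broken alphabetically (an assumption of this sketch).
    """
    distance = {(anc, desc): levels for anc, desc, levels in rows}
    wanted = set(specialties)
    if not wanted:
        return None

    # Collect, for each specialty, every concept listed as its ancestor.
    ancestors_of = {
        s: {anc for (anc, desc) in distance if desc == s} for s in wanted
    }
    common = set.intersection(*ancestors_of.values())
    if not common:
        return None

    return min(
        common,
        key=lambda anc: (sum(distance[(anc, s)] for s in wanted), anc),
    )


shared = closest_common_ancestor(
    ["Pediatric Cardiology", "Interventional Cardiology"], ancestor_rows
)
```

With this toy hierarchy, two cardiology subspecialties resolve to their shared parent, while specialties from unrelated branches fall back to the top-level concept.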

Hope this helps!

clairblacketer commented 2 months ago

Thanks @ericaVoss and @jiawei-qian. Given the wording of this convention, I am going to put it in the CDM documentation, since it should apply to every CDM.