OHDSI / GIS

https://ohdsi.github.io/GIS
Apache License 2.0
10 stars 9 forks source link

Identify Datasets for testing SDoH and Exposome vocabularies #323

Closed kzollove closed 4 months ago

kzollove commented 9 months ago

Project Lead:

@p-talapova

Purpose:

Tasks:

kzollove commented 7 months ago
p-talapova commented 6 months ago

I would like to share the interim mapping progress on expanding and refining the GIS Vocabulary Package with respect to the selected use cases. Attached is the XLSX file with the following mappings for your review: GIS-vocab-package-v2-vocabulary-candidates.xlsx

Please review the mappings and let me know if you have any questions or considerations.

jaygee-on-github commented 4 months ago

Here are some first questions i had...

For some LOINC panels like US FSS the SDOH includes all the panel members and subsume the “same” panel members in the LOINC vocabulary in ATHENA.

This, however, isn’t the pattern you follow with the GAD-7 where you only capture concepts for the panel itself in SDOH and for its score (unless I missed something).

On the other hand the PHQ-9 is a panel that ATHENA captures in the LOINC vocabulary together with its components but neither this panel nor it score nor any of its components are captured in SDOH.

These are three different treatments of LOINC panels in SDOH. How should I think about that?

This leads to whether you think it is appropriate to follow yet another pattern in SDOH. Let’s say I am using a scale for psychosis that is not a panel in LOINC. Nor is it part of any ATHENA vocabulary — standard or non-standard. Can I put this “panel” into SDOH — not just the score and the panel itself like with the GAD-7 — but all the members and structure them in SDOH just the way LOINC panel components are structured in ATHENA? Like with "contained in panel", “has answer”, “has method”, “has system” and so forth relationships?

Additionally...

Andrew suggested in the discussion following the 7/5 presentation and I agree that some additional classification besides what is in domain_id_1 (mostly "phenotypic feature") and domain_id_2 (mostly “observation") may be needed in order to better appreciate and navigate all this breadth and depth that you have created in both the SDOH and Exposome vocabularies

Thanks so much for these great vocabs, Polina.

jaygee-on-github commented 4 months ago

Maybe the SDoH vocabulary covers several domains, some of which are location specific and some of which are more person specific.

Recall the concept of the exposome as it was originally described by Wild and elaborated by others.

It includes the “specific external exposome” and the “general external exposome”.

The "specific external exposome” is largely place-based and corresponds in large part to the “exposome" vocabulary that Polina and team has developed.

The "general external exposome” corresponds in large part to the SDoH. And the "general external exposome”, like Polina and team’s SDoH, is much more of a mixed bag.

Right now let’s recall two “views" on the external exposome. They come from different papers and they are not altogether consistent.

Details are in the text following the image

vocab1

Here “environmental exposures” map to the specific external exposome. In the National Children’s Study (smile) we measured some pesticides, some EDCs and Cosmetics in the home. We did place-based measurements of other pesticides and EDCs.

Here the “humanistic exposure” maps to the general external exposome and it is much more of a mixed bag when it comes to measurement.

For example, here is another “view” of the external exposome:

vocab2

In this view the general external exposome on the left side includes “Ecosystems” and “Ecosystems” are largely place-based. On the other hand, “Lifestyle” is person-based. And “Social” can and is measured both ways.

Recall that in the overall framework that OHDSI uses to standardize vocabularies in which Polina has hosted the two vocabularies, there are two domains: domain_id_1 is largely filled with “Phenotypic feature” and domain_id_2 is sometimes filled with “Measurement” and sometimes filled with “Observation”. Can we use one of the other or both of these domains to indicate whether a measurement in SDoH is person and/or place-based? For example we might have two rows for the same concept. In one row the measurement occurs at the person-level. In another row the same concept is measured by place. On another hand, “psychological and mental stress” are SDoHs we measure at the person-level.

It is argued that this sort of complexity can be used downstream. It creates a "multilayer network-based framework” capable of unveiling the role of exposures and their combinations on our health. Arguably, this is what neural networks were invented for.