Biomarker data import format

rbroth commented 3 years ago

We want to agree on a format for importing biomarker data into the system. It should be csv

rbroth commented 3 years ago

@fannymawbey , questions about the dataset:

what is the name of this survey? "Malawi DHS survey"?
Do you have information about the household that these people live in? We assume that every person surveyed lives in ahousehold and that sometimes multiple people live in the same household.
Has the data been adjusted for altitude and smoking?
MALARIA_TEST_RESULT - does this mean that the person has had malaria in the last two weeks?
If a person has had malaria in the last two weeks, does that also count as the person having been ill in the last two weeks
Can we treat "serum" and "plasma" as synonyms? For example, we store "serum folate" in our system, but the dataset lists "plasma folate"

fannymawbey commented 3 years ago

Malawi MNS 2016
yes sometimes they live in the same household. We have variables that allow us to know that. Do you need that?
Hemoglobin has been adjusted for altitude and smoking for WRA but we don't have the raw data.

Le jeu. 29 avr. 2021 à 10:19, Roman @.***> a écrit :

@fannymawbey https://github.com/fannymawbey , questions about the dataset:

what is the name? "Malawi DHS survey"?

Do you have information about the household that these people live in? We assume that every person surveyed lives in ahousehold and that sometimes multiple people live in the same household.

Has the data been adjusted for altitude and smoking?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/micronutrientsupport/database-architecture/issues/142#issuecomment-829075437, or unsubscribe https://github.com/notifications/unsubscribe-auth/AO7ZVATHRWOLGNMNGUF5QFDTLEQAHANCNFSM43VDYCGA .

fannymawbey commented 3 years ago

It's the result of malaria testing with a Rapid Test Kit. It measures antigens in the blood at the time of the test, but doesn't tell you when the person got infected.

There is another question that is self reported - 'Did you have malaria in the last 2 weeks?', but this is different and we try to avoid using this one if we actually have a test result.

Le jeu. 29 avr. 2021 à 10:35, Roman @.***> a écrit :

MALARIA_TEST_RESULT - does this mean that the person has had malaria in the last two weeks?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/micronutrientsupport/database-architecture/issues/142#issuecomment-829086037, or unsubscribe https://github.com/notifications/unsubscribe-auth/AO7ZVAXCLY442XMAJD4GJBLTLER6LANCNFSM43VDYCGA .

fannymawbey commented 3 years ago

No, we have to be specific. Sometimes they will collect serum, sometimes plasma, and we need to use the name of the one used by the survey. I think the idea that we had, as shown in the 'threshold' file, is that you will have a micronutrient 'folate' and then a matrix 'plasma, serum, red blood cell, whole blood, urine, breastmilk'.

Le jeu. 29 avr. 2021 à 10:46, Roman @.***> a écrit :

Can we treat "serum" and "plasma" as synonyms? For example, we store "serum folate" in our system, but the dataset lists "plasma folate"

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/micronutrientsupport/database-architecture/issues/142#issuecomment-829092931, or unsubscribe https://github.com/notifications/unsubscribe-auth/AO7ZVAWLHTFVMRSE7L5GIETTLETHJANCNFSM43VDYCGA .

rbroth commented 3 years ago

yes sometimes they live in the same household. We have variables that allow us to know that. Do you need that?

Yes, our data model assumes that we'll know what household a person belongs to. @spenny-liam is working on providing you with a template csv for importing data; that will have all the columns we want