wengxyu1030 / Recode_V_training

This repository is only for training use.

Pakistan2006-forJianing #19

Open robin-wang opened 3 years ago

robin-wang commented 3 years ago

WORKFLOW DOC

A. General Procedural Steps

B. Checklist and Guidance

jianingwwww commented 3 years ago

Hi @robin-wang, sorry to bother you, but I ran into a problem when running the code that merges the dataset with the microdata. It needs to be matched with the external iso data, which I assume holds some external features of the country? I have searched the branch and the documents and data you shared with me, but the iso dta file does not seem to be there. Would you mind showing me where I can find it and what it represents? Thank you very much!

robin-wang commented 3 years ago

@imarabbit Hi Jianing, thank you for your time just now. You raised very good questions!

With regard to your points on quality checking, could you please share them in writing again? I also invite @Xian152 from the Bank's side to share insights if needed.

As for the actions needed after checking specific parts of the survey for data validity and coding consistency: when in doubt, please reach out to us for clarification or discussion.

jianingwwww commented 3 years ago

Here are the two questions:

  1. How should I understand the variables in the quality control results (i.e., value_my, value_dhs, flag_dhs, value_hefpi, flag_hefpi)? What are their formal definitions? - @robin-wang @Xian152

  2. What am I supposed to do after checking specific parts of the survey? - delve into each do file and keep in touch with you if I have any questions

Xian152 commented 3 years ago

@imarabbit Hi Jianing,

  1. The external iso data can be found in raw/Recode XX/external. Please remember to download the external file as well when setting up the raw data files locally.
  2. value_my is the weighted average of each variable, generated from the dataset. value_dhs and value_hefpi are the weighted averages generated by DHS and the 'old' HEFPI, extracted from existing data files. You can find more details, including how value_my, flag_dhs, and flag_hefpi are generated, in Quality_control.do.
  3. Theoretically, you need to check whether EACH variable is generated properly according to the definition given in 'DHS Codebook.xlsx', not just the variables that appear in the quality control results.
    In practice, since we use one set of code for all surveys from the same wave, and the code for the most common/standard situation has already been set up, you need to at least identify the survey-specific situations for EACH variable and adjust the code accordingly (such as a different missing value, a different definition of skilled provider...). The code already shows how we deal with survey-specific situations and with variables that vary across surveys (usually via if inlist(name,"survey")....). Focusing on those variables in the early stage will help you get started easily. Also, if you can identify any mistakes in the code, that would be great.
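To make the quality control columns more concrete, here is a minimal sketch in Python (not Stata) of how a weighted average and a mismatch flag could be computed. This is an illustration only: the function names `weighted_mean` and `flag_mismatch`, the `tolerance` parameter, and the rule "flag when the absolute difference exceeds the tolerance" are all assumptions, not the logic in Quality_control.do, which remains the authoritative definition.

```python
from typing import Optional, Sequence


def weighted_mean(values: Sequence[Optional[float]],
                  weights: Sequence[float]) -> Optional[float]:
    """Weighted average of `values`, skipping observations coded as missing (None).

    Hypothetical stand-in for how a column like value_my might be produced.
    """
    numerator = 0.0
    denominator = 0.0
    for value, weight in zip(values, weights):
        if value is None:
            # Missing observations contribute to neither the sum nor the weights.
            continue
        numerator += value * weight
        denominator += weight
    if denominator == 0:
        return None
    return numerator / denominator


def flag_mismatch(value_my: Optional[float],
                  value_ref: Optional[float],
                  tolerance: float = 0.01) -> Optional[int]:
    """Return 1 if the two estimates differ by more than `tolerance`, else 0.

    ASSUMPTION: the real flag_dhs / flag_hefpi rule may differ; the threshold
    here is purely illustrative. Returns None if either estimate is missing.
    """
    if value_my is None or value_ref is None:
        return None
    return int(abs(value_my - value_ref) > tolerance)


# Illustrative data: a 0/1 indicator with one missing observation.
indicator = [1, 0, 1, None]
sample_weight = [2.0, 1.0, 1.0, 5.0]

my_estimate = weighted_mean(indicator, sample_weight)
print(my_estimate)
print(flag_mismatch(my_estimate, 0.74, tolerance=0.02))
print(flag_mismatch(my_estimate, 0.70, tolerance=0.02))
```

Under this reading, a flag of 1 would point to a variable whose survey-specific coding (for example, a different missing value) deserves a closer look in the corresponding do file.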

robin-wang commented 3 years ago

@Xian152 Thank you Xian. So very thoughtful of you!

robin-wang commented 3 years ago

@imarabbit Hi Jianing, I have done a quick review of your training homework. Here are my notes:

  1. Internal checks would be most relevant as we move forward to formal assignments.

  2. Child Illness - how did you check and arrive at this list? "Check provider list being consistent with existing list (if not make changes in program) 【(?)h12a-x: chemist, homeopath, shop, hakim, Dai&Tba are NA or excluded】"

  3. Child Illness - Diarrhea Med: On this one, you asked whether the two types of meds should be considered formal. We could have a small discussion on this, but what was your observation in the survey? Notably, the notes under the table?

  4. Child Anthropometrics: Very good that you flagged hc70 and hc71 as not available, even though you might have referenced our earlier discussions. No action is needed on this homework specifically. In formal assignments, we would certainly have a chat about what to do.

  5. Reproductive Health: You did the right thing by looking at the modern methods of birth control. No need to write them all down though. Cheers.

  6. Vaccination: I commend you on identifying the extra vaccines. Have you searched the DHS survey reports for definitions of these extra vaccines?

  7. ANC

    • Your commentary on blood pressure/blood sample/urine sample was right. Exactly what we wanted noted down in writing.
    • Tetanus: in theory, I would not only note this in writing (as you did), but also look into other questions or other datasets that might be used for this particular question/variable. Let's talk about this if it is not clear enough.

Xian152 commented 3 years ago

@robin-wang Hi Robin, thank you for your well-summarized and detailed instructions. @imarabbit Hi Jianing, glad you captured most of the survey-specific cases and found some mistakes. I have left comments on your commits in the 2 pull requests you opened.