Taha-Bahadori closed this issue 6 years ago
@Taha-Bahadori good catch.
So if I understand correctly, you are proposing the following changes
That all makes sense to me -- my primary concern regards the sizes of groups 5 (AMERICAN INDIAN: 54) and 6 (NATIVE HAWAIIAN: 18). Those are pretty small, and they might get smaller after filtering.
Regardless, I think we'll add some version of your proposal to the next release, which is coming soon.
Based on your statistics, I think we can easily merge the AMERICAN INDIAN and NATIVE HAWAIIAN categories into category 0 (which is essentially everything else). I also understand that UNKNOWN, DECLINED, and UNABLE are conceptually different from those two, but I think statistically this won't make any difference.
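A minimal sketch of what such a merge could look like, for illustration only. The function name `encode_ethnicity`, the prefix-matching approach, and the codes 1 to 4 are assumptions made for this example and are not taken from the repository or the PR; only the idea of folding the small and unknown groups into category 0 comes from the discussion above.

```python
# Hypothetical mapping: the codes 1-4 are illustrative assumptions,
# not the values used in the actual benchmark code.
ETHNICITY_CODES = {
    "ASIAN": 1,
    "BLACK": 2,
    "HISPANIC": 3,
    "WHITE": 4,
}

# Category 0 is the catch-all: small groups (AMERICAN INDIAN, NATIVE HAWAIIAN)
# and the UNKNOWN / DECLINED / UNABLE values all end up here.
OTHER = 0


def encode_ethnicity(raw):
    """Map a raw ethnicity string to a small integer code (0 = everything else)."""
    if raw is None:
        return OTHER
    label = raw.strip().upper()
    # Match on the leading token so sub-categories such as
    # "WHITE - RUSSIAN" fall into their parent group.
    for prefix, code in ETHNICITY_CODES.items():
        if label.startswith(prefix):
            return code
    return OTHER
```

Because anything not explicitly listed falls through to 0, no special cases are needed for the two small groups. The trade-off, as noted above, is that conceptually different values share one code.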
See this PR: https://github.com/YerevaNN/mimic3-benchmarks/pull/33
Will merge the PR soon now that the 1.0 release is done.
Merged!
I believe the following is a better way of processing the demographics:
Your preprocessing ignores American Indians and Native Hawaiians, and it also does not treat Caribbean Islanders, South Americans, and Middle Easterners properly.