orchid-initiative / synthetic-database-project

Fix Incomplete HCAI Fields #63

Closed NickKramer87 closed 2 months ago

NickKramer87 commented 10 months ago

As a data analyst, I need the synthetic database to contain all relevant fields so that I can be confident the analysis software I write will also work on the real-world data.

Proposed Subtasks:

  1. Identify all missing fields and determine which are necessary for analysis.
  2. Add all additional necessary fields.
  3. Create a separate Task that clearly enumerates the fabricated fields, truncated fields, and not-fully-mapped fields for later consideration during analysis.

Acceptance Criteria:

  1. A list of all relevant fields with checkboxes. Fields that are currently output correctly will be checked.
  2. Every item on the checklist is checked off.
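
One possible way to generate the checklist described above is sketched below. It assumes the synthetic output is available as CSV text and treats a field as "correctly outputted" only in the weak sense that the column exists and has at least one non-blank value. The function name `field_checklist` and the sample field names are hypothetical, not taken from the project or from the HCAI specification.

```python
import csv
import io


def field_checklist(csv_text, expected_fields):
    """Build markdown checklist lines for the expected fields.

    A field is checked only if the column exists in the CSV and at least
    one row holds a non-blank value for it (an assumed, simplified notion
    of "correctly outputted").
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    populated = set()
    for row in reader:
        for name, value in row.items():
            # DictReader uses a None key for surplus cells and a None value
            # for cells missing from short rows; skip both cases.
            if name is not None and value is not None and value.strip():
                populated.add(name)
    return [
        "- [{}] {}".format("x" if field in populated else " ", field)
        for field in expected_fields
    ]


# Hypothetical sample data: ecm_1 is present but fully blank,
# discharge_date is missing from the file altogether.
sample = "patient_id,admit_date,ecm_1\n1,2023-01-05,\n2,2023-02-11,\n"
expected = ["patient_id", "admit_date", "ecm_1", "discharge_date"]

for line in field_checklist(sample, expected):
    print(line)
```

The printed lines can be pasted into an issue comment as a task list. A fuller check would also need to validate formats and value ranges, which this sketch does not attempt.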

Field Notes (running)

Remaining Critical fields with mistakes are:

Remaining fully missing fields are:

TravisHaussler commented 10 months ago

This is a bit open-ended, sort of a catch-all project. However, there are only a few remaining fully blank fields, so let's leave this open. Once we add the last few (like external causes of morbidity), we will create a new task to track the fields that need additional work or have important caveats for analysis.

TravisHaussler commented 10 months ago

Remaining fully missing fields are:

rileeki commented 10 months ago

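@TravisHaussler and @NickKramer87 - Re: "external causes of morbidity", the analytics team does not regularly use these fields. Also, Medicare's synthetic files leave these fields blank (source: https://data.cms.gov/sites/default/files/2023-05/d51e1218-68c3-4c7c-9598-0b81f22fe903/User%20Guide%20-%20CMS%20Synthetic%20RIF%20Files%20May%202023_AM508_v2.pdf, pg 29). I think we're good to leave these blank and consider that done.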

TravisHaussler commented 10 months ago

Excellent!

beckyphan commented 2 months ago

Per the thread above, the remaining issues are resolved and/or captured in new issues.