orchid-initiative / synthetic-database-project

MIT License
4 stars 2 forks source link

Automatic Summary Generation #30

Open rileeki opened 1 year ago

rileeki commented 1 year ago

User Story As a data analyst, I want the database generation tool to automatically generate summary statistics for the patient files it creates so that I can easily compare them to existing summary files of real data.

Note: the principal diagnosis group and principal procedure group tables can be left blank for this particular user story. Identifying the diagnosis group and procedure group based on CCSR (Clinical Classifications Software Refined) will be a separate, large task. For more information: https://www.hcup-us.ahrq.gov/toolssoftware/ccsr/ccs_refined.jsp

Acceptance Criteria

  1. The program generates an Excel summary sheet that is similar, if not identical, to the Hospital Inpatient - Characteristics by Facility (Pivot Profile) summary generated annually by HCAI, leaving only the principal diagnosis group and principal procedure group tables blank.
  2. The categorizations used in each summary table should match the specifications on the "INSTRUCTIONS AND FOOTNOTES" tab of the HCAI summary workbook (referenced in acceptance criterion 1)
  3. The summary sheet is successfully read in by the same program (not yet in existence) that reads real-world patient data summaries, as it is functionally identical.
  4. A pull request has been submitted, reviewed, merged, and closed.