orchid-initiative / synthetic-database-project

MIT License

Synthetic Hospital Discharge Data Project (synthetic-database-project)

Summary

Our project leverages Synthea™, an open-source tool developed by the MITRE Corporation, to create synthetic hospital discharge data. Synthea™ uses research-based models to generate rich medical histories for synthetic patients. We extract the hospital visits and create datasets that match the format of administrative data available to healthcare organizations. This synthetic data allows students and researchers to explore patient records without privacy concerns and develop analyses for hospitals to run on their own real data. Our goal is to make it easier for hospitals, public health officials, and researchers to collaborate and gain insights from administrative hospital data, while keeping patient information private.
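The extraction step described above could look roughly like the following sketch. It is an illustration only, not this project's actual pipeline. It assumes encounter records in the style of Synthea's CSV export (columns such as `PATIENT`, `START`, `STOP`, `ENCOUNTERCLASS`, and `DESCRIPTION`), and the function name, the sample rows, and the output field names are all hypothetical. The output does not follow any official submission format.

```python
import csv
import io

# Made-up sample rows; the column names are assumed to follow
# Synthea's CSV encounter export and may differ in your files.
SAMPLE_ENCOUNTERS = """Id,START,STOP,PATIENT,ENCOUNTERCLASS,DESCRIPTION
e1,2020-01-03T08:00:00Z,2020-01-06T10:30:00Z,p1,inpatient,Hospital admission
e2,2020-02-11T09:15:00Z,2020-02-11T09:45:00Z,p1,ambulatory,Check-up visit
e3,2020-03-20T22:10:00Z,2020-03-21T01:05:00Z,p2,emergency,Emergency room admission
"""


def extract_inpatient_visits(encounter_rows):
    """Keep inpatient encounters and reshape them into discharge-style records.

    The output field names are hypothetical placeholders.
    """
    visits = []
    for row in encounter_rows:
        if row["ENCOUNTERCLASS"] != "inpatient":
            continue
        visits.append(
            {
                "patient_id": row["PATIENT"],
                # Keep only the date part of the ISO 8601 timestamp.
                "admission_date": row["START"][:10],
                "discharge_date": row["STOP"][:10],
                "description": row["DESCRIPTION"],
            }
        )
    return visits


encounters = list(csv.DictReader(io.StringIO(SAMPLE_ENCOUNTERS)))
visits = extract_inpatient_visits(encounters)
for visit in visits:
    print(visit)
```

With real generator output, the in-memory sample would be replaced by opening the exported encounters file, and the reshaping step would map fields to whatever layout the target administrative format requires.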

Status

Version 1

Version 2

Version 3

Sample Data

Explore our synthetic hospital discharge data with our downloadable dataset. This dataset contains synthetic patient records in the format that California hospitals use to submit abstracted patient records to the California Department of Health Care Access and Information (HCAI).

Summary Statistic Workbook

The Summary Statistic Workbook provides an overview of the synthetic hospital discharge data generated by our project. It includes aggregate statistics on patient demographics and fill rates for each field. The workbook will be available for download in Excel format. (Coming soon!)
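As a rough illustration of what a per-field fill rate means, the sketch below computes the share of records with a non-blank value in each column. It is a minimal example under stated assumptions, not the workbook's actual method: the function name `fill_rates`, the sample rows, and the column names are all hypothetical.

```python
import csv
import io

# Made-up sample; column names are hypothetical and do not
# represent the real dataset layout.
SAMPLE = """patient_id,sex,admission_date,principal_diagnosis
1,F,2020-01-03,J18.9
2,,2020-02-11,
3,M,2020-02-15,I10
4,M,,E11.9
"""


def fill_rates(rows, fields):
    """Return, for each field, the fraction of rows with a non-blank value."""
    total = len(rows)
    rates = {}
    for field in fields:
        filled = sum(1 for row in rows if (row.get(field) or "").strip())
        rates[field] = filled / total if total else 0.0
    return rates


reader = csv.DictReader(io.StringIO(SAMPLE))
rows = list(reader)
rates = fill_rates(rows, reader.fieldnames)
for field, rate in rates.items():
    print(f"{field}: {rate:.0%}")
```

Here a value counts as filled if it contains anything other than whitespace. A real analysis might also need to treat placeholder codes for missing values as unfilled.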

General Information

Additional Resources