Medical-Event-Data-Standard / meds

Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data
Apache License 2.0
38 stars 3 forks source link

We should provide a MEDS synthetic data generator to help with testing #44

Open mmcdermott opened 3 months ago

mmcdermott commented 3 months ago

E.g., a simple function that takes an input directory and writes out one static MEDS dataset that is version compatible that users can see in this repo that they can use to test downstream tools.

@EthanSteinberg, would you have any objections to this?

EthanSteinberg commented 3 months ago

I agree, but I would change the naming of this issue to "We should provide a MEDS synthetic data generator to help with testing"

mmcdermott commented 3 months ago

This is not perfect by any means, but some starter code that I'm using is here: https://github.com/mmcdermott/MEDS_transforms/blob/main/tests/transform_tester_base.py in case (when we eventually get to this) it is helpful.