Medical-Event-Data-Standard / meds

Schema definitions and Python types for the Medical Event Data Standard, a standard for medical event data such as EHR and claims data
Apache License 2.0

We should provide a MEDS synthetic data generator to help with testing #44

Open mmcdermott opened 1 month ago

mmcdermott commented 1 month ago

E.g., a simple function, visible to users in this repo, that takes an input directory and writes out one static, version-compatible MEDS dataset that they can use to test downstream tools.
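As a rough illustration of the kind of function meant here, a minimal sketch follows. Everything in it is an assumption made for illustration: the names `generate_events` and `write_synthetic_dataset`, the column names, the made-up codes, and the `data/synthetic.csv` layout are hypothetical and not taken from this repo. It writes CSV only to stay dependency-free; a real generator would presumably write files that conform to the MEDS schema for the targeted version.

```python
import csv
import random
from datetime import datetime, timedelta
from pathlib import Path

# Hypothetical column names; check the MEDS schema for the version you target.
COLUMNS = ["subject_id", "time", "code", "numeric_value"]
# Made-up codes, for illustration only.
CODES = ["HR", "TEMP", "DX//EXAMPLE"]


def generate_events(n_subjects=3, events_per_subject=4, seed=0):
    """Return a deterministic list of synthetic event rows (one dict per event)."""
    rng = random.Random(seed)  # fixed seed so the dataset is static across runs
    start = datetime(2020, 1, 1)
    rows = []
    for subject_id in range(n_subjects):
        time = start
        for _ in range(events_per_subject):
            # Always step forward so events are time-ordered within a subject.
            time += timedelta(hours=rng.randint(1, 48))
            code = rng.choice(CODES)
            # Only the measurement-like codes carry a numeric value.
            value = None if code == "DX//EXAMPLE" else round(rng.uniform(30, 120), 1)
            rows.append(
                {
                    "subject_id": subject_id,
                    "time": time.isoformat(),
                    "code": code,
                    "numeric_value": value,
                }
            )
    return rows


def write_synthetic_dataset(output_dir, **kwargs):
    """Write the synthetic events under output_dir and return the file path."""
    out_path = Path(output_dir) / "data" / "synthetic.csv"  # hypothetical layout
    out_path.parent.mkdir(parents=True, exist_ok=True)
    with out_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=COLUMNS)
        writer.writeheader()
        writer.writerows(generate_events(**kwargs))  # None is written as an empty cell
    return out_path
```

A downstream test could call `write_synthetic_dataset(tmp_dir)` once and read the result back; because the seed is fixed, every run produces the same rows.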

@EthanSteinberg, would you have any objections to this?

EthanSteinberg commented 1 month ago

I agree, but I would change the naming of this issue to "We should provide a MEDS synthetic data generator to help with testing"

mmcdermott commented 1 month ago

This is not perfect by any means, but here is some starter code I'm using, in case it is helpful when we eventually get to this: https://github.com/mmcdermott/MEDS_transforms/blob/main/tests/transform_tester_base.py