Open mmcdermott opened 3 months ago
I agree, but I would change the naming of this issue to "We should provide a MEDS synthetic data generator to help with testing"
This is not perfect by any means, but some starter code that I'm using is here: https://github.com/mmcdermott/MEDS_transforms/blob/main/tests/transform_tester_base.py in case (when we eventually get to this) it is helpful.
E.g., a simple function that takes an input directory and writes out one static MEDS dataset that is version compatible that users can see in this repo that they can use to test downstream tools.
@EthanSteinberg, would you have any objections to this?