populationgenomics / automated-interpretation-pipeline

Rare Disease variant prioritisation MVP
MIT License

Consider a formal schema #347

Closed MattWellie closed 4 months ago

MattWellie commented 5 months ago

The data handling in this application is currently hectic: it's a lot of dictionaries with a lot of post-processing, and the only fixed content is a couple of dataclasses which are still evolving.

Consider a formally spec'd model to underpin the core object concepts, e.g. the representation of a variant, a MOI-confirmation event, or a previously seen variant.
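
As a rough illustration of what such a model could look like, here is a minimal sketch using only standard-library dataclasses. Every name in it (`Variant`, `MoiConfirmation`, `MOI`, and all field names) is a hypothetical placeholder, not the pipeline's actual data model; the point is only that validation lives in one place instead of in dictionary post-processing.

```python
from dataclasses import asdict, dataclass
from enum import Enum
from typing import Tuple


class MOI(str, Enum):
    """Hypothetical subset of modes of inheritance (illustrative only)."""

    AUTOSOMAL_DOMINANT = "autosomal_dominant"
    AUTOSOMAL_RECESSIVE = "autosomal_recessive"
    X_LINKED = "x_linked"


@dataclass(frozen=True)
class Variant:
    """Assumed minimal representation of a variant."""

    chrom: str
    pos: int
    ref: str
    alt: str

    def __post_init__(self) -> None:
        # Validate once at construction, rather than in ad hoc post-processing.
        if self.pos < 1:
            raise ValueError(f"pos must be >= 1, got {self.pos}")
        if not self.ref or not self.alt:
            raise ValueError("ref and alt must be non-empty")

    @property
    def key(self) -> str:
        # A stable identifier, e.g. for looking up previously seen variants.
        return f"{self.chrom}-{self.pos}-{self.ref}-{self.alt}"


@dataclass(frozen=True)
class MoiConfirmation:
    """Assumed shape of a MOI-confirmation event."""

    variant: Variant
    sample: str
    gene: str
    moi: MOI
    # Partner variants, e.g. for a compound-het; empty by default.
    supporting: Tuple[Variant, ...] = ()


variant = Variant(chrom="1", pos=12345, ref="A", alt="T")
event = MoiConfirmation(
    variant=variant, sample="SAMPLE1", gene="GENE1", moi=MOI.AUTOSOMAL_DOMINANT
)
as_plain_dict = asdict(event)  # nested dataclasses become nested dicts
```

Frozen dataclasses are hashable, so a `Variant` could also be used directly as a set member or dictionary key when tracking previously seen variants.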

https://avro.apache.org/docs/1.11.1/getting-started-python/ might be an example

This would benefit from an established way of navigating updates to models, such that if the model develops, data written under older versions of the schema can still be read and made compatible.
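
One possible shape for this, sketched under assumptions: each stored record carries a version number, and a chain of small upgrade functions brings old records up to the current version on read. The `schema_version` field, the migration functions, and the specific changes between versions are all invented for illustration. A schema library such as Avro approaches the same problem differently, by resolving a writer schema against a reader schema and filling in declared field defaults.

```python
from typing import Callable, Dict

CURRENT_VERSION = 3  # hypothetical current schema version


def _v1_to_v2(record: dict) -> dict:
    # Hypothetical change: v2 added 'first_seen'; old records get None.
    out = dict(record)
    out.setdefault("first_seen", None)
    out["schema_version"] = 2
    return out


def _v2_to_v3(record: dict) -> dict:
    # Hypothetical change: v3 renamed 'chr' to 'chrom'.
    out = dict(record)
    out["chrom"] = out.pop("chr")
    out["schema_version"] = 3
    return out


# Maps a version to the function that upgrades a record from it to the next.
MIGRATIONS: Dict[int, Callable[[dict], dict]] = {1: _v1_to_v2, 2: _v2_to_v3}


def upgrade(record: dict) -> dict:
    """Return a copy of `record` upgraded to CURRENT_VERSION.

    Records without a 'schema_version' key are assumed to be version 1.
    """
    version = record.get("schema_version", 1)
    if version > CURRENT_VERSION:
        raise ValueError(f"record version {version} is newer than this code")
    while version < CURRENT_VERSION:
        record = MIGRATIONS[version](record)
        version = record["schema_version"]
    return dict(record)


old_record = {"chr": "1", "pos": 12345, "ref": "A", "alt": "T"}
new_record = upgrade(old_record)
```

Each migration copies its input, so the original record is left untouched, and adding a version 4 would only require one new function plus one new entry in `MIGRATIONS`.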