Sage-Bionetworks / schematic

Package for biomedical data model and metadata ingress management
https://schematicpy.readthedocs.io/en/stable/cli_reference.html
MIT License

Refactor data_model_parser #857

Closed: mialy-defelice closed this issue 2 weeks ago

mialy-defelice commented 2 years ago

Start writing the code for the new data model parser, which takes in either a JSON-LD or CSV data model and returns a dictionary that can be used to create the nodes and edges of a network graph.
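To make the intended output concrete, here is a minimal sketch of the CSV side, assuming a simplified data model with only four columns. The column names, the function names `parse_csv_model` and `to_edges`, and the sample rows are all hypothetical and chosen for illustration; this is not the parser's actual design.

```python
import csv
import io

# Hypothetical columns whose cells hold comma-separated references to other attributes.
LIST_COLUMNS = ("Valid Values", "DependsOn")


def parse_csv_model(text: str) -> dict:
    """Map each attribute name to a dict of its relationships."""
    model = {}
    for row in csv.DictReader(io.StringIO(text)):
        name = row["Attribute"].strip()
        relationships = {"Description": (row.get("Description") or "").strip()}
        for column in LIST_COLUMNS:
            raw = row.get(column) or ""
            # Split the cell into individual names, dropping empty entries.
            relationships[column] = [v.strip() for v in raw.split(",") if v.strip()]
        model[name] = relationships
    return model


def to_edges(model: dict) -> list:
    """Flatten the dictionary into (source, relationship, target) triples."""
    return [
        (name, column, target)
        for name, relationships in model.items()
        for column in LIST_COLUMNS
        for target in relationships[column]
    ]


# Made-up sample data model, not taken from any real project.
SAMPLE = '''Attribute,Description,Valid Values,DependsOn
Patient,A study participant,,"Age, Sex"
Sex,Reported sex,"Female, Male",
Age,Age in years,,
'''

model = parse_csv_model(SAMPLE)
edges = to_edges(model)
```

The keys of `model` would become graph nodes and the triples in `edges` would become labeled edges. A JSON-LD parser would need to return the same dictionary shape so that the graph-building code does not care which format was read.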

For @MiekoHash and others, there are two main user scenarios we want to enable with this work (each comprising multiple user stories):

"As an RSE, I want to be able to add new features to schema classes and properties (such as attribute context relevant for all DCC projects) and have corresponding functions in the data model parser, so that I only need to make a change in one place for each new class or property feature. That way I would avoid having to look up and change redundant data model parser code snippets, which would allow me to support new types of schema object features at least 3x faster. (Currently, code snippets in the data model parser repeat the same business logic for properties and classes more than three times and are not encapsulated as modules.)"

"As an RSE, I want to be able to add new schema formats that schematic can parse natively, so that I can add support for reading LinkML data schema files (e.g. those capturing the data models in INCLUDE and Bridge2AI)."
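One way the second scenario could be approached is a small registry that maps a file extension to a parser function, so that adding a format means registering one new function. The sketch below is an assumption for illustration only: `PARSERS`, `register`, `parse_jsonld`, and `parse_model` are hypothetical names, not schematic's API, and the JSON-LD handling assumes a document with an `@graph` list whose entries carry an `@id`.

```python
import json
import os
from typing import Callable, Dict

# Hypothetical registry: file extension -> function returning the shared dictionary.
PARSERS: Dict[str, Callable[[str], dict]] = {}


def register(extension: str):
    """Decorator that records a parser function under a file extension."""
    def decorator(func: Callable[[str], dict]) -> Callable[[str], dict]:
        PARSERS[extension] = func
        return func
    return decorator


@register(".jsonld")
def parse_jsonld(text: str) -> dict:
    # Assumes an "@graph" list of entries, each identified by "@id".
    graph = json.loads(text).get("@graph", [])
    return {entry["@id"]: entry for entry in graph}


def parse_model(filename: str, text: str) -> dict:
    """Pick a parser from the filename's extension and run it on the text."""
    extension = os.path.splitext(filename)[1].lower()
    if extension not in PARSERS:
        raise ValueError(f"No parser registered for {extension!r}")
    return PARSERS[extension](text)
```

Under this design, LinkML support would be one more function decorated with `@register(".yaml")`, with no changes to `parse_model` or to the code that consumes the returned dictionary.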

milen-sage commented 1 year ago

@mialy-defelice do you think breaking this down into smaller issues would be helpful? E.g., we could capture the parser methods you have laid out in the refactor design as their own issues.

mialy-defelice commented 1 year ago

For Sprint2.2023, the AC is to break down this portion of the refactor into additional issues. I can link them here.

Sending documentation to Mieko.

MiekoHash commented 1 year ago

Requested LucidChart account via IT ticket.

MiekoHash commented 1 year ago

Received my LucidChart account, and I can now see the class diagram. I plan to review the docs in the next couple of weeks.

mialy-defelice commented 1 year ago

Create issues for the following:

mialy-defelice commented 1 year ago

I have just been coming up with user stories as I go. I need a catalog of the actual user stories to link them properly.