GenomicsStandardsConsortium / mixs

“Minimum Information about any (X) Sequence” (MIxS) specification
https://w3id.org/mixs
Creative Commons Zero v1.0 Universal

Source of truth for MIxS 6.2 release and transition to schema sheets #715

Open ramonawalls opened 1 year ago

ramonawalls commented 1 year ago

On today's TWG call, we decided that the SOT for MIxS should be the TSV files formatted for schema sheets, instead of YAML. That way they can be edited in Excel. The TSVs should be transformed back into YAML as part of a PR (with each commit?).

However, we will keep YAML as the SOT for MIxS 6.2, then work on the schema sheets implementation for either a 6.3 version or for 7.0 (expected ~ Feb. 2024).

@turbomam @sujaypatil96 please review my notes above and make any corrections needed or ask clarifying questions here.

turbomam commented 1 year ago

The stored source of truth will be LinkML YAML. We did discuss allowing contributors to make their initial submission in TSV. It was not clear to me whether the TSVs could be fully validatable schemasheets, TSVs that use schemasheets headers but are otherwise unconstrained, or truly free-form spreadsheets.

Regarding the use of spreadsheet applications like MS Excel, I say that people can use any tool they want, but they are responsible for sharing UTF-8 encoded, tab-delimited documents.

Who is going to be responsible for converting the submitted sheets to something that can be merged into the schema, i.e. either validatable schemasheets or validatable LinkML YAML? I think we may be really underestimating how much work that will take if the submissions are not required to be valid schemasheets.
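
To make the "UTF-8 encoded, tab-delimited" requirement concrete, here is a minimal Python sketch of a pre-submission check. It is only an illustration: the function name `check_tsv_submission` is hypothetical, it uses only the standard library, and it does not check schemasheets headers or validate anything against LinkML.

```python
import csv
import io


def check_tsv_submission(raw: bytes) -> list[str]:
    """Return a list of problems found; an empty list means the basic checks passed.

    Hypothetical helper: it only checks the encoding and the column layout.
    """
    # The file must decode cleanly as UTF-8.
    try:
        text = raw.decode("utf-8")
    except UnicodeDecodeError as err:
        return [f"not valid UTF-8: {err}"]

    # Parse as tab-delimited; newline="" lets the csv module handle line endings.
    rows = list(csv.reader(io.StringIO(text, newline=""), delimiter="\t"))
    if not rows:
        return ["file is empty"]

    problems = []
    width = len(rows[0])
    # A single-column header usually means a different delimiter was used.
    if width < 2:
        problems.append("header has fewer than two tab-delimited columns")
    # Every non-blank row should have as many columns as the header.
    for number, row in enumerate(rows[1:], start=2):
        if row and len(row) != width:
            problems.append(f"row {number} has {len(row)} columns, expected {width}")
    return problems
```

A contributor or a CI job could call it as `check_tsv_submission(open(path, "rb").read())` and reject the sheet if the returned list is not empty. Passing this check says nothing about whether the sheet is a valid schemasheet, so the conversion question above still stands.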