Closed turbomam closed 3 months ago
@sujay, please see model_sections.py for the way I want to model DH sections within a LinkML model like soil_biosample.yaml
, based on the contents of the Sections_order tab in the Soil-NMDC-Template_Compiled Google Sheet
Google Sheets is no longer used the pipeline. If there is still an open issue with column and/or section ordering please open an issue in the submission-schema
repo.
review: column ordering is being handled well now (pandas sort over column_order annotations), but no action on section annotation yet
test: all slots claimed by a DH-specifying class (like soil_biosample) should have section and column_order annotations
test: all section values/names assigned to section annotation should be defined in the section order tab
lower priority test (or design decision): what to do if the column order values aren't exactly 1..(count(slots per section))
New functionality:
DataHarmonizer
data.tsv
row types (see @turbomam's "reunited" etc.)