microbiomedata / DataHarmonizer

Our dev interface is available via GitHub Pages:
https://microbiomedata.github.io/DataHarmonizer/main.html
MIT License

final steps for section composition, section order and column order #107

Closed turbomam closed 3 months ago

turbomam commented 2 years ago

review: column ordering is handled well now (a pandas sort over the column_order annotations), but nothing has been done about the section annotation yet

test: all slots claimed by a DH-specifying class (like soil_biosample) should have section and column_order annotations

test: every section value/name assigned to a section annotation should be defined in the section order tab

lower priority test (or design decision): what to do if the column order values aren't exactly 1..(count(slots per section))
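
The three tests above could be sketched roughly as follows. This is only an illustration under stated assumptions: `check_annotations`, the dict-of-dicts shape of `slots`, and the plain list for `section_order` are all hypothetical stand-ins, not the actual DataHarmonizer or LinkML code, and `column_order` values are assumed to be integers.

```python
from collections import defaultdict


def check_annotations(slots, section_order):
    """Return a list of human-readable problems found in slot annotations.

    slots: hypothetical mapping of slot name -> annotations dict,
           e.g. {"depth": {"section": "core", "column_order": 1}}
    section_order: hypothetical list of section names, standing in for
           the contents of the section order tab
    """
    problems = []
    known_sections = set(section_order)
    orders_by_section = defaultdict(list)

    for name, annotations in slots.items():
        section = annotations.get("section")
        order = annotations.get("column_order")

        # Test 1: every slot needs both annotations.
        if section is None:
            problems.append(f"{name}: missing section annotation")
        if order is None:
            problems.append(f"{name}: missing column_order annotation")

        # Test 2: every section value must be defined in the section order.
        if section is not None and section not in known_sections:
            problems.append(f"{name}: section {section!r} not in section order")

        if section is not None and order is not None:
            orders_by_section[section].append(order)

    # Lower-priority test: column_order values should be exactly 1..n
    # within each section (assumes integer values).
    for section, orders in orders_by_section.items():
        if sorted(orders) != list(range(1, len(orders) + 1)):
            problems.append(
                f"{section}: column_order values {sorted(orders)} "
                f"are not 1..{len(orders)}"
            )

    return problems


if __name__ == "__main__":
    example_slots = {
        "depth": {"section": "core", "column_order": 1},
        "ph": {"section": "core", "column_order": 3},
    }
    for problem in check_annotations(example_slots, ["core"]):
        print(problem)
```

Here the last check only reports gaps or duplicates; whether to reject them or just renumber is still the open design decision.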

New functionality:

DataHarmonizer data.tsv row types (see @turbomam's "reunited" etc.)

turbomam commented 2 years ago

@sujay, please see model_sections.py for the way I want to model DH sections within a LinkML model like soil_biosample.yaml, based on the contents of the Sections_order tab in the Soil-NMDC-Template_Compiled Google Sheet

pkalita-lbl commented 3 months ago

Google Sheets is no longer used in the pipeline. If there is still an open issue with column and/or section ordering, please open an issue in the submission-schema repo.