microbiomedata / DataHarmonizer

Our dev interface is available via github pages:
https://microbiomedata.github.io/DataHarmonizer/main.html
MIT License
1 stars 0 forks source link

Update Elevation help text to include units #157

Closed pvangay closed 1 year ago

pvangay commented 2 years ago

Typing "299" into elevation fails on validation. Typing "299 m" succeeds.

The help description and guidance do not state that a unit is required. This should be specified in the help.

turbomam commented 2 years ago

Note that MIxS as-is columns like this generally require the user to specify units, whereas several MIxS modified column have been created in which a bare value is required, and the inclusion of a unit will fail validation.

This highlights the need to be more consistent with documentation (including the double click help) and validation. I think the idea for the MIxS modified columns was to communicate the required units to the submitter but require numerical entries only.

Since elev is a MIxS as-is column, specified in the mixs_packages_x_slots tab, it inherits all attributes from MIxS, including the description, DH guidance (from LinkML comments), and the required range, QuantityValue, which is converted into the regular expression ^[-+]?[0-9]*\.?[0-9]+([eE][-+]?[0-9]+)? \S+$, where the left-hand \S+ means a space and them at least one other character.

Solving this issue will require either

ssarrafan commented 1 year ago

Adding to the current sprint per the task list from the subport squad. Is this reasonable @turbomam?

mslarae13 commented 1 year ago

I am going to take over this issue. If I remember correctly, the decision is elevation should always be in meters? Or will we accept ft?

If ONLY meters, we need to make the validation allow for the number only. And update the text to reflect this. I can update the guidance & example. @turbomam can you update the validation rule?

Unless we want to allow ft? If we do, I'll just update the text. No validation change needed.

turbomam commented 1 year ago

Thanks @mslarae13

I think there are several questions to consider for elevation and other measurement-like slots, to maximize consistent slot definitions. Let's try to minimize the number of patterns we use, and write really consistent annotations (comments, description, examples, etc.)

I will generate a table of Biosample-related slots that illustrates the current state of the criteria above.

mslarae13 commented 1 year ago

Thanks @turbomam !

Should we talk about this on the 3rd? or on the 10th?

mslarae13 commented 1 year ago

@ssarrafan overdue, please add to January 2023 sprint

ssarrafan commented 1 year ago

@mslarae13 moving to the next sprint, let me know if you don't plan to work on it in the next few weeks please

mslarae13 commented 1 year ago

elevation should always be reported in meters. Mark and I met today and update oxy_stat_sample & elev

mslarae13 commented 1 year ago

https://github.com/microbiomedata/nmdc-schema/pull/689