cidgoh / DataHarmonizer

A standardized browser-based spreadsheet editor and validator that can be run offline and locally, and which includes templates for SARS-CoV-2 and Monkeypox sampling data. This project, created by the Centre for Infectious Disease Genomics and One Health (CIDGOH), at Simon Fraser University, is now an open-source collaboration with contributions from the National Microbiome Data Collaborative (NMDC), the LinkML development team, and others.
MIT License

CanCOGeN template: add corrections for null values in all fields upon LIMS export #256

Closed by griffie 2 years ago

griffie commented 2 years ago

Right now DH corrects the spacing and capitalization of null values in required fields only. Malformed null values entered in optional fields are not corrected, and they are causing problems in submissions to the data portal.

Can we add some code to the NML LIMS export that searches all fields for null values and, once they are identified, corrects errant spacing and capitalization?

ddooley commented 2 years ago

Done on the DH draft! Note that although some fields, like GISAID Accession, are marked as capitals, if the content of such a field is a null value string, it will still be adjusted for the NML LIMS export, e.g. "MISSING" is converted to "Missing" on export.
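
For readers who want to see the idea, here is a minimal sketch of the kind of null-value normalization discussed above. It is not the DataHarmonizer export code (DH itself is JavaScript). The function names and the list of canonical null values are assumptions for illustration, and only "Missing" is confirmed by this thread.

```python
# Hypothetical canonical null-value strings. Only "Missing" appears in this
# thread; the other entries are assumed for illustration.
CANONICAL_NULLS = [
    "Not Applicable",
    "Missing",
    "Not Collected",
    "Not Provided",
    "Restricted Access",
]


def _key(text):
    """Trim, collapse runs of whitespace, and lowercase for comparison."""
    return " ".join(text.split()).lower()


# Map each normalized key to its canonical spelling.
_NULL_LOOKUP = {_key(value): value for value in CANONICAL_NULLS}


def normalize_null(value):
    """Return the canonical null string if value is a null value with
    errant spacing or capitalization; otherwise return value unchanged."""
    if not isinstance(value, str):
        return value
    return _NULL_LOOKUP.get(_key(value), value)


def normalize_row(row):
    """Apply normalize_null to every field of a row (a dict), required
    or optional, leaving non-null content untouched."""
    return {field: normalize_null(value) for field, value in row.items()}


if __name__ == "__main__":
    row = {
        "GISAID accession": "MISSING",   # capitalized field holding a null value
        "host age": "  not   provided ", # optional field with errant spacing
        "sample ID": "ABC-123",          # real content, left as-is
    }
    print(normalize_row(row))
```

Because the lookup compares a trimmed, whitespace-collapsed, lowercased form of each cell, a value such as "MISSING" in a capitalized field maps back to "Missing", while ordinary content like an accession number passes through unchanged.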