gbif / metabarcoding-data-toolkit-ui

Frontend for the eDNA tool

include "ignore this xxx" in the sample and taxon sheets #103

Open tobiasgf opened 6 months ago

tobiasgf commented 6 months ago

Users often have control samples or other samples that they wish to exclude from the dataset published to GBIF but keep in the uploaded data for completeness. The same goes for specific taxa (contaminants). Could we have an optional "ignore" field for both the sample and taxon sheets, so that any content in that field excludes the corresponding row entirely from the data that eventually goes into the DwC-A?
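A minimal sketch of the requested behaviour, assuming each sheet is read into a list of dictionaries and that the optional column is literally named `ignore` (both are assumptions for illustration, not the toolkit's actual data model). Any non-blank value in that column drops the row:

```python
def drop_ignored(rows, ignore_field="ignore"):
    """Keep only rows whose ignore field is missing, None, or blank."""
    kept = []
    for row in rows:
        value = row.get(ignore_field)
        # Any non-blank content in the field excludes the row.
        if value is None or str(value).strip() == "":
            kept.append(row)
    return kept


# Hypothetical sample sheet: S2 is flagged, S3 has no ignore column at all.
samples = [
    {"id": "S1", "ignore": ""},
    {"id": "S2", "ignore": "field blank"},
    {"id": "S3"},
]

published = drop_ignored(samples)
print([row["id"] for row in published])  # ['S1', 'S3']
```

The same function would work for the taxon sheet, since it only looks at the one column.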

Potential problems:

tobiasgf commented 4 months ago

The new field for the taxon sheet could be called e.g. excludedTaxon, exclusionCriterion, or similar. (Or do we already have something that could be used, with caution, e.g. identificationRemarks?) Values could be: Known contamination, Suspected contamination, Spurious detection, Habitat mismatch, non-indigenous, Suspicious sequence, non-target, low abundance, low frequency, positive control, other, ...
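As a rough sketch of how such a field might be applied, assuming the column is named `exclusionCriterion` (one of the names suggested above) and that the proposed values are treated as a case-insensitive vocabulary. The function name, the row layout, and the way the value list is split into separate terms are assumptions for illustration:

```python
# Proposed values from the comment above, lower-cased for comparison.
EXCLUSION_CRITERIA = {
    "known contamination",
    "suspected contamination",
    "spurious detection",
    "habitat mismatch",
    "non-indigenous",
    "suspicious sequence",
    "non-target",
    "low abundance",
    "low frequency",
    "positive control",
    "other",
}


def split_taxa(taxa, field="exclusionCriterion"):
    """Return (published, excluded, unrecognised_values).

    Any non-blank value excludes the taxon; values outside the
    vocabulary are still excluded but reported for review.
    """
    published, excluded, unrecognised = [], [], []
    for taxon in taxa:
        value = str(taxon.get(field) or "").strip()
        if not value:
            published.append(taxon)
            continue
        excluded.append(taxon)
        if value.lower() not in EXCLUSION_CRITERIA:
            unrecognised.append(value)
    return published, excluded, unrecognised


# Hypothetical taxon sheet rows.
taxa = [
    {"id": "OTU1", "exclusionCriterion": ""},
    {"id": "OTU2", "exclusionCriterion": "Known contamination"},
    {"id": "OTU3", "exclusionCriterion": "lab mix-up"},
    {"id": "OTU4"},
]

published, excluded, unrecognised = split_taxa(taxa)
print([t["id"] for t in published])  # ['OTU1', 'OTU4']
print(unrecognised)                  # ['lab mix-up']
```

Reporting unrecognised values instead of rejecting them keeps the "any content excludes the row" rule while still nudging users toward the suggested terms.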

For the sample data we could include the two existing fields, neg_cont_type and pos_cont_type, and exclude (from the generated DwC) all samples that carry any value in either of these fields. Values for neg_cont_type could be: field blank, blank filter, extraction blank, PCR non-template control, other.

Values for pos_cont_type could be free text (a string of taxon names, the name of a known positive control mock sample, etc.).
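A small sketch of the sample-side rule, assuming samples are dictionaries keyed by column name and that read counts are stored per OTU and per sample ID (the `otu_counts` layout and the helper name are hypothetical). A sample with any value in neg_cont_type or pos_cont_type is treated as a control, and its counts are left out of the published table:

```python
CONTROL_FIELDS = ("neg_cont_type", "pos_cont_type")


def is_control(sample):
    """True if either control field carries a non-blank value."""
    return any(str(sample.get(f) or "").strip() for f in CONTROL_FIELDS)


# Hypothetical sample sheet: one real sample, one blank, one mock community.
samples = [
    {"id": "S1", "neg_cont_type": "", "pos_cont_type": ""},
    {"id": "B1", "neg_cont_type": "extraction blank"},
    {"id": "M1", "pos_cont_type": "mock community A"},
]

# Hypothetical read counts: OTU ID -> sample ID -> count.
otu_counts = {
    "OTU1": {"S1": 12, "B1": 3, "M1": 0},
    "OTU2": {"S1": 0, "B1": 0, "M1": 40},
}

keep_ids = {s["id"] for s in samples if not is_control(s)}

# Drop the control samples' columns from every OTU row.
published_counts = {
    otu: {sid: n for sid, n in counts.items() if sid in keep_ids}
    for otu, counts in otu_counts.items()
}

print(sorted(keep_ids))   # ['S1']
print(published_counts)   # {'OTU1': {'S1': 12}, 'OTU2': {'S1': 0}}
```

The original upload stays untouched here; only the derived structures that would feed the archive are filtered.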