Open mdugenne opened 2 years ago
considering that we don't want to get rid of data (rows) for the "cleaned" version of the tsv files, this part of the file reading function https://github.com/jessluo/PSSdb/blob/cd3116e484aba24f4bf99849bfc65a86bd06bba7/scripts/Size_spectra_functions.py#L75-L82
needs to be included after the data is read and standardized but before the data is used for PSS analysis
Known issues (leak, air bubbles, camera focus, and settings) with the IFCB instrument may alter the detection of plankton and result in inaccurate count and/or size estimates, affecting the observed size distribution and predicted slope of the size spectrum.
Given that (1) IFCB projects uploaded on Ecotaxa have not been flagged, (2) only a small percentage of IFCB projects have been validated (3) some predicted artefacts are actually good quality images of plankton, is there a reliable way to flag and filter out bad quality IFCB samples?
Few ideas: Within the possible artefacts (e.g. bubbles, beads, badfocus), bubbles are likely the best predicted, followed by beads.