lemieuxl / pyGenClean

Automated genetic data clean up procedure in Python.
GNU General Public License v3.0
3 stars 1 forks source link

output message for marker/sample missingness indicates the opposite of what's calculated #24

Closed jgrundstad closed 8 years ago

jgrundstad commented 8 years ago

Considering the documentation for the Sample and Marker Missingness modules (e.g. The missingness threshold (remove samples with more than x percent missing genotypes). [Default: 0.100]), the logged results appear to indicatethe opposite:

Number of samples with missing rate less or equal to 0.1 5 -5*

Am I understanding this correctly? Thanks!

lemieuxl commented 8 years ago

The output message is indeed wrong. The text in the result description is correct, but the information in the summary results table should read:

Number of samples with missing rate higher than 0.1    5        -5

This represents the number of samples that were excluded from the dataset.