VEuPathDB / microbiomeComputations

1 stars 0 forks source link

Correlations: pre-filter features before performing calculation #69

Closed d-callan closed 7 months ago

d-callan commented 9 months ago

Examples might be to filter taxa so that only those are kept which are seen in 50% of samples or more, or filter pathways by coverage, or filter either by abundance, or probably other ideas we should consider as well...

whatever we decide, we should be able to support w literature and add to user-facing documentation, OR add user-controls for

asizemore commented 8 months ago

We're a go on pre-filtering on the backend and writing about it in our user documentation. We should remove variables that don't meet a min proportion of non-zero values. @d-callan did you say you usually saw values around 50% for the min proportion?

asizemore commented 7 months ago

@d-callan can we close this?