Closed mortonjt closed 1 year ago
Are they being filtered out by songbird?
Yes, there are a couple of filters in songbird, in particular
--min-feature-count
: filters microbes according to the number of samples that they are observed in. By default, if a microbe appears in less than 10 samples, then it will be filtered out (since regression estimates will be poor for those microbes).--min-sample-count
: filters samples according to the number of reads that they have. By default, if a sample has less than 1000 reads, it will be removedSee below for more details https://github.com/biocore/songbird#73-faqs-parameters-
--min-sample-count
was indeed the problem. This raises another question: does songbird work on the relative abundance table produced by MetaPhlAn? Or would you need to convert it to read counts by multiplying each taxa by the total no. of reads per sample? Thanks!
Getting the raw read counts would be ideal. But scaling the relative abundances to say 10k should also be ok
On Fri, Aug 28, 2020, 5:28 AM johannesbjork notifications@github.com wrote:
--min-sample-count was indeed the problem. This raises another question: does songbird work on the relative abundance table produced by MetaPhlAn? Or would you need to convert it to read counts by multiplying sample relative abundances by sample specific sequencing depths? Thanks!
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/biocore/songbird/issues/137#issuecomment-682473917, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA75VXMKZ52U7BJV6HYO3Q3SC6IFLANCNFSM4QKXV2AA .
Below is an example of how this error can arise
One issue with this run was all of the samples were accidentally being filtered out, since the sample names between the biom and the sample metadata didn't match.
We should have a better error message for this sort of error.