I have a question regarding the minimum_match threshold. I supplied LULU with a matchlist generated by vsearch at 84% sequence similarity and then ran LULU at multiple thresholds. At some thresholds, child ASVs whose similarity to the parent is below minimum_match are still merged (e.g., at 93% minimum match a child with a 91.5% match is merged; at 95% minimum match several children between 94% and 95% are merged).
Is there a simple explanation for this? I thought it might be rounding, but the 91.5% match merged at a 93% minimum suggests that is not the cause. So far this has affected only very low-frequency ASVs (in terms of sample count), so it's not a huge issue, but I am curious to know whether this is intentional.
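To make the observation concrete, here is a minimal sketch of the kind of check that flags these cases. It is not LULU's code. It assumes the matchlist has the three-column layout (query, target, percent identity) and that the merged child/parent pairs have already been extracted from the LULU output. The function name `below_threshold_merges` and the ASV ids and values in the example are hypothetical.

```python
def below_threshold_merges(matchlist_rows, merged_pairs, minimum_match):
    """Return (child, parent, similarity) for merged pairs below minimum_match.

    matchlist_rows: iterable of (query, target, percent_identity) rows
                    (assumed layout of the vsearch matchlist).
    merged_pairs:   iterable of (child, parent) ids reported as merged.
    """
    # Keep the highest similarity seen for each pair, regardless of direction.
    best = {}
    for query, target, identity in matchlist_rows:
        key = frozenset((query, target))
        best[key] = max(best.get(key, 0.0), float(identity))

    flagged = []
    for child, parent in merged_pairs:
        similarity = best.get(frozenset((child, parent)))
        # Pairs missing from the matchlist are skipped, not flagged.
        if similarity is not None and similarity < minimum_match:
            flagged.append((child, parent, similarity))
    return flagged


# Hypothetical data for illustration only.
rows = [("ASV_12", "ASV_1", 91.5), ("ASV_30", "ASV_2", 96.0)]
merged = [("ASV_12", "ASV_1"), ("ASV_30", "ASV_2")]
print(below_threshold_merges(rows, merged, 93))
```

Any pair this returns is one where the merge happened even though the best listed similarity is below the threshold that was set.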
Thank you for the nice algorithm.
Thanks! Best, Eric