mikemc / speedyseq

Speedy versions of phyloseq functions
https://mikemc.github.io/speedyseq/

Can we refactor `tax_glom` and `merge_taxa_vec` to avoid calling `prune_taxa()`? #22

Open mikemc opened 4 years ago

mikemc commented 4 years ago

When doing a `tax_glom()` with a `taxrank` where many taxa are unassigned and `NArm = TRUE`, `tax_glom()` can be bottlenecked by the initial call to `prune_taxa()`, which removes the taxa with `bad_empty` values at `taxrank`. I suspect we could achieve the same outcome without calling `prune_taxa()` and thereby gain a significant speedup in such circumstances (especially if `taxa_are_rows = TRUE`, so that the transpose step is skipped).
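
A rough sketch of the idea, in base R on plain matrices rather than phyloseq objects: compute a logical mask for the taxa to keep and apply it while subsetting for the merge, instead of pruning the whole object first. The function name `glom_without_prune()` and the toy `otu` and `tax` matrices are hypothetical, it assumes taxa are rows, and whether this is actually faster than `prune_taxa()` on real data is untested here.

```r
# Toy stand-ins for an OTU table (taxa as rows) and a taxonomy table.
otu <- matrix(
  c(1, 2,
    3, 4,
    5, 6,
    7, 8),
  nrow = 4, byrow = TRUE,
  dimnames = list(paste0("t", 1:4), c("s1", "s2"))
)
tax <- matrix(
  c("Firmicutes",    "Bacillus",
    "Firmicutes",    NA,
    "Firmicutes",    "Bacillus",
    "Bacteroidetes", ""),
  nrow = 4, byrow = TRUE,
  dimnames = list(rownames(otu), c("Phylum", "Genus"))
)

# Hypothetical helper: merge taxa at `taxrank`, dropping taxa whose value at
# `taxrank` is in `bad_empty`, without a separate pruning pass.
glom_without_prune <- function(otu, tax, taxrank,
                               bad_empty = c(NA, "", " ", "\t")) {
  # Ranks from the first column up to and including `taxrank`.
  ranks <- colnames(tax)[seq_len(match(taxrank, colnames(tax)))]
  # Logical mask of taxa to keep; `%in%` treats NA as a matchable value.
  keep <- !(tax[, taxrank] %in% bad_empty)
  # Grouping key built from the kept rows only (vectorized paste over ranks).
  cols <- lapply(ranks, function(r) tax[keep, r])
  key <- do.call(paste, c(cols, sep = ";"))
  # Sum abundances within each group; groups stay in first-seen order.
  summed <- rowsum(otu[keep, , drop = FALSE], group = key, reorder = FALSE)
  # Name each merged taxon after the first taxon in its group.
  first <- !duplicated(key)
  rownames(summed) <- rownames(otu)[keep][first]
  list(
    otu = summed,
    tax = tax[keep, ranks, drop = FALSE][first, , drop = FALSE]
  )
}

res <- glom_without_prune(otu, tax, taxrank = "Genus")
res$otu
```

In this toy example, `t2` and `t4` are dropped because their `Genus` is `NA` or empty, and `t1` and `t3` are summed into a single row named `t1`.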