Speed up PAPRICA by only using unique sequences?

bowmanjeffs / paprica

paprica - PAthway PRediction by phylogenetIC plAcement

27 stars 8 forks source link

Hi, I have a very large 16S dataset, and I was wondering if there is an option to use PAPRICA only with unique 16S sequences to speed up the alignment step in the beginning? If I understand the logs correctly after the cmalign the program continues with unique sequences anyway... Of course, PAPRICA will work with unique 16S sequences now, but as far as I could see, there is no option to take sequence counts (abundance) of a non-redundant dataset into account when calculating the metabolic profile.

Thanks!

Cheers, Christiane

bowmanjeffs / paprica

Speed up PAPRICA by only using unique sequences? #47