Arcadia-Science / noveltree

NovelTree is a highly parallelized and computationally efficient phylogenomic workflow that infers gene families, gene family trees, species trees, and gene family evolutionary history.
GNU Affero General Public License v3.0
17 stars 3 forks source link

Ap/parallel annotation #49

Closed austinhpatton closed 1 year ago

austinhpatton commented 1 year ago

This is a minor change the the protein annotation R script. Previously, when multiple annotations (anywhere from 1-16 sets) were requested for download, this would occur serially and when done for many species could take up a good chunk of time.

Now, I parallelize the annotation process - the number of parallel tasks equals the minimum of either the number of available cores, or the number of annotations requested. This now means that since the module uses "process_medium", we can download all annotation sets per species simultaneously, increasing computational efficiency and speeding up the module by up to ~16X.