Arcadia-Science / noveltree

NovelTree is a highly parallelized and computationally efficient phylogenomic workflow that infers gene families, gene family trees, species trees, and gene family evolutionary history.
GNU Affero General Public License v3.0

speciesrax/generax fix for excluded gene families #46

Closed austinhpatton closed 1 year ago

austinhpatton commented 1 year ago

SpeciesRax and GeneRax both use RAxML under the hood to optimize tree topologies. Whereas IQ-TREE and MAFFT can handle some rare amino acids (I know at least selenocysteine, U, and possibly also pyrrolysine, O), RAxML can't. So, if these residues are present in the MSAs, the affected gene families are excluded from the generax/speciesrax analysis.

This change just goes in and removes these characters from the staged files in the generax and speciesrax modules. That seems to fix the issue.
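For illustration only, here is a minimal Python sketch of the kind of cleanup described above. It is not the code from this change: the function name `mask_rare_residues`, the residue set (U and O), and the choice to replace each rare residue with a gap character rather than delete it are all assumptions. Replacing instead of deleting is used here because it keeps every aligned sequence the same length.

```python
import re

# Assumption: U (selenocysteine) and O (pyrrolysine) are the residues to mask.
RARE_RESIDUES = re.compile(r"[UOuo]")


def mask_rare_residues(fasta_text: str, replacement: str = "-") -> str:
    """Replace rare residues in the sequence lines of a FASTA alignment.

    Header lines (starting with '>') are left untouched so that sequence
    names containing 'U' or 'O' are not altered.
    """
    cleaned = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            cleaned.append(line)  # keep the sequence header as-is
        else:
            cleaned.append(RARE_RESIDUES.sub(replacement, line))
    return "\n".join(cleaned) + "\n"


# Hypothetical two-sequence alignment; the first sequence contains U and O.
example = ">sp1_gene1\nMKUAO-L\n>sp2_gene1\nMKCAK-L\n"
masked = mask_rare_residues(example)
print(masked)
```

Passing `replacement="X"` would mark the positions as unknown residues instead of gaps; which of the two is preferable for the downstream analysis is a judgment call this sketch does not settle.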