Arcadia-Science / noveltree

NovelTree is a highly parallelized and computationally efficient phylogenomic workflow that infers gene families, gene family trees, species trees, and gene family evolutionary history.
GNU Affero General Public License v3.0

Monophyly constraints for gene tree and/or species tree inference #21

Status: Closed (closed by austinhpatton 1 year ago)

austinhpatton commented 1 year ago

Description of feature

Add an option to provide a list of monophyly constraints, to speed up gene family tree inference and (possibly) species tree inference.

For gene family trees, this will likely mean a list of species per group, where the groups are ones between which we confidently expect no legitimate HGT/hybridization, e.g. groups at the level of taxonomic supergroup, or enforcing monophyly of eukaryotes and of archaea.
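To make the "list of species per group" idea concrete, here is a minimal sketch of how such groups could be turned into a multifurcating constraint tree in Newick format, the kind of input some tree inference tools accept (e.g. IQ-TREE's `-g` option). The function name `constraint_newick`, the group names, and the example species are hypothetical, and whether NovelTree would pass constraints this way is an assumption.

```python
def constraint_newick(groups, unconstrained=()):
    """Build a multifurcating Newick constraint tree from species groups.

    groups: dict mapping a (hypothetical) group name to a list of species.
    unconstrained: species that are free to be placed anywhere.
    Each group with two or more species becomes one unresolved clade.
    """
    seen = set()
    clades = []
    for group_name in sorted(groups):
        members = sorted(set(groups[group_name]))
        overlap = seen.intersection(members)
        if overlap:
            # A species cannot belong to two monophyletic groups at once.
            raise ValueError(f"species in more than one group: {sorted(overlap)}")
        seen.update(members)
        if len(members) == 1:
            clades.append(members[0])
        elif members:
            clades.append("(" + ",".join(members) + ")")
    # Unconstrained species attach directly at the root polytomy.
    clades.extend(sorted(set(unconstrained) - seen))
    return "(" + ",".join(clades) + ");"


# Illustrative input only; these groups are not taken from the workflow.
example_groups = {
    "eukaryotes": ["Homo_sapiens", "Saccharomyces_cerevisiae"],
    "archaea": ["Sulfolobus_acidocaldarius", "Haloferax_volcanii"],
}
example_tree = constraint_newick(example_groups)
```

Leaving each group as an unresolved polytomy constrains only the group's monophyly, so relationships within the group are still inferred from the data.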

Internally, we will need to map sequence names (Genus_species:SeqID) to species names, since the constraints are defined on species, each of which may have multiple gene copies.
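A rough sketch of that mapping step, assuming sequence names always follow the `Genus_species:SeqID` pattern with the species name before the first colon. The helper names `species_of` and `sequences_per_group`, and the example data, are hypothetical.

```python
from collections import defaultdict


def species_of(seq_name):
    """Return the species part of a 'Genus_species:SeqID' sequence name."""
    species, sep, seq_id = seq_name.partition(":")
    if not sep or not species or not seq_id:
        raise ValueError(f"expected 'Genus_species:SeqID', got {seq_name!r}")
    return species


def sequences_per_group(seq_names, groups):
    """Expand species-level groups to the sequences present in one gene family.

    Sequences from species outside every group are left out, i.e. unconstrained.
    """
    species_to_group = {
        species: group for group, members in groups.items() for species in members
    }
    by_group = defaultdict(list)
    for seq_name in seq_names:
        group = species_to_group.get(species_of(seq_name))
        if group is not None:
            by_group[group].append(seq_name)
    return dict(by_group)


# Illustrative input only: one gene family with two copies from one species.
example_groups = {
    "eukaryotes": ["Homo_sapiens", "Saccharomyces_cerevisiae"],
    "archaea": ["Haloferax_volcanii"],
}
example_family = [
    "Homo_sapiens:P1",
    "Homo_sapiens:P2",
    "Haloferax_volcanii:Q1",
    "Escherichia_coli:R1",
]
example_mapping = sequences_per_group(example_family, example_groups)
```

One practical caveat: a colon separates a label from its branch length in Newick, so sequence names in this format would need to be quoted or renamed before being written into a per-family constraint tree.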