naturalis / supersmart

Self-Updating Platform for the Estimation of Rates of Speciation, Migration And Relationships of Taxa
MIT License
17 stars 5 forks source link

*BEAST issues #67

Closed rvosa closed 9 years ago

rvosa commented 9 years ago

There are some issues with the *BEAST implementation:

rvosa commented 9 years ago

We set sortTranslationTable="true" in the BEAST xml, which means we should be able to append tree descriptions verbatim.

rvosa commented 9 years ago

--append makes the next available suffix for the .nex and .log files (e.g. *.nex.1). At the end of the run the additional tree results are appended to the stem file sans --burnin.

rvosa commented 9 years ago

The problem with including more haplotypes in *BEAST is that we have a separate CLADE_MAX_DISTANCE setting, so we can't re-use the results from smrt orthologize, because these are filtered using BACKBONE_MAX_DISTANCE. We basically need to redo the orthologize, which is why most of its functionality has been moved to SequenceGetter, as of e1f4973055d497d52b293abc8c89c48be2ff7675

rvosa commented 9 years ago

Perhaps the easiest way to do this is to implement something like smrt clademerge --enrich, which adds up to CLADE_HAPLOTYPES (e.g. 3) similar sequences to each species while verifying that CLADE_MAX_DISTANCE is not exceeded.

rvosa commented 9 years ago

As of e5be3f95801e111f0f00b5155aca7e6e79085735 this now more or less works.