ssolo / ALE

Amalgamated likelihood estimation (ALE) is a probabilistic approach to exhaustively explore all reconciled gene trees that can be amalgamated as a combination of clades observed in a sample of gene trees. We implement the ALE approach in the context of a reconciliation model (cf. http://arxiv.org/abs/1211.4606 ), which allows for the duplication, transfer and loss of genes. We use ALE to efficiently approximate the sum of the joint likelihood over amalgamations and to find the reconciled gene tree that maximizes the joint likelihood among all such trees.
44 stars 15 forks source link

How to parse ALEobserve output to retrieve CCP values #36

Closed Gullumluvl closed 10 months ago

Gullumluvl commented 2 years ago

Hi,

I would like to measure Conditional Clade Probabilities on a sample of trees; I thought that this is exactly what ALEobserve does, but need some explanations on the .ale output.

Would you mind confirming the following suppositions on the format?

I have a subsidiary question regarding CCP definition: the paper of Höhna et al. 2012 is cited, but I have also read Larget 2013 which states that his definition is slightly different:

I note that this equation is similar to the conditional clade probability (CCP) formulas given in Höhna and Drummond (2012). The key difference is that in their formulas, the unnormalized probabilities for each tree are the product over all clades in the tree of the conditional clade probabilities of the form P(clade|parent of clade). In the approach in this article, the probability of each tree is calculated as the product over all parent clades in the tree of the conditional clade probabilities of the form P(all children clades|parent clade).

It seems to me that the latter definition is the one implemented in ALE, right?

Thanks in advance!

ssolo commented 10 months ago

We have more detailed description here, hope that helps: https://www.biorxiv.org/content/10.1101/2023.10.06.561091v2.supplementary-material