Use parsimony algorithm to compress likelihoods

The current approach to compressing likelihoods on the tree is a simple scheme based on merging values that are the same as their parents. This will not result in the smallest posslble number of state transitions on the tree. To do this we need to use the Hartigan parsimony algorithm like we do in the tskit haplotype matching code.

This "should" have a significant impact on performance, and make the precision parameter more meaningful, but we'll have to see. It's not a trivial change, as the code is highly optimised for the current method.

tskit-dev / tsinfer

Use parsimony algorithm to compress likelihoods #374