percentage of heterozygosity

Hi Mojtaba,

I would suggest to use Genomescope2 for that purpose, using the read kmers histogram obtainable with

meryl histogram reads.meryl > reads.hist

1) using shared k-mers between the two haplotypes - I assume are obtained from your haplotype assemblies? I wouldn't rely on the assembled sequences, as it is likely to contain errors and haplotype switches.

2) haplotype-specific k-mers - I guess you mean to get the heterozygosity = (maternal + paternal hapmers) / all read mers?

The hapmers are the inherited, distinguishable kmers from the parental genome. Any k-mer from a shared heterozygous region between the parents (e.g. AB AB, inherited as AB or BA) are not included, thus using the equation above would result in an under-estimated level the heterozygosity.

Thanks, Arang

marbl / merqury

percentage of heterozygosity #63