marschall-lab / panacus

Panacus is a tool for computing statistics for GFA-formatted pangenome graphs
MIT License
73 stars 4 forks source link

what dose coverage and quorum mean? #31

Closed peanut-nu closed 1 month ago

peanut-nu commented 1 month ago

hi, I'm sorry to bother you.But i really don't konw what dose coverage and quorum mean.Can you help me ?And can you help me see what's wrong with my results and how they can be interpreted? Again, my apologies. image image image image image image

danydoerr commented 1 month ago

Hey @peanut-nu, sure no apologies needed, I'm happy to help. I don't think there's anything wrong with your results. Regarding the coverage and quorum parameters:

You have very few samples/haplotypes, and thus the edge effects of the quorum threshold is particularly pronounced.

Hope this helps!

peanut-nu commented 1 month ago

Hey @peanut-nu, sure no apologies needed, I'm happy to help. I don't think there's anything wrong with your results. Regarding the coverage and quorum parameters:

  • coverage lets you calculate the growth statistic for the subset of countables (bps/nodes/edges) that are shared by a minimum number of haplotypes/samples. In other words, Panacus will only take those bars of the coverage histogram (the three last plots that you posted) into account when calculating pangenome growth that satisfy the threshold
  • quorum: This parameter allows you to control the minimum number of shared countables when a new haplotype/sample is "added" to the pangenome (progression along the x-axis). For instance, if you set the quorum to 100%, then this means that only those countables in are counted in the newly added genome that are already part of the pangenome. That way you can measure what is known as pangenome core. If you modulate the threshold, you can calculate e.g. soft core (90%), or be a bit more strict on pangenome growth (e.g. 10%, which means that each countable of the new genome must be shared by at least 10% of the already contained genomes).

You have very few samples/haplotypes, and thus the edge effects of the quorum threshold is particularly pronounced.

Hope this helps!

You've explained it very well, I'll look into it a bit more carefully, thanks a lot really!Hope you have a nice day !

danydoerr commented 1 month ago

Thanks, you too!