Open DBecker7 opened 8 months ago
Output format should be matrices, similar to something like cov(data)
. This makes it nice to plot as a heatmap for easy visual inspection.
Some good work has been done. However, the Jaccard similarity is not calculated correctly, and the code could use some clean-up. I also need to make the plotting functions easier to differentiate, or add them to one function with arguments/dispatch to specify which.
Jaccard similarity of all pairwise combinations of variants.
Extract the set of mutations for each variant, then compare pairwise across all variants.
There are a couple of interesting cases:
The function should be able to calculate these before data are fused (i.e. with varmat) as well as after (i.e. with the fused data).