Open emilysouthworth opened 6 months ago
@emilysouthworth
Each mutation is present in at least one sample and is listed once for every sample that carries it, so the row count does not represent the total number of distinct mutations across all samples from V974. The number of unique mutations can be calculated using dim(dplyr::distinct(maf$V974@data[, c("Chromosome", "Start_Position", "Reference_Allele")])); the first value returned is the number of unique mutations.
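To make the difference between row count and unique-mutation count concrete, here is a minimal sketch on a made-up table. The data, the object name maf_data, and the sample labels are all hypothetical; it only assumes the MAF data is in long format (one row per mutation per sample), and it uses base R's unique() in place of dplyr::distinct() so that it runs without extra packages.

```r
# Hypothetical long-format MAF-like table: one row per mutation per sample.
maf_data <- data.frame(
  Chromosome           = c("1", "1", "1", "2", "2", "3"),
  Start_Position       = c(100, 100, 100, 200, 200, 300),
  Reference_Allele     = c("A", "A", "A", "C", "C", "G"),
  Tumor_Sample_Barcode = c("S1", "S2", "S3", "S1", "S2", "S3"),
  stringsAsFactors     = FALSE
)

# Row count: every (mutation, sample) pair is counted separately.
n_rows <- nrow(maf_data)

# Unique mutations: drop the sample column, then remove duplicated rows.
key_cols <- c("Chromosome", "Start_Position", "Reference_Allele")
n_unique <- nrow(unique(maf_data[, key_cols]))

print(c(rows = n_rows, unique_mutations = n_unique))
```

In this toy table, the mutation on chromosome 1 is carried by three samples and so contributes three rows but only one unique mutation, which is why the two counts differ.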
Thank you for the reply. So does the number 298 refer to the number of mutations shared across all samples?
What does the number mean in each plotted mutHeatmap for a given sample?
For example, in the example data, V974 shows n = 298. I assumed this was the total number of mutations; however, the MAF file has 1588 rows, which I took (assuming no duplicate variants) to be the total number of mutations observed across all samples. Using the calculated percentages for shared, public and private events also doesn't get me to 298.