Niinleslie / MesKit

A tool kit for dissecting cancer evolution from multi-region derived tumor biopsies via somatic mutations
GNU General Public License v3.0
35 stars 9 forks source link

mutHeatmap n number in title #261

Open emilysouthworth opened 6 months ago

emilysouthworth commented 6 months ago

What does the number mean in each plotted mutHeatmap for a given sample?

For example, in example data V974 has 298 n. I assumed this was the total mutations however the maf file has 1588 rows, assuming (without duplicate variants) this is total number of mutations observed across all samples. Using the calculated percentages for shared, public and private events also doesn't get me to the number 298. Screenshot 2024-04-12 103311

Niinleslie commented 4 months ago

@emilysouthworth As each mutation exists in at least one sample, the row count does not represent the total number of mutations across all samples from V974. The number of unduplicated muts can be calculated using dim(dplyr::distinct(maf$V974@data[, c("Chromosome", "Start_Position", "Reference_Allele")])

emilysouthworth commented 3 months ago

Thank you for the reply. So, is the number 298 pertaining to the total number of mutations shared across all samples?