AlexandrovLab / SigProfilerAssignment

Assignment of known mutational signatures to individual samples and individual somatic mutations
BSD 2-Clause "Simplified" License
46 stars 10 forks source link

SBS signatures with numbers above 96 #122

Closed jasmussen-bcm closed 5 months ago

jasmussen-bcm commented 5 months ago

I recently ran SigProfiler on some TCGA data and got samples enriched for SBS signatures with values like "SBS288", "SBS384", "SBS1536", "SBS4608", and "SBS6144." What signatures are these referring to? I don't see any on the COSMIC website. https://cancer.sanger.ac.uk/signatures/sbs/

mdbarnesUCSD commented 5 months ago

The COSMIC signatures as of v3.4 release are supported for contexts SBS96, ID83, DBS78, SV32, CNV48, and RNA-SBS192. The contexts that you see for SBS288, SBS384, SBS1536, SBS4608, and SBS6144 are extended contexts that can be useful for analysis, but are not supported contexts for COSMIC signatures. The parameter collapse_to_SBS96 can be used to collapse from a higher context (ie SBS1536) down to SBS96, which is the context that SBS COSMIC signatures are currently presented in.

jasmussen-bcm commented 5 months ago

Thank you!

And, how do I interpret the "Assignment_Solution_Activities.txt" file with the signatures by sample ID? The values in the matrix don't appear to be scaled. Are these the count of each mutation associated with each signature per sample?

mdbarnesUCSD commented 5 months ago

Yes that is correct, the values in Assignment_Solution_Activities.txt are the mutation count associated with each signature per sample.

mdbarnesUCSD commented 5 months ago

Please reach out if you have any additional questions.