cio-abcd / variantinterpretation

Collaborative Interpretation-Pipeline workflow based on nf-core pipeline structure
MIT License

BRCAness SBS3 signature as clinical marker #18

Open biolancer opened 1 year ago

biolancer commented 1 year ago

Description of feature

BRCAness (the SBS3 signature from the COSMIC reference set established by Alexandrov et al.) can be used as a proxy for HRD and is requested as a clinical marker in the DNPM KDS v2. The BRCAness will be reported as a float between 0 and 1 that indicates the relative contribution of the SBS3 signature to a sample. My proposal for calculating the BRCAness is as follows:

Two major caveats of the SigProfiler suite are that it depends on multiple samples for NMF and that no containers of the suite exist in public Docker/Singularity registries.

As requested in the KDS v2, a confidence interval (CI) should be generated for the assigned SBS3 values. Several Python packages, e.g. scipy.stats, can compute a CI from the sample size, the sample mean and the sample standard deviation. I propose to calculate these from the respective VCF input dataset, since the trinucleotide context assignments are merged for NMF either way. The drawback is that the CI would depend on the size and composition of the input dataset and would only be reproducible if the same dataset is used as input.
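To make the CI idea concrete, here is a minimal sketch of a CI computed from sample size, mean and standard deviation. It is not a proposed implementation. Assumptions: the function name `mean_confidence_interval` and the example SBS3 values are hypothetical, and the sketch uses a normal approximation from the Python standard library to stay dependency-free. For small datasets a t-based interval (e.g. `scipy.stats.t.interval`) would be wider and likely more appropriate.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(values, confidence=0.95):
    """Normal-approximation CI for the mean of `values`, clipped to [0, 1].

    Hypothetical helper for illustration only; `values` are assumed to be
    per-sample SBS3 relative contributions from one input dataset.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values to estimate a standard deviation")
    sample_mean = mean(values)
    # Standard error of the mean, using the sample standard deviation (n - 1).
    sem = stdev(values) / sqrt(n)
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    # SBS3 contributions are fractions, so the bounds are clipped to [0, 1].
    lower = max(0.0, sample_mean - z * sem)
    upper = min(1.0, sample_mean + z * sem)
    return lower, upper


# Made-up SBS3 relative contributions, not real data.
sbs3_fractions = [0.12, 0.30, 0.05, 0.22, 0.18, 0.40, 0.09, 0.27]
low, high = mean_confidence_interval(sbs3_fractions)
print(f"mean SBS3 = {mean(sbs3_fractions):.3f}, 95% CI = [{low:.3f}, {high:.3f}]")
```

Because the interval is derived from whatever samples are passed in, adding or removing samples changes both the mean and the width, which is the reproducibility caveat described above.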

Some literature references I used: