AlexsLemonade / OpenPBTA-analysis

The analysis repository for the Open Pediatric Brain Tumor Atlas Project

Updated mutational-signatures: PR 1 of 2 #1226

Closed sjspielman closed 2 years ago

sjspielman commented 2 years ago

Purpose/implementation Section

The mutational-signatures module needs to be finalized, with the following overall goals. These goals replace previous 05 and 06 scripts/notebooks.

This PR:

  1. Perform extraction of the 8 CNS Signal/RefSig signatures with two methods, deconstructSigs and sigfit. This is now performed in 05-fit_cns_signatures.R. This script saves two files containing the estimated signature exposures, one per method:
    • results/fitted_exposures_signal-cns-deconstructSigs.RDS
    • results/fitted_exposures_signal-cns-sigfit.RDS
  2. Perform a brief comparison of the results from these two methods in 06-compare_cns_exposures.Rmd. The comparison shows high overall similarity between the two sets of exposures.
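
For readers unfamiliar with what "fitting" a fixed signature set means, here is a minimal, language-agnostic sketch in Python. It is not the code in 05-fit_cns_signatures.R and it is not the algorithm used by deconstructSigs or sigfit; it swaps in a simple non-negative least-squares fit via multiplicative updates. The function name `fit_exposures`, the toy signatures, and the toy counts are all hypothetical.

```python
# Illustration only: estimate non-negative exposures for a FIXED set of
# signatures, using Lee-Seung style multiplicative updates for least squares.
# This is an assumed stand-in technique, not deconstructSigs or sigfit.

def fit_exposures(counts, signatures, n_iter=500, eps=1e-12):
    """counts: mutation counts per context (length C).
    signatures: K signatures, each a length-C list of probabilities.
    Returns K non-negative exposures in mutation-count units."""
    k = len(signatures)
    n_ctx = len(counts)
    # Start from an even split of the total mutation count.
    h = [sum(counts) / k] * k
    for _ in range(n_iter):
        # Reconstructed catalog under the current exposures.
        recon = [sum(h[j] * signatures[j][c] for j in range(k))
                 for c in range(n_ctx)]
        new_h = []
        for j in range(k):
            num = sum(signatures[j][c] * counts[c] for c in range(n_ctx))
            den = sum(signatures[j][c] * recon[c] for c in range(n_ctx)) + eps
            new_h.append(h[j] * num / den)
        h = new_h
    return h

# Toy example (hypothetical data): two signatures over four contexts,
# with a catalog built as 30 * sig_a + 70 * sig_b.
sig_a = [0.7, 0.1, 0.1, 0.1]
sig_b = [0.1, 0.1, 0.1, 0.7]
toy_counts = [30 * a + 70 * b for a, b in zip(sig_a, sig_b)]
toy_exposures = fit_exposures(toy_counts, [sig_a, sig_b])
```

On this toy catalog the fit should recover exposures close to the 30 and 70 used to build it. Real catalogs are noisy and use 96 trinucleotide contexts, so the recovered exposures are estimates rather than exact values.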

Next PR:

  1. Update this module with publication-ready visualizations, which will live in an 07 notebook.

In the next PR, the overall analysis README.md and the specific mutational-signatures module README.md will be updated.

What GitHub issue does your pull request address?

1220

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Overall organization.

Is there anything that you want to discuss further?

There is a world in which 05 just uses deconstructSigs and proceeds straight to 07, skipping the (then moot) comparison that happens in the current 06. Also, because the current approach runs a quick evaluation of both methods, we don't actually end up saving time by visualizing the faster method. However, skipping the comparison seems less desirable, both for posterity and for keeping "magic" decisions out of the pipeline.

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Yes

Results

What types of results are included (e.g., table, figure)?

New result files in results/, as well as some quick scatterplots in the 06 notebook. These are not meant for the publication.

What is your summary of the results?

sigfit and deconstructSigs give fairly consistent results, with deconstructSigs being much faster. So, deconstructSigs will be used for subsequent viz.
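
One way to put a number on "fairly consistent" is a per-signature correlation of the exposures estimated by each method across samples. The sketch below is in Python with made-up values. It does not reproduce the scatterplots in 06-compare_cns_exposures.Rmd, and the names `compare_exposures`, `sig_1`, and `sig_2` are hypothetical; the layout of the saved RDS files is also an assumption.

```python
# Illustration only: per-signature Pearson correlation between the exposures
# estimated by two methods. All data below are made-up toy values.
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def compare_exposures(exp_a, exp_b):
    """exp_a, exp_b: dict of signature name -> per-sample exposures,
    with samples in the same order in both dicts (an assumption here)."""
    return {sig: pearson(exp_a[sig], exp_b[sig])
            for sig in exp_a if sig in exp_b}

# Hypothetical exposures for three samples from two methods.
method_a = {"sig_1": [0.1, 0.2, 0.3], "sig_2": [0.9, 0.8, 0.7]}
method_b = {"sig_1": [0.2, 0.4, 0.6], "sig_2": [0.7, 0.8, 0.9]}
toy_correlations = compare_exposures(method_a, method_b)
```

A correlation near 1 for a signature means the two methods rank and scale samples similarly for that signature, while a value near -1 (as for the toy `sig_2`) would flag a disagreement worth inspecting.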

Reproducibility Checklist

Documentation Checklist