AlexsLemonade / OpenPBTA-analysis

The analysis repository for the Open Pediatric Brain Tumor Atlas Project
Other
101 stars 67 forks source link

Updated analysis: update some figures to include annotations by `cancer_group` for manuscript #1144

Closed jharenza closed 2 years ago

jharenza commented 3 years ago

What analysis module should be updated and why? What changes need to be made? Please provide enough detail for another participant to make the update.

What input data should be used? Which data were used in the version being updated?

v20

When do you expect the revised analysis will be completed?

this and next week

Who will complete the updated analysis?

@kgaonkar6

Note, ticket still a WIP

kgaonkar6 commented 3 years ago

While working in the interaction plots I've identified some points that I'd like to get clarification on:

1) Do we want to plot coocurrence plot per each cancer_group corresponding to the broad_histology here https://github.com/AlexsLemonade/OpenPBTA-analysis/blob/0f5a7263af2b02f80c6949f9d33730fe006eaab2/analyses/interaction-plots/01-create-interaction-plots.sh#L40-L49 ? Asking because dividing some cancer_groups only have a couple of samples

             broad_histology                        cancer_group   n
1 Low-grade astrocytic tumor      Diffuse fibrillary astrocytoma   7
2 Low-grade astrocytic tumor                       Ganglioglioma   1
3 Low-grade astrocytic tumor                 Gliomatosis cerebri   1
4 Low-grade astrocytic tumor                                 GNG  47
5 Low-grade astrocytic tumor                   Oligodendroglioma   1
6 Low-grade astrocytic tumor               Pilocytic astrocytoma 123
7 Low-grade astrocytic tumor       Pleomorphic xanthoastrocytoma  11
8 Low-grade astrocytic tumor Subependymal Giant Cell Astrocytoma   9

And since currently only genes that are mutated in minimum 5 samples are kept for plotting we end up erroring out on some of the smaller cancer_groups. So we can either drop them with a cutoff of samples or plot cooccurrence by broad_histology but plot the cancer_group in the barplots?

jharenza commented 3 years ago

@kgaonkar6 I was envisioning only the barplot would be changed to cancer group, but the analysis would remain as is.

kgaonkar6 commented 3 years ago

For transcriptomic-dimension-reduction I get an error saying Error: Insufficient values in manual scale. 58 needed but only 26 provided.

The manual scale has 26 colors here, do we want to add the 58 colors from the figures palette for cancer_group here? https://github.com/jaclyn-taroni/OpenPBTA-analysis/blob/1ac140e1d93fcb9615d57265f9a11c55c9f4bc74/analyses/transcriptomic-dimension-reduction/util/dimension-reduction-functions.R#L256-L265

jaclyn-taroni commented 3 years ago

@kgaonkar6 do you need to alter the module itself or would be altering the figure generation steps to use an updated palette be sufficient for what you're trying to accomplish (e.g., in here)?

kgaonkar6 commented 3 years ago

umm good question, I thought we want to update the modules in this ticket and the figures/ will be recreated in #1132. Let me know if I mis-understood.

jaclyn-taroni commented 3 years ago

That question probably wasn't for me, but I'd say that I don't see any reason to update that module if it's only for display purposes. In fact, keeping figures/ separate is handy for this kind of tweak as it keeps the final touches of display separate from the analysis which can sometimes be exploratory in nature & is nice to have preserved.

For interaction plots, the figure is just copied from the module so changing the module is the right call: https://github.com/AlexsLemonade/OpenPBTA-analysis/blob/75c737d980884d3bec8c6fdcbecbe97b617e168d/figures/generate-figures.sh#L60

jaclyn-taroni commented 3 years ago

I'll also add: I'm not sure making a scatter plot with a 58 color palette is going to be helpful to or discernible by our readers?

kgaonkar6 commented 3 years ago

That makes sense @jaclyn-taroni!

From additional discussions with @jharenza we decided to:

jharenza commented 2 years ago

This seems to be completed.