AlexsLemonade / OpenPBTA-analysis

The analysis repository for the Open Pediatric Brain Tumor Atlas Project
Other
99 stars 66 forks source link

Visualize quanTIseq fractions across molecular subtypes #1251

Closed sjspielman closed 2 years ago

sjspielman commented 2 years ago

This PR is continued from #1232.

Adds two visualizations that explore the cell type fractions estimated by quanTIseq across molecular subtypes from three different cancer groups: High-grade glioma astrocytoma, Ependymoma, Medulloblastoma. Unclassified subtypes and subtypes with <3 samples are not considered.

Two visualization options are included - the data is the same but the faceting is flipped. Note that one of the versions contains fractions of uncharacterized cell types, which doesn't fit (due to faceting scheme) in the other version. Points are currently colored by cancer group.

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Yes

Questions for reviewers

Results

What types of results are included (e.g., table, figure)?

Two figures: version 1 and version 2

What is your summary of the results?

Reproducibility Checklist

Documentation Checklist

sjspielman commented 2 years ago

@jharenza I've marked this as a draft PR since I imagine we'll have some back-and-forth about the figure conceptualization here. But, it's ready for you to have a look (just not "formal" review!).

sjspielman commented 2 years ago

Update - added one more plot as an option to consider. This barplot shows the fractions of characterized cell types, similar to the barplot I had made for molecular signature fractions.

sjspielman commented 2 years ago

I've updated this code based on some feedback from @jharenza and I feel this is ready for review now! Note that the code is now in a notebook inside the immune-deconv module for improved organization, so in the end (probably a separate PR) we can copy final figures over into the main figures/ directory.

I suggest we proceed with survival analysis (again, note that given lack of variation in CD8+ scores, this is not likely going to be a useful predictor, but we'll find out!) in a subsequent PR and in its own notebook potentially in the survival-analysis/ module.

sjspielman commented 2 years ago

Noting that I also updated the analysis README to describe this notebook and its output.