greenelab / pancancer-evaluation

Evaluating genome-wide prediction of driver mutations using pan-cancer data
BSD 3-Clause "New" or "Revised" License
9 stars 3 forks source link

F-statistic distribution heatmaps #52

Closed jjc2718 closed 1 year ago

jjc2718 commented 1 year ago

For my committee meeting presentation, Casey and I thought a heatmap showing the distribution of univariate f-statistics across cancer types would be useful. The idea is to show that there are some features that are strongly correlated with the labels in one cancer type and not others, and some features that are correlated across many cancer types.

EGFR is a good example of this - when we select by pan-cancer f-statistic (middle heatmap) we can see that most of the genes/features are strongly correlated in LGG and not as correlated in other cancer types, but when we select by median f-statistic (right heatmap) correlations tend to be more spread out across cancer types.

EGFR_heatmaps

review-notebook-app[bot] commented 1 year ago

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB