AlexsLemonade / OpenPBTA-analysis

The analysis repository for the Open Pediatric Brain Tumor Atlas Project
Other
100 stars 67 forks source link

Updated analysis: oncoprint landscape missing SMARCB1 for embryonal (ATRT) #1184

Closed jharenza closed 2 years ago

jharenza commented 3 years ago

What analysis module should be updated and why?

oncoprint landscape - the embryonal tumor plot for ATRTs is missing SMARCB1, the major driver mutation

What changes need to be made? Please provide enough detail for another participant to make the update.

Not sure why this is missing - see this pedcbio page - we seem to be missing the variant calls and two homodeletions. It is possible some of the samples in pedcbio do not overlap with the independent specimen list, but we need to investigate this, as missing this key alteration is a red flag. SMARCB1 is in the GOI list here: https://github.com/AlexsLemonade/OpenPBTA-analysis/blob/d31c927a27813ec0b8032fbe768002f31723636f/analyses/oncoprint-landscape/data/embryonal-tumor_goi_list.tsv#L63

What input data should be used? Which data were used in the version being updated?

v21

When do you expect the revised analysis will be completed?

this week?

Who will complete the updated analysis?

@runjin326

jaclyn-taroni commented 3 years ago

We don’t use the whole GOI list by default and in the publication plots we use the top 20 genes when using a GOI list. Since all embryonal tumors are plotted together, it would not surprise me if the top 20 cutoff is the culprit.

First step should check the underlying files used for the oncoprints, though.

jharenza commented 3 years ago

We don’t use the whole GOI list by default and in the publication plots we use the top 20 genes when using a GOI list. Since all embryonal tumors are plotted together, it would not surprise me if the top 20 cutoff is the culprit.

First step should check the underlying files used for the oncoprints, though.

That's true, but I also see genes mutated in only one sample in those plots, and with there being a few ATRTs in the plot, I would have thought we'd capture some SMARCB1, as we do in pedcbio. I think we also need to check whether the samples in pedcbio harboring the mutations exist in the oncoprint and if so, why aren't they showing up here. I think maybe this is the culprit, but want to be sure so we can explain it.

runjin326 commented 3 years ago

@jharenza, so I looked into this - and the reason is due to the top20 filter for GOI list.

Let me know if there are anything else that you want me to look into.

jharenza commented 3 years ago

Thanks for this! We may want to prioritize the genes being shown in this specific plot - let me think about it.

jharenza commented 2 years ago

going to close this since these genes are not technically missing