Closed kgaonkar6 closed 3 years ago
@jharenza I tried to update the colors to the palettes but seems like I'm missing something.
For example, I updated the level 5 which is the RNA-Seq level
level5 <- tm %>%
dplyr::filter(level == 5 ) %>%
dplyr::mutate( color = dplyr::if_else(level5 == "RNA-Seq", "#CCCCCC", "#FFFFFF"))
the level should have a gray and white color, but it has a variety of other colors.
Not sure what I'm missing, can you and/or @cbethell give me some pointers?
Thanks to @jharenza I think I've made some major progress with the color assignment as of https://github.com/AlexsLemonade/OpenPBTA-analysis/pull/1161/commits/73a91b364cdfe612a2b0abad64975bb7b9610353! 🙌
We also discussed adding Kids_First_Biospecimen_ID because we wanted to specify tumor_descriptors, but now the assays are mutually exclusive. Maybe sample_id
will help capture the multiple assays available per specimen ?
Also I think we need specific color codes for tumor_descriptor?
[1] "Initial CNS Tumor" "Progressive"
[3] "Progressive Disease Post-Mortem" "Recurrence"
[5] "Second Malignancy"
oh yes, that's right - we should do this by sample_id
instead of BS_ID to capture DNA/RNA pairs!
For:
Also I think we need specific color codes for tumor_descriptor? [1] "Initial CNS Tumor" "Progressive" [3] "Progressive Disease Post-Mortem" "Recurrence" [5] "Second Malignancy"
I don't remember if we had this code here or maybe I am now mixing up OpenPBTA and OpenTargets - I would suggest we use the following since we don't have a palette yet (taken from the simpsons palette we are using for violin plots):
Initial CNS Tumor
--> "#709AE1FF"Progressive or Progressive PM
--> "Progressive" | "Progressive Disease Post-Mortem" --> "#075149FF"Recurrence
--> "#FD8CC1FF"Second Malignancy
--> "#FD7446FF"Can you add this palette to the figures section and pull it in and if we need to change, we do it there?
In the latest commits I've updated the plot to use sample_id and added figures/palettes/tumor_descriptor_palette.tsv for the tumor_descriptor values as mentioned above.
hmm, seems to error out in CI with testing
but runs ok locally with testing data. Rerunning CI to see if it fixes it.
Thanks @jharenza !
However, it seems the figures
code to plot sample distribution errors out in CI I've removed Rscript figures/scripts/fig1-sample-distribution.R
from CI . Since we will be coping over the sunburst pie plot from here to figures.
Does that sound ok @jaclyn-taroni ?
If we're not going to use figures/scripts/fig1-sample-distribution.R
, which is what I understand from your comment, we could also just remove that script from the repository in addition to removing it from CI (that seems less confusing to me).
ok removed figures/scripts/fig1-sample-distribution.R
Purpose/implementation Section
What scientific question is your analysis addressing?
Update sample distribution multi-layer plot centered on cancer_group.
What was your approach?
So as per requested multi-layer plot in #1144 I've added
What GitHub issue does your pull request address?
1144
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Available
andNot Available
so each layer is color coded the same way. I did try putting in "WGS" when WGS is available in 3rd layer but then the color for "WGS","WXS" and "RNA-Seq" would be different so could be too much color wise? Let me know if we would like some other representation.cancer_group
is the only histologies term we are using in the latest plot.Is there anything that you want to discuss further?
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Yes
Results
What types of results are included (e.g., table, figure)?
figures
What is your summary of the results?
multi-layer sample distribution plot, example vis:
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.