Closed kgaonkar6 closed 3 years ago
The following issues were fixed:
LGG,To be classified
pathology_free_text_diagnosis updates should only have harmonized_diagnosis
Notes
to be added Could you also pop #1108 into this PR since it is in the same module?
should be
mutate(integrated_diagnosis = case_when(molecular_subtype == "CRANIO, ADAM" ~"Adamantinomatous craniopharyngioma",
molecular_subtype == "CRANIO, PAP" ~"Papillary craniopharyngioma",
TRUE ~ NA_character_),
I also wanted to check that you plan to update molecular-subtype-integrate
after this PR with updated harmonized_diagnosis
since I do not see those filled in here... in which case, I think we may need a ticket for that one?
In the latest commit, I've updated the typo Adamantinomatous craniopharyngioma and added back meningioma.
About molecular-subtype-integrate
didn't we decide that we will only run the module per data release so that the pbta-histologies.tsv file matches? Maybe I mis-understood, a ticket will be good.
That's possible, but I think the way it is set up now is via an excel sheet matching, right, and there needs to be some logic added for updating the harm dx based on free text dx, rather than just pulling over path dx, right?
That's possible, but I think the way it is set up now is via an excel sheet matching, right, and there needs to be some logic added for updating the harm dx based on free text dx, rather than just pulling over path dx, right?
Hmm I don't think the logic needs to be updated in the integrate step.
Any BS_ID already subtyped ( ie. exists in compiled_molecular_subtypes_with_clinical_pathology_feedback_and_report_info.tsv) is not going to be part of the code that pulls harm_dx/broad_hist/short_hist from the excel sheet. That part is only for the non-subtyped + pathology_diagnosis=="Other" BS_IDs. We could do have molecular-subtyping-integrate
rerun PR to see if that's true or I could have missed some key logic.
gotcha! that makes sense.
Purpose/implementation Section
What scientific question is your analysis addressing?
Adding back the individual pathology_free_text_diagnosis subtypes to the
compiled_molecular_subtypes.tsv
What was your approach?
analyses/molecular-subtyping-pathology/results/compiled_molecular_subtypes.tsv
so we will be updating the values for these samples using outputs from the pathology-free-text-diagnosis terms. Fileanalyses/molecular-subtyping-pathology/results/compiled_molecular_subtypes.tsv
as well since these subtypes don't have molecular-subtyping results FileWhat GitHub issue does your pull request address?
1061
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
ETMR, C19MC-altered
in but is not needed anymore since there are no duplicate subtypes for this patient in v19Is there anything that you want to discuss further?
compiled_molecular_subtypes.tsv
from the hotspots updates / downstream molecular-subtype modulesIs the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Yes
Results
What types of results are included (e.g., table, figure)?
tables
What is your summary of the results?
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.