Closed jharenza closed 3 years ago
cc @yuankunzhu and @migbro
For the V19 pbta-histlogies.tsv
file, I made a new column called cancer_group
using the following code:
v19_2 <- v19 %>%
dplyr::mutate(cancer_group = str_extract(harmonized_diagnosis, "[^,]*")) %>%
dplyr::mutate(cancer_group = str_extract(cancer_group, "[^(]*")) %>%
dplyr::mutate(cancer_group = case_when(cancer_group == "High-grade glioma/astrocytoma " ~ "High-grade glioma/astrocytoma",
cancer_group == "Low-grade glioma/astrocytoma " ~ "Low-grade glioma/astrocytoma",
cancer_group == "Dysembryoplastic neuroepithelial tumor " ~ "Dysembryoplastic neuroepithelial tumor",
cancer_group == "Brainstem glioma- Diffuse intrinsic pontine glioma" ~ "Diffuse intrinsic pontine glioma",
cancer_group == "Atypical Teratoid Rhabdoid Tumor " ~ "Atypical Teratoid Rhabdoid Tumor",
TRUE ~ as.character(cancer_group))) %>%
filter(experimental_strategy == "RNA-Seq")
This file was given to @ewafula for merging with the others.
Edited: add language highlight for the code
gene-counts-rsem-expected_count.rds
and gene-expression-rsem-tpm.rds
is up. gtex and tcga files for expected counts was not there, but is for tpm. However, since the table entry for tpm did not list to add tcga and gtex yet, I have left that out.
fusion-arriba.tsv.gz
and fusion-starfusion.tsv.gz
are also now up, with an updated md5sum file. Consensus is on hold until run for kfnbl set
thanks @migbro !
@migbro @zhangb1 for the above remaining MAF files - have we run KF NBL through consensus yet? If not, what is the timeline? For PBTA (CBTN+PNOC), do we have a consensus MAF to which we can append the KF NBL consensus to for release? You can use the v4 histologies file for BS IDs which are included - select for cohort == CBTN | PNOC | GMKF.
@migbro @zhangb1 for the above remaining MAF files - have we run KF NBL through consensus yet? If not, what is the timeline? For PBTA (CBTN+PNOC), do we have a consensus MAF to which we can append the KF NBL consensus to for release? You can use the v4 histologies file for BS IDs which are included - select for cohort == CBTN | PNOC | GMKF.
I can manage that ,since we already have the 4 callers results. I can use the latest consensus app to do that , should be done by today or tomorrow.
Thank you @zhangb1 !
@jharenza I finished kfnbl consensus maf and able to merge them to kfnbl-snv-consensus-mutation.maf.tsv.gz
.
do you want to merge it to pbta consensus merged file? both are CHOP method right? just to confirm.
do you want to merge it to pbta consensus merged file? both are CHOP method right?
yes please, for PBTA - to get bs_ids, you can use the v4 histologies.tsv
and pull WGS/WXS/Panel experimental_strategy
and cohort == CBTN or PNOC
@zhangb1 just wanted to circle back about the consensus file snv-consensus-plus-hotspots.maf.tsv.gz
is this on s3 already ?
@zhangb1 just wanted to circle back about the consensus file
snv-consensus-plus-hotspots.maf.tsv.gz
is this on s3 already ?
Not yet, I switch the work to the Open Target, but I can manager that , I will upload that in 2 days. sorry about that
Thanks for the update 👍
@kgaonkar6 @jharenza
snv-consensus-plus-hotspots.maf.tsv.gz
kfnbl-snv-consensus-mutation.maf.tsv.gz
are updated in the s3://kf-openaccess-us-east-1-prd-pbta/open-targets/v5/ folder
done with #35 - will open new for v6
What data file(s) does this issue pertain to?
all files
What release are you using?
v3
Put your question or report your issue here.
Consensus will be d3b 2/4+hotspots and I think this still needs to be run on KF NBL.
Merge and add to v4 bucket:
s3://kf-openaccess-us-east-1-prd-pbta/open-targets/v4/
Who will complete this?
@zhangb1 for data files @jharenza will upload histologies files from @ewafula