What analysis module should be updated and why?
Update rna-seq-expression-summary-stats module
What changes need to be made? Please provide enough detail for another participant to make the update.
- Check out the branch for PR #340 and create a branch off it for this update. The download-data.sh bash script in this branch is already updated for v12 data downloads.
    git checkout -b v12-post-release origin/v12-post-release
    git branch v12-tpm
- Update the module to include the TCGA TPM gene expression matrix. Read in both the gene-expression-rsem-tpm-collapsed.rds and tcga-gene-expression-rsem-tpm-collapsed.rds matrices, then merge them into a single data frame so they can be analyzed together. Note that there are 26 genes in the OPC matrix that are not in the TCGA matrix. When merging, keep only the genes common to both matrices, so that expression is measured for the same set of genes across all samples.
- Rerun the module using the v12 data release files.
- Submit your PR to the v12-post-release branch.
- Output files will likely be larger than the maximum file size allowed on GitHub. Include a .gitignore in the results/ directory.
What input data should be used? Which data were used in the version being updated?
When do you expect the revised analysis will be completed?
1 day
Who will complete the updated analysis?
@sangeetashukla
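For the merge step described above, here is a rough sketch of one way to keep only the common genes. It assumes both .rds files load as data frames with gene symbols as row names and samples as columns, and that sample IDs do not collide between the two matrices. The helper name merge_common_genes, the data/ path in the comments, and the toy values are illustrative only, not taken from the module.

```r
# In the module, the two matrices would be read with readRDS(), e.g.
# (the "data" directory here is an assumed location):
#   opc_tpm  <- readRDS(file.path("data", "gene-expression-rsem-tpm-collapsed.rds"))
#   tcga_tpm <- readRDS(file.path("data", "tcga-gene-expression-rsem-tpm-collapsed.rds"))
# Toy stand-ins are used below so the sketch runs on its own.
opc_tpm <- data.frame(
  sample_a = c(1.0, 2.0, 3.0),
  sample_b = c(4.0, 5.0, 6.0),
  row.names = c("GENE1", "GENE2", "GENE3")
)
tcga_tpm <- data.frame(
  tcga_x = c(7.0, 8.0),
  row.names = c("GENE1", "GENE3")
)

# Hypothetical helper: subset both matrices to the shared genes (same order),
# then bind the sample columns side by side.
merge_common_genes <- function(first, second) {
  common_genes <- intersect(rownames(first), rownames(second))
  cbind(
    first[common_genes, , drop = FALSE],
    second[common_genes, , drop = FALSE]
  )
}

merged_tpm <- merge_common_genes(opc_tpm, tcga_tpm)
```

In this toy case GENE2 is dropped because it is missing from the TCGA stand-in. With the real files, the number of rows dropped from the OPC matrix should match the 26 genes noted above.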