After discussions between @komalsrathi , Diskin Lab, @migbro, and @zhangb1 , we found that the collapse-rnaseq post GTEX and TCGA liftover has a bug in that there are genes missing from the collapsed count and tpm matrices. For example, "CD99" in the below pre-collapse counts file for TCGA:
Which new datasets are being added with this release?
What is the sample breakdown (number of WGS, WXS, RNA-Seq, Panel, Methylation, other)?
Same as v14
What module(s) generated any new files to include in the release? Has that module been added to the analysis/README.md, and to CI?
NA
Are you aware of any modules impacted by the file(s) change(s)? Describe if the file name is changed.
No
What data file(s) are added/updated/removed in this release?
GTEX and TCGA counts and TPM matrices will be updated
[Pre-release files]
[Commit files]
[Bed files and sample mapping]
[File descriptions and notes]
Any additional notes to add for discussion?
After discussions between @komalsrathi , Diskin Lab, @migbro, and @zhangb1 , we found that the
collapse-rnaseq
post GTEX and TCGA liftover has a bug in that there are genes missing from the collapsed count and tpm matrices. For example, "CD99" in the below pre-collapse counts file for TCGA:But after collapse, only
CD99L2
exists.We would like to update these 4 files to contain all genes which are non-0.