cognoma / cancer-data

TCGA data acquisition and processing for Project Cognoma
Other
20 stars 28 forks source link

Treehouse Childhood Cancer Initiative #39

Open gwaybio opened 6 years ago

gwaybio commented 6 years ago

New, publicly available dataset of 11,078 RNAseq + clinical childhood cancer tumors.

Xena data

Blog Post

This will open up a lot of analysis opportunities - exciting it is now available!

dhimmel commented 6 years ago

Nice! It looks like the release includes:

But mutation data is not available? Or @gwaygenomics is mutation data available elsewhere, do we know?

gwaybio commented 6 years ago

Mutation data is available as sequencing but is under controlled access - not sure if there are plans on making mutation calls available.

On a closer inspection, it looks like most of the samples are the same TCGA tumors. There are 732 TARGET tumors and 549 Treehouse tumors.

Here's the TARGET tumor breakdown:

Tumor Count
acute myeloid leukemia 224
acute lymphoblastic leukemia 194
neuroblastoma 162
wilms tumor 123
clear cell sarcoma of the kidney 11
clear cell carcinoma of the kidney 2

A more thorough clinical data exploration in this notebook

gwaybio commented 6 years ago

Update - It looks like some variant calls are available in the [target data matrix](looks like some mutation data are available as MAF calls).

including:

  1. ALL (ftp://caftpd.nci.nih.gov/pub/OCG-DCC/TARGET/ALL/WXS/Phase2/L3/mutation/)
  2. AML (ftp://caftpd.nci.nih.gov/pub/OCG-DCC/TARGET/AML/WXS/L3/mutation/BCM/VerifiedSomatic/)
  3. NBL (ftp://caftpd.nci.nih.gov/pub/OCG-DCC/TARGET/NBL/WXS/L3/mutation/Broad/VerifiedSomatic/)
  4. WT (ftp://caftpd.nci.nih.gov/pub/OCG-DCC/TARGET/WT/WXS/L3/mutation/BCM/VerifiedSomatic/)
dhimmel commented 6 years ago

@gwaygenomics are there any files with mutation calls for specific samples or are all the mutation datasets just summaries?

gwaybio commented 6 years ago

are there any files with mutation calls for specific samples or are all the mutation datasets just summaries?

Yes, the data are there for specific samples