cognoma / cancer-data

TCGA data acquisition and processing for Project Cognoma

Add disease acronyms and update covariates.tsv #27

Closed: dhimmel closed this pull request 8 years ago

dhimmel commented 8 years ago

Manually created download/diseases.tsv from @gwaygenomics's tcga_dictionary.tsv file at https://git.io/vPvTb. See #26.

Added an acronym column to samples.tsv. covariates.tsv now uses the acronym rather than the full disease name, which gives more manageable column names.
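For reviewers, here is a rough sketch of the acronym change using pandas. It is not the notebook code: the toy tables, the `disease` and `sample_id` column names, the `acronym_` prefix, and the helper names `add_acronym` and `encode_acronym` are all assumptions made for illustration.

```python
import pandas as pd

# Hypothetical stand-in for download/diseases.tsv (column names are assumed).
diseases = pd.DataFrame({
    "acronym": ["BRCA", "LUAD"],
    "disease": ["breast invasive carcinoma", "lung adenocarcinoma"],
})

# Hypothetical stand-in for samples.tsv before the acronym column exists.
samples = pd.DataFrame({
    "sample_id": ["S1", "S2", "S3"],
    "disease": [
        "breast invasive carcinoma",
        "lung adenocarcinoma",
        "breast invasive carcinoma",
    ],
})


def add_acronym(samples, diseases):
    """Attach the acronym for each sample's disease via a left join."""
    # validate="many_to_one" raises if a disease name maps to several acronyms.
    return samples.merge(diseases, on="disease", how="left", validate="many_to_one")


def encode_acronym(samples):
    """One-hot encode the acronym so covariate columns get short names."""
    dummies = pd.get_dummies(samples["acronym"], prefix="acronym", dtype=int)
    dummies.index = samples["sample_id"]
    return dummies


samples_with_acronym = add_acronym(samples, diseases)
covariates = encode_acronym(samples_with_acronym)
print(covariates)
```

With acronyms, an indicator column reads `acronym_BRCA` instead of carrying the full disease name, which is the readability gain this change is after.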

Simplified parts of 4.covariates.ipynb. Moved n_mutation computation for samples to 2.TCGA-process.ipynb.
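As a rough illustration of the per-sample mutation count, the sketch below assumes a sample-by-gene matrix of 0/1 mutation calls and sums across genes. The toy matrix, the `n_mutation` series name, and the helper `count_mutations` are hypothetical and may not match what 2.TCGA-process.ipynb does.

```python
import pandas as pd

# Hypothetical sample-by-gene mutation matrix (1 = mutated, 0 = not mutated).
mutation = pd.DataFrame(
    [[1, 0, 1], [0, 0, 0], [1, 1, 1]],
    index=pd.Index(["S1", "S2", "S3"], name="sample_id"),
    columns=["TP53", "KRAS", "PIK3CA"],
)


def count_mutations(mutation):
    """Count mutated genes per sample by summing each row."""
    return mutation.sum(axis="columns").rename("n_mutation")


n_mutation = count_mutations(mutation)
print(n_mutation)
```

Computing the count once during processing means later steps can read it from the samples table instead of reloading the full mutation matrix.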

dhimmel commented 8 years ago

@stephenshank, I made some changes to our covariates, so this pull request may be of interest to you. @gwaygenomics is also a good reviewer regarding acronyms.

dhimmel commented 8 years ago

@gwaygenomics ready for your review again.