keaan95 / learning

0 stars 0 forks source link

GDAC Firehose Data Background #3

Open keaan95 opened 7 years ago

keaan95 commented 7 years ago

TCGA offline in July of 2016.

GDAC Broad Institute provides DATA snapshots every 2 months. Analysis runs 3 per year. OV, COAD/READ, LAML are hg18 (else is hg19).

RSEM (RNA-Seq by Expectation-Maximization) is considered to be a better estimation method.

API - fine grained retrieval of sample-level data by genes.

md5 hating algorithm to provide assurance that a transferred file has arrived intact. Store a one-way hash of password, often with key stretching.

keaan95 commented 7 years ago

API useful for clinical data. Search by genes for individual samples.