ga4gh / ga4gh-schemas

Models and APIs for Genomic data. RETIRED 2018-01-24
http://ga4gh.org
Apache License 2.0
214 stars 110 forks source link

Improve expression level interchange model #775

Open david4096 opened 7 years ago

david4096 commented 7 years ago

Sometimes one receives an RNA expression table with genes and samples connected by some expression level. In the case when the units are not known, one can specify this using the ExpressionUnit enum.

However, the normalization may also be unknown. This issue is to change that boolean to an enum to allow for the case when normalization is not known. This will make it easy to correctly interchange expression level data when not all the metadata are available. @saupchurch

The need for this feature arose when trying to find sane defaults when these data are expected to be interchanged. Assuming normalization can dramatically alter the outcome of an analysis.

We should also provide a way to describe both the effective_count for multi-reads in addition to the raw_read_count so that new normalizations can be made.

https://github.com/ga4gh/server/issues/1515#issuecomment-271718265