As of 8f3ed5fa0fc05dad381adda79e2cf502fe9e43bc, the memento diff expr method uses all cube dimensions as covariates when performing the computation. However, it may desirable to support allowing the user to only specify a subset of dimensions as the covariates.
To support diff expr using only a subset of dimensions, we must:
Update the cube to replace the sem value (standard error of the mean) with statistics that can be aggregated to compute the sem: sum (sum of expression values), sumsq(sum of squares of expression values).
Update the diff expr computation to compute sem on the fly using sum and sumsq (how)
Update the compute_all function to take a list of covariates. The computed design matrix should be created for the specified covariates.
Update the query_estimators function to perform the necessary row aggregations to compute the sem for each distinct tuple of covariate/dimension values.
Update the compute_all_estimators_for_gene function to remove the dense_gene_data() function call. See comment.
As of 8f3ed5fa0fc05dad381adda79e2cf502fe9e43bc, the memento diff expr method uses all cube dimensions as covariates when performing the computation. However, it may desirable to support allowing the user to only specify a subset of dimensions as the covariates.
To support diff expr using only a subset of dimensions, we must:
sem
value (standard error of the mean) with statistics that can be aggregated to compute thesem
:sum
(sum of expression values),sumsq
(sum of squares of expression values).sem
on the fly usingsum
andsumsq
(how)compute_all
function to take a list of covariates. The computed design matrix should be created for the specified covariates.query_estimators
function to perform the necessary row aggregations to compute thesem
for each distinct tuple of covariate/dimension values.compute_all_estimators_for_gene
function to remove thedense_gene_data()
function call. See comment.