Hello. Thank you for the development of this amazing tool.
I have one question about conducting the downstream analysis.
Is it statistically acceptable to subset my AnnData object and full_score.gz file before running the downstream analysis?
The workflow I have in mind is:
1. Run compute-score on my entire dataset to find cell types (e.g. cell type A) that are highly associated with a given trait.
2. Subset both the AnnData object and the full_score.gz file to cell type A cells only, then run perform-downstream (specifically --gene-analysis) on the subset to find genes that are correlated with the trait within cell type A.
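To make the second step concrete, here is a minimal sketch of the score-file subsetting I have in mind. It assumes that full_score.gz is a gzipped, tab-separated table with a header row and the cell ID in the first column; the function name `subset_score_file`, the file names, and the `cell_type` column are my own placeholders, not part of the tool.

```python
import csv
import gzip


def subset_score_file(src_path, dst_path, keep_cells):
    """Copy the header plus the rows whose first column is in keep_cells.

    Assumes a gzipped TSV with the cell ID in the first column.
    Returns the number of data rows written.
    """
    keep = set(keep_cells)
    n_written = 0
    with gzip.open(src_path, "rt", newline="") as src, \
            gzip.open(dst_path, "wt", newline="") as dst:
        reader = csv.reader(src, delimiter="\t")
        writer = csv.writer(dst, delimiter="\t", lineterminator="\n")
        writer.writerow(next(reader))  # keep the header row unchanged
        for row in reader:
            if row and row[0] in keep:
                writer.writerow(row)
                n_written += 1
    return n_written


# Hypothetical usage, with the cell IDs taken from the AnnData object:
#   cells_a = adata.obs_names[adata.obs["cell_type"] == "A"]
#   subset_score_file("trait.full_score.gz", "trait.A.full_score.gz", cells_a)
#   adata_a = adata[cells_a].copy()
```

The AnnData object would be subset with the same cell IDs, so that both inputs to perform-downstream describe exactly the same cells.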
Thank you in advance :)