How to get a relative high confidence results in the circumstance of using a imbalanced data

Hi,

A lot of thanks to you and your team for the great contributions to singlecell DE analysis and making this wonderful package !

I was using Libra to run DE analysis in my own sc-seq dataset.However I have a few questions about how these data type present below influences the final statistical power in finding real DE genes(pesudo methods)

Type one : Imbalanced cell number data when a certain celltype number vary dramatically between biological replicates .

For example：

data like this

Question : Can i choose a cell number ,for instance 1000 or even samller one as a new celltype number for every Biologicalreplicates ,and then resample every Biologicalreplicates to make a balanced data for pesudo-bulk ?

Type two : DE analysis between different celltype

Question : In my understandings , pesudo-methods are better than singcell-methods in the circumstance of making DE within a certain celltype ,is it also a good method in the circumstance of making DE between different celltype (find important marker gene)?

Forgive my poor english expression and awful question format , Hope to get your reply !

Thanks Yufeng

neurorestore / Libra

How to get a relative high confidence results in the circumstance of using a imbalanced data #33

Type one : Imbalanced cell number data when a certain celltype number vary dramatically between biological replicates .

Type two : DE analysis between different celltype