Simulate distribution - Githubissues

KevinMenden / scaden

Deep Learning based cell composition analysis with Scaden.

MIT License

71 stars 25 forks source link

Hi @khkk378 ,

yes that might make sense as an additional option. We intentionally didn't do it because it of course introduces some bias into the training set. If you have only one dataset for data simulation, and this is somewhat weirdly distributed, that could be problematic. And scRNA-seq data is not the best tool for estimating cell type fractions, sometimes cells are also selected.

So would be an interesting thing to try as an option - I believe that the default should still be random fractions.

But if you want to cook up a PR, I would be happy to include that :)

KevinMenden / scaden

Simulate distribution #89