Open sreichl opened 11 months ago
check if scRNA-seq data from SCCAF Teichmann paper works: https://www.nature.com/articles/s41592-020-0825-9 specifically their benchmarking data: https://github.com/SCCAF/sccaf_example
cellxgene: https://cellxgene.cziscience.com/ HCA Data portal: https://data.humancellatlas.org/
Start "easy" with a small and a large PBMC data set i.e., very clearly defined "ground truth"
compare to their scRNA-seq specific clustering approach (quite similar ie iterative RFs) https://www.biorxiv.org/content/10.1101/2024.01.18.576317v1.full
look for clustering benchmark datasets (from various domains) to test the approach and put the result into the documentation) → Clustering benchmark papers