quantile.use argument - Githubissues

dviraran / SingleR

SingleR: Single-cell RNA-seq cell types Recognition (legacy version)

GNU General Public License v3.0

271 stars 98 forks source link

Thanks.

You can see in supp info 1 how SingleR makes its decisions. Each cell type in the reference may have multiple samples, and SingleR chooses the top cell types. For each single-cell there you can imagine such boxplots. The question is how to order them. One option is based on the median, but the problem is that the samples associated with the cell type may be a mix of multiple subsets, and taking the median might be problematic. Another approach is just the max (1), but this can lead to false results because of randomness. I played with 0.75, 0.8, 0.9, and they all gave me similar results more or less. The intuition I use - if there are many samples for each cell type use a high value since there might be multiple subtypes combined together. This is why I use 0.9 for the 'main cell types' option.

Hope this makes sense.

Best, Dvir

dviraran / SingleR

quantile.use argument #51