Predictions contain Node1, Node4 labels

Hi Kevis,

Thank you for using CHETAH. Please see the vignette: https://bioconductor.org/packages/release/bioc/vignettes/CHETAH/inst/doc/CHETAH_introduction.html For each cell, CHETAH walks the classification tree ( PlotTree(input) ) and at each node (split in the tree) decides whether a cell belongs to the right or left side based on similarity (correlation). When a cell is as similar/dissimilar to both branches (sides of the tree), the classification stops there. This means that CHETAH knows to one of the cell types below that node, but not which. For example, in the vignette. If a cell is assigned to node 6, CHETAH is confident that the cell is a T cell (all cells below node 6 are T cells), but not which specific type. This could for example be, because the cell of interest is a T cell subtype that is not in the reference (a gamma-delta T cell for example).

To stop this behaviour, just run the following: input <- Classify(input, 0) . Just be aware, this will likely increase the number of incorrect classifications. Feel free to reopen the issue, or open a new issue if you have more questions. https://bioconductor.org/packages/release/bioc/vignettes/CHETAH/inst/doc/CHETAH_introduction.html#confidence-score

jdekanter / CHETAH

Predictions contain Node1, Node4 labels #20