dviraran / SingleR

SingleR: Single-cell RNA-seq cell types Recognition (legacy version)
GNU General Public License v3.0
275 stars 98 forks source link

Positive correlation between the identity score of the right identities and the rest #159

Open cesarsierran opened 8 months ago

cesarsierran commented 8 months ago

Thank you for this great package, it has worked very good in my hands with the identities in my mouse brain PFC dataset correctly assigned.

However, looking more carefully into the data I was surprised by the following obvservation: cells with a high "score" for their assigned identity also show a high score for the rest ("wrong") of identities and viceversa, as shown in the plot.

correlation

I would have expected the opposite behaviour: those cells with a higher score for their right identity should have a lower score for the "wrong" identities, as I would expect their transcriptomes to differ more. This same observation is replicated using different parameters in your singleR and also using my custom set of markers.

Do you have any thoughts on this?

Thank you in advance!