Closed LorenzoMerotto closed 9 months ago
Hi @LorenzoMerotto thanks for sharing this data. I understand your point, but I would like to have a deeper look also at the other results.
Meanwhile, here are some unstructured thoughts:
It's a really important point raised here by @LorenzoMerotto and I agree that correlation coefficients do not always best reflect the outcome. As @FFinotello points out neither does the sum of squares since it will not account for global shifts in the same way as the correlation does. So indeed both measures are complementary. I'd propose to also fit a simple linear model as we considered for evaluating Simbu before. In a linear model, one can look at the intercept as a global shift and the slope as a global bias. Maybe intercept, slope and sum of squares together will give a good impression of the results?
As mentioned during the last meeting, I post here to discuss about the fairness of the correlation coefficient as the evaluation metric. In this case I selected the 'uniform' scenario, without mRNA bias added (since I didn't had TPM values yet).
What we can see is that for the CD4/CD8 T cells the correlation coefficients are not so good. Since we have a few cell types the simulated fractins for each one would be low. With the correlation coefficients we get an insight on how the relative differences across samples are maintained in the estimated cell proportions, but I wonder if this is a right evaluation metric in this scenario.
Scatterplot with the individual cell types
Scatterplot with all cell types together
Simulated and reconstructed fractions
@FFinotello proposed to consider the sum of squared errors, which I think would be good.
Alternatives?