tSNE/UMAP don't look right in the sc workbench

beamilon commented 1 year ago

@jorvis @adkinsrs @songeric1107 I started a new analysis with the dataset "E14, mouse, scRNA-seq, cochlear epithelium (Kelley)". The clustering looks horrible: it doesn't look like a clustering at all, and everything seems to overlap. I noticed this when I prepared the slides for the EARssentials workshop but didn't think much of it at the time. Katie noticed the same thing with different parameters and different datasets, even her own. We may add more examples tomorrow.

songeric1107 commented 1 year ago

@beamilon, that is what I mentioned before: the problem is caused by the normalized values. All the Kelley datasets contain normalized values, not raw values, so it is not appropriate to use the sc workbench to re-normalize the log-normalized values. You should use the primary analysis for those datasets.
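As a rough illustration of the raw versus normalized distinction above, here is a minimal sketch of a heuristic check. It assumes that raw count data are non-negative whole numbers while log-normalized values generally are not. The function name `looks_like_raw_counts` and the toy values are hypothetical and not part of gEAR, and the check can be fooled (for example by rounded normalized values or by non-integer estimated counts), so treat it as a hint only.

```python
import math


def looks_like_raw_counts(values, tolerance=1e-9):
    """Heuristic: return True if every value is a non-negative whole number.

    `values` is any iterable of numbers, e.g. a sample of matrix entries.
    An empty input returns False because nothing can be concluded from it.
    """
    saw_value = False
    for v in values:
        saw_value = True
        # A negative or fractional entry suggests the data were transformed.
        if v < 0 or abs(v - round(v)) > tolerance:
            return False
    return saw_value


# Toy example (made-up values, not taken from any real dataset).
raw_example = [0, 3, 12, 0, 1, 7]
log_example = [math.log1p(v) for v in raw_example]  # log(1 + x) transform

print(looks_like_raw_counts(raw_example))
print(looks_like_raw_counts(log_example))
```

A check along these lines could be run on a sample of expression values before deciding whether the normalization step should be applied again.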

jorvis commented 1 year ago

This is part of an ongoing discussion about updating the datasets to record which are normalized and which aren't, then disabling the parts of the interface where they shouldn't be used.

beamilon commented 1 year ago

Does it make sense that the tSNE function (instead of UMAP) works great? I know these are two different methods, but why would one be so messed up and not the other (top image)? I also used the raw datasets from Jan, and when choosing UMAP the result is really not good (middle image). Choosing tSNE instead brings up something like the expected clustering (bottom image). I don't think the raw versus normalized matrix is the problem. Something is wrong with the UMAP function, and that was not the case before.

songeric1107 commented 1 year ago

@jorvis, I compared the saved analysis from before with a new one that I ran just now. I agree with @beamilon that there might be a bug in the UMAP display: the result looks different even though I used the same parameters.

Test dataset with raw values: https://umgear.org/analyze_dataset.html?dataset_id=e084843c-32b0-4551-7307-0942eaa45756

Saved analysis from before: [image: Screen Shot 2022-07-28 at 10 48 50 AM]

New analysis today: [image: Screen Shot 2022-07-28 at 10 51 50 AM]