LieberInstitute / Habenula_Pilot

habenulaPilot project code repository
0 stars 0 forks source link

snRNA-seq re-processing: clustering #4

Closed lcolladotor closed 2 years ago

lcolladotor commented 2 years ago

After #3, we'll continue with tSNE / UMAP and clustering. So this will involve creating code/09_snRNA-seq_re-processed/03_clustering.R.

Briefly, Erik computed tSNE / UMAP and used the PCs he got from the poisson pearson residuals then ran 4 graph-based clustering options: with k nearest neighbors 5, 10, 20 and 50. Then he plotted a few marker genes and chose k = 20 to continue.

Matt and Louise:

I think that we should use the strategy Matt and Louise used and go with k = 20 from the beginning. Let's see what the marker gene plots look like for the set of:

We could also compare the resulting prelimCluster and collapsedCluster labels with the clusters Erik had created. This can be done with addmargins(table()) for example or with a heatmap.

We'll examine these plots with everyone involved.

Our resulting object from this script should have the prelimCluster and collapsedCluster labels. We'll then make a new one that will add the cellType and cellType.broad columns based on what we decide with everyone on how to label each collapsedCluster (or similar spelling: use the ones Matt & Louise have in the final published objects, not the intermediate column names; Louise can tell you which ones they are) .