reconstrue / single_cell

Single cell analysis tools built to run on Jupyter, especially Colab
http://reconstrue.com
Apache License 2.0
1 stars 0 forks source link

McInnes et alia: UMAP and HDBSCAN #12

Open JohnTigue opened 4 years ago

JohnTigue commented 4 years ago

See also #28 (comparing clusters to Allen cell types)

JohnTigue commented 4 years ago

HDBSCAN

McInnes PCA(50) => HDBSCAN

JohnTigue commented 4 years ago

UMAP => hdbscan

This sounds like it could scale rather well:Impact of cancer mutational signatures on transcription factor motifs in the human genome

We first used the UMAP dimensionality reduction method on the table of exposure values of the 2708 samples, and then defined clusters using the hdbscan method, as implemented in the largeVis R-package.

JohnTigue commented 4 years ago

How to filter outliers to train a model on cleaner data?

How about tune UMAP to accentuate outliers. The HDBSCAN cluster. The take only the clustered nodes and run them through UMAP.

https://youtu.be/DinhTHoDRjk?t=1038