rmattson1008 / ornet

Organellar segmentation, tracking, and network modeling.
MIT License
0 stars 0 forks source link

Show embeddings w umap #28

Open rmattson1008 opened 1 year ago

rmattson1008 commented 1 year ago

Followup to #23. I think it will be able to handle this dataset very well. But are you really understanding the embedding space correctly? What is local structure like in higher (but not super high) R^n? It does not seem like each dim in a vector carries the information split equally, it seems like things that "look" like neighbors may be less related than one would think. Honestly I want to just do PCA too. Find "axes of info" and see if it lines up w any bio interpretation.

magsol commented 1 year ago

Yeah I think a constellation of viz approaches is going to be the best here: UMAP as one, t-SNE as another, PCA as another.

Micky774 commented 1 year ago

Yeah I think a constellation of viz approaches is going to be the best here: UMAP as one, t-SNE as another, PCA as another.

I would actually recommend against t-SNE since it is difficult to tune and often redundant-at-best or inferior to UMAP. Also you may want to consider using UMAP + HDBSCAN to generate a fully-unsupervised density-based hierarchical clustering of the manifold-projected data.

magsol commented 1 year ago

Also you may want to consider using UMAP + HDBSCAN

+1 to this.