selkamand / sigminerUtils

Easily Run Sigminer Analysis and Save Results to sqlite DB

Change UMAP dependency from umap to uwot package #14

Closed selkamand closed 1 month ago

selkamand commented 1 month ago

Both umap and uwot are native R and C++ implementations rather than reticulate wrappers around Python implementations.

Reasons to switch:

  1. umap stores the full original dataset used to build the embedding in the object it returns, which unnecessarily inflates the size of the serialized model. The uwot model structure is much lighter.
  2. uwot makes it easy to return either just the embedding or the full model.
  3. uwot has a dedicated function, umap_transform, for adding new data to an existing embedding, which makes the code clearer (imo) than relying on the predict generic and method dispatch.
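
For illustration, points 2 and 3 can be sketched as follows. This is a minimal example rather than the code used in sigminerUtils: the matrices train and new_data are made-up random data, and n_neighbors = 15 is an arbitrary choice. It assumes the uwot package is installed.

```r
library(uwot)

set.seed(1)

# Hypothetical data (not from sigminerUtils): 100 training rows and
# 10 new rows, each with 5 numeric features.
train <- matrix(rnorm(100 * 5), nrow = 100)
new_data <- matrix(rnorm(10 * 5), nrow = 10)

# Embedding only (the default): a plain numeric matrix of coordinates.
embedding <- umap(train, n_neighbors = 15)

# Full model: a list that contains $embedding plus what is needed to
# project new observations later.
model <- umap(train, n_neighbors = 15, ret_model = TRUE)

# Add new data to the existing embedding with the dedicated function,
# instead of going through predict() and method dispatch.
new_embedding <- umap_transform(new_data, model)
```

A model returned with ret_model = TRUE can be written to disk and read back with uwot's save_uwot and load_uwot functions, which may be relevant when the fitted model needs to be stored alongside other results.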