Open msto opened 4 years ago
Hmm, seems like the sparse case missed the fact that we need n_features for correlation distances. The "special" thing is likely the fact that for less than 4096 samples it will just compute all pairs distances (since this is cheaper when the dataset size is small). I'll see if I can get this fixed when I get some time.
I am having exactly the same error here! Has this been fixed? What would be the roadmap to fix? I am happy to give it a go.
I also have a similar error even when metric=Euclidean
, but this is related to pickling it. I will try and get a minimal example an open an issue about it.
Hi,
I'm encountering an error when attempting to run UMAP using correlation as the distance metric. I've reduced my code to a minimal reproducible example below.
This results in the following error:
The error does not appear when using the default Euclidean metric, nor when providing a dense numpy matrix.
The error appears to start occurring when the matrix has at least 4,096 rows, and is unaffected by the number of features.
Is there anything special about exceeding 2^12 rows that might be causing this?
Thanks!