blab / pathogen-embed

Create reduced dimension embeddings for pathogen sequences
https://pypi.org/project/pathogen-embed/
MIT License
1 stars 0 forks source link

Enable clustering by distance matrix input #33

Closed huddlej closed 1 month ago

huddlej commented 1 month ago

Adds a mutually exclusive input argument, --distance-matrix, to the pathogen-cluster command and corresponding logic to load this matrix and find clusters from its using HDBSCAN's precomputed metric option. When an embedding is provided instead, pathogen-cluster runs HDBSCAN with the default Euclidean distance metric.