bacpop / PopPUNK

PopPUNK 👨‍🎤 (POPulation Partitioning Using Nucleotide Kmers)
https://www.bacpop.org/poppunk
Apache License 2.0

Improvements to DBSCAN fitting #301

Closed by nickjcroucher 7 months ago

nickjcroucher commented 7 months ago

Motivated by trying to fit a DBSCAN model to a large dataset. Problems were:

If you approve these changes conceptually, I'll add tests and docs. At the moment, local tests fail on the mandrake clustering step. I don't know whether these failures are related to the failing tests for mandrake or to a local installation problem, so I'll see what the CI outcomes are. Hence the slightly early-stage PR, sorry!