glstott / PMeND

Phylogeny and Metadata Network Database

Investigate patristic distance algorithms #9

Closed glstott closed 2 years ago

glstott commented 2 years ago

The current method is storage-expensive, requires some configuration, and is slower than I'd like. Switching to a separate script for patristic distance calculations and inputting those directly into the graph would be faster but more memory-intensive. Maybe look at both approaches and offer them as options for users.

glstott commented 2 years ago

Two new methods used.

glstott commented 2 years ago

I ended up using the latter option. It performs better and doesn't have a significant memory cost. The first alternative listed above also works well, but it would add a dependency that doesn't seem necessary. If we want to come back to it in the future, we could load in a file generated by a patristic distance script in Python (so I wouldn't need as large a file each time). Alternatively, PATRISTIC could do something similar.
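
For anyone revisiting the standalone-script idea, here is a minimal sketch of what a Python patristic distance script might look like. It is not the code used in PMeND: the toy tree, the child-to-parent dictionary layout, and the function names `path_to_root` and `patristic_distances` are all assumptions made for illustration. The patristic distance between two tips is the sum of branch lengths along the path connecting them, which the sketch finds by walking each tip up to the root and picking the common ancestor that gives the shortest combined path.

```python
from itertools import combinations

# Hypothetical toy tree, stored as child -> (parent, branch_length).
# The root has no entry. This layout is an assumption for illustration.
TREE = {
    "A": ("n1", 0.1),
    "B": ("n1", 0.2),
    "n1": ("root", 0.3),
    "C": ("root", 0.5),
}


def path_to_root(node, tree):
    """Map each ancestor of `node` (and `node` itself) to its distance from `node`."""
    dists = {node: 0.0}
    total = 0.0
    while node in tree:
        parent, length = tree[node]
        total += length
        dists[parent] = total
        node = parent
    return dists


def patristic_distances(tree):
    """Return {(tip_a, tip_b): distance} for every pair of tips in the tree."""
    parents = {parent for parent, _ in tree.values()}
    tips = sorted(n for n in tree if n not in parents)
    paths = {tip: path_to_root(tip, tree) for tip in tips}
    result = {}
    for a, b in combinations(tips, 2):
        # With non-negative branch lengths, the most recent common ancestor
        # is the shared ancestor with the smallest combined distance.
        shared = paths[a].keys() & paths[b].keys()
        result[(a, b)] = min(paths[a][n] + paths[b][n] for n in shared)
    return result


if __name__ == "__main__":
    for (a, b), d in patristic_distances(TREE).items():
        print(f"{a}\t{b}\t{d:.3f}")
```

Each printed row is a tip pair and its distance, which could be written to a file and loaded as edges. This version compares every pair of tips, so it scales quadratically with the number of tips and would need a more efficient traversal for large trees.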