lbehnke / hierarchical-clustering-java

Implementation of an agglomerative hierarchical clustering algorithm in Java. Different linkage approaches are supported.
141 stars 79 forks source link

Performance updates #5

Closed taer closed 10 years ago

taer commented 10 years ago

Ran into some performance issues when running with a large dataset. In this case, we had about 2055 zip codes for Texas. It was spending a majority of the time in distanceMap.remove() method iterating over the array.

Now it completes in about 35 seconds while the arraylist version took 1482

lbehnke commented 10 years ago

Thanks for your contribution!