Closed simonhkswan closed 2 years ago
Metrics: Bhattacharyya distance, Total Variation distance, Ideal number of clusters according to the average silhouette method
As there are many clustering algorithms, I will be limiting the scope to k-means for now as it is a very popular algorithm, but requires the knowledge of an optimal number of clusters in order to perform optimally
For the tests of the new metrics, would be good to follow (you can search this approach too):
Let's have a look at
pytest.fixture
s too.