h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License

feat(#37): agglomerative, dbscan, gmm for numerics.csv, simple plots #44

Closed h1alexbel closed 1 week ago

h1alexbel commented 1 week ago

ref #37


PR-Codex overview

This PR adds three clustering algorithms (Agglomerative, DBSCAN, and GMM) and updates the existing KMeans clustering with improved output, such as saving the members of each cluster to .txt files.
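To illustrate the "saving cluster members to .txt files" step, here is a minimal sketch using only the Python standard library. It is not the code from this PR: the function names `group_members` and `write_cluster_members`, the file naming scheme, and the `noise.txt` file are all hypothetical. It assumes the labels are integers of the kind returned by scikit-learn's `fit_predict`, where DBSCAN marks noise points with `-1`.

```python
from collections import defaultdict
from pathlib import Path


def group_members(names, labels):
    """Map each cluster label to the list of member names, in input order."""
    if len(names) != len(labels):
        raise ValueError("names and labels must have the same length")
    clusters = defaultdict(list)
    for name, label in zip(names, labels):
        # int() also accepts NumPy integer labels.
        clusters[int(label)].append(name)
    return dict(clusters)


def write_cluster_members(names, labels, out_dir):
    """Write one .txt file per cluster, one member name per line.

    Returns the list of written paths, ordered by cluster label.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for label, members in sorted(group_members(names, labels).items()):
        # Assumption: label -1 means "noise" (DBSCAN convention), so it
        # gets its own file instead of a numbered cluster file.
        filename = "noise.txt" if label == -1 else f"cluster_{label}.txt"
        path = out / filename
        path.write_text("\n".join(members) + "\n", encoding="utf-8")
        written.append(path)
    return written
```

Calling `write_cluster_members(repo_names, labels, "clusters")` with one label per repository would produce files such as `clusters/cluster_0.txt`. Keeping the grouping separate from the file writing means the same helper can serve any of the four algorithms, since each yields one integer label per row.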

Detailed summary

The following files were skipped due to too many changes: steps/dbscan_numerical.py


h1alexbel commented 1 week ago

@rultor merge

rultor commented 1 week ago

> @rultor merge

@h1alexbel OK, I'll try to merge now. You can check the progress of the merge here

rultor commented 1 week ago

> @rultor merge

@h1alexbel Done! FYI, the full log is here (took me 13min)