h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4 stars, 0 forks

Add the ability to run clustering on prepared data from HuggingFace or a GitHub release #40

Open h1alexbel opened 3 months ago

h1alexbel commented 3 months ago

Let's add support for running clustering on datasets/source CSVs that have already been pushed to HuggingFace. Doing so would make our research easy to reproduce.
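To make the request concrete, here is a minimal sketch of what "clustering on prepared data" could look like. It is not the project's implementation. The column names (`stars`, `forks`), the inline sample CSV, the helper names `load_numeric_rows` and `kmeans`, and the choice of plain k-means are all assumptions for illustration; the real pipeline may use different features and algorithms.

```python
import csv
import io
import math
import random


def load_numeric_rows(stream, columns):
    """Read the named columns from a CSV text stream as lists of floats."""
    reader = csv.DictReader(stream)
    return [[float(row[c]) for c in columns] for row in reader]


def kmeans(points, k, iterations=50, seed=0):
    """Plain k-means (Lloyd's algorithm); returns one cluster label per point."""
    rng = random.Random(seed)  # fixed seed so reruns start from the same centroids
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                new_centroids.append(
                    [sum(col) / len(members) for col in zip(*members)]
                )
            else:
                # Keep an empty cluster's centroid where it was.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return labels


# Hypothetical sample standing in for a CSV that was already published;
# the columns are assumptions, not the dataset's actual schema.
SAMPLE_CSV = """repo,stars,forks
a/one,1,0
b/two,2,1
c/three,100,40
d/four,110,45
"""

rows = load_numeric_rows(io.StringIO(SAMPLE_CSV), ["stars", "forks"])
labels = kmeans(rows, k=2)
print(labels)
```

Because `load_numeric_rows` accepts any text stream, the same function would work on a CSV downloaded from HuggingFace or a GitHub release and opened with `open(path, newline="")`. Skipping the collection step and fixing the random seed is what keeps reruns comparable.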