src-d / ml-backlog

Issues belonging to source{d}'s Machine Learning team which cannot be related to a specific repository.
0 stars 3 forks source link

Use language experience by LoC to cluster developers and projects in the same space #77

Closed warenlg closed 5 years ago

warenlg commented 5 years ago

Leverage the language distribution by LoC of both the developers and the repos of a given codebase, and cluster them showing similarities between one another. What has been done for now:

warenlg commented 5 years ago

Results so far https://plot.ly/~warensourced/35/devproject-similarity-based-on-language-experience-loc-normalized-on-sourced-cod/#/

I'm now working on adding the cluster colors and the edges between the points to represent the contributions

vmarkovtsev commented 5 years ago

This is done.