Open mrnabiz opened 1 year ago
The link in your description does not work; it returns a 404.
@Mengjun74 Thanks for your comment. The problem was actually caused by the K-means model running on the backend. Please try again: https://github-user-segmentation.onrender.com/
After some data wrangling, I used the K-means clustering algorithm for user segmentation and PCA to reduce the dimensionality of the interaction records so they could be visualized with Plotly.
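For anyone who wants to see the clustering step in isolation, here is a minimal, dependency-free sketch of K-means (Lloyd's algorithm). It is not the code running in the app: the `users` rows are made-up per-user counts of (commits, pull requests, reviews), the `kmeans` and `sq_dist` helpers are hypothetical names, and the farthest-first seeding is my own choice to keep the run deterministic. The PCA projection and the Plotly plot are left out.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iterations=50):
    """Lloyd's algorithm with farthest-first seeding. Returns (centroids, labels)."""
    # Seeding (an assumption of this sketch): start from the first point, then
    # repeatedly add the point farthest from all centroids chosen so far.
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )

    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                mean = tuple(sum(col) / len(members) for col in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # centroids stopped moving, so the assignment is stable
        centroids = new_centroids
    return centroids, labels


# Made-up interaction counts: (commits, pull requests, reviews) per user.
users = [
    (40, 2, 1), (35, 3, 0), (42, 1, 2),   # mostly committing
    (5, 20, 3), (4, 22, 2), (6, 18, 4),   # mostly opening PRs
    (3, 2, 25), (2, 4, 30), (4, 3, 27),   # mostly reviewing
]

centroids, labels = kmeans(users, k=3)
for user, label in zip(users, labels):
    print(user, "-> cluster", label)
```

On real data the features would normally be scaled first, since raw counts with very different ranges can dominate the distance.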
There are some distinct clusters visible in the data:
📊 Next, I plotted the users' behavior patterns with a Sankey diagram, which is typically used to show a flow from one set of values to another. Sankey diagrams work best for many-to-many mappings with multiple paths through a set of stages. In a nutshell, the majority of users start by committing and pushing, then move on to creating PRs and reviewing other users' PRs.
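As a rough illustration of the data behind such a diagram, the sketch below counts transitions between consecutive actions and turns them into the parallel `source` / `target` / `value` index lists that Plotly's `go.Sankey` expects in its `link` argument, with `labels` going into `node`. The `sessions` data and the `sankey_links` helper are made up for this example and are not taken from the project.

```python
from collections import Counter

# Made-up event sequences, one list per user (assumption for illustration).
sessions = [
    ["commit", "push", "pull_request", "review"],
    ["commit", "push", "pull_request"],
    ["commit", "push", "review"],
    ["push", "pull_request", "review"],
]


def sankey_links(sessions):
    """Count action-to-next-action transitions and index them for a Sankey plot."""
    counts = Counter()
    for events in sessions:
        # Pair each action with the one that follows it.
        counts.update(zip(events, events[1:]))

    # Node labels, plus a lookup from label to its integer index.
    labels = sorted({name for pair in counts for name in pair})
    index = {name: i for i, name in enumerate(labels)}

    source = [index[a] for a, _ in counts]
    target = [index[b] for _, b in counts]
    value = list(counts.values())
    return labels, source, target, value


labels, source, target, value = sankey_links(sessions)
for s, t, v in zip(source, target, value):
    print(f"{labels[s]} -> {labels[t]}: {v}")
```

If users can return to an action they have already performed, keying the nodes by step position as well as action name (for example `(0, "commit")`) keeps the flow free of cycles.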
Looks great! I have a few things to note:
I wonder what type of GitHub user I am! Very interesting project. As for suggestions for improvement:
Please go crazy, rip off my project and, of course, criticize me!