Closed missaugustina closed 7 years ago
I decided to do the month of February and the month of March up to the 21st when I took the samples. I have the Github API data in JSON format. I've also started performing initial analysis on the population I sampled from. Currently dealing with a shuffleboard issue to get the CSV's together properly.
Had to fix an issue where a) repo slug was reversed and b) 3 repos had missing data either due to deletion or going private since creation. Issue #72
Added summaries to compare the different samples in one plot. Only one left at this point is the README size. Not going to do summaries for Languages and Build Status in README due to time contraints.
R was being stupid and I was having issues generating the files so I moved this into blocked. This issue has been fixed and the HTML output has been generated.
Population:
(important to do but kind of a stretch goal, basically verifies my assumptions from the previous study about the overall event population I'm sampling from)
GBQ Events:
Repos from same owner
Events per actor percent log frequency
event type frequency
Repos per Actor frequency
Github API:
Repos Age and updated since
Language Freq, Language Pct Freq
Build Status Tag in Readme Freq (preliminary has CI)
Deliverables: