Georgios Gousios (G.Gousios@tudelft.nl), Martin Pinzger (martin.pinzger@aau.at), Arie van Deursen (Arie.vandeursen@tudelft.nl)
This is a collection of GitHub database dumps taken at various dates. There are several download options.
The following tables are available: commit comments, commits, events, followers, forks, issue comments, issue events, issues, org members, pull request comments, pull requests, repo collaborators, repo labels, repos, users, and watchers.
A reduced version of the original dataset, covering only the top-10 starred GitHub projects, was released as the MSR'14 challenge data. It is available as a MongoDB or MySQL database. The data, description, and instructions are available at: http://www.ghtorrent.org/msr14.html
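To illustrate what a "top-10 starred" selection could look like over the tables listed above, here is a minimal sketch that ranks repositories by their number of watchers. It runs against a small in-memory SQLite database rather than the real MySQL dump, and the column names (`repos.id`, `repos.name`, `watchers.repo_id`, `watchers.user_id`) are assumptions made for the example, not the documented GHTorrent schema. Check the schema shipped with the dump before reusing the query.

```python
import sqlite3


def top_starred(conn, limit=10):
    """Return (repo_name, star_count) rows, most-starred first.

    Assumes hypothetical columns repos(id, name) and
    watchers(repo_id, user_id); the real dump may differ.
    """
    query = """
        SELECT r.name, COUNT(w.user_id) AS stars
        FROM repos AS r
        JOIN watchers AS w ON w.repo_id = r.id
        GROUP BY r.id, r.name
        ORDER BY stars DESC, r.name ASC
        LIMIT ?
    """
    return conn.execute(query, (limit,)).fetchall()


def build_demo_db():
    """Create a tiny in-memory database with made-up rows for the demo."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE repos (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE watchers (repo_id INTEGER, user_id INTEGER);
        """
    )
    conn.executemany(
        "INSERT INTO repos VALUES (?, ?)",
        [(1, "alpha"), (2, "beta"), (3, "gamma")],
    )
    conn.executemany(
        "INSERT INTO watchers VALUES (?, ?)",
        [(1, 10), (1, 11), (1, 12), (2, 10), (3, 10), (3, 11)],
    )
    return conn


if __name__ == "__main__":
    for name, stars in top_starred(build_demo_db(), limit=2):
        print(name, stars)
```

Because the join is an inner join, repositories with no watchers do not appear in the result. Ties are broken by name so the ordering is deterministic.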
Full dumps by date are available at: http://www.ghtorrent.org/downloads.html
Query the most recent DB live rather than downloading the whole thing at: http://www.ghtorrent.org/dblite/
Get a torrent of just the table you want for the dump-date you want at: http://www.ghtorrent.org/downloads.html
Link to paper: http://dl.acm.org/ft_gateway.cfm?id=2568260&ftid=1467975&dwn=1&CFID=609849406&CFTOKEN=18730546
Link to data (paper that contains the data): G. Gousios. The GHTorrent dataset and tool suite. In Proceedings of MSR '13, May 2013.