-
First step will allow to find all events related to given repository.
It will work like this:
> java gha.jar find_events --repository=klangner/matrobot --data=/home/user/githubarchive --output=events…
-
Temporary issue or has something changed?
Coincidental with the date that clocks rolled back?
Up until 2012-11-03-23 responds normally.
wget http://data.githubarchive.org/2012-11-04-01.json.gz
--201…
-
Hi.
As I understand the format of the files on githubarchive.org is json lines separated by new line.
But I found that not all files have new line character. Sometimes records are not separated.
It m…
-
change parser to process githubarchive files in parallel. this is an embarrassingly parallel task.
-
I'm using the script provided on the web site and I get UTF-8 issues every once in a while. One example is for Jan 1, 2012 09:00:
``` ruby
require 'open-uri'
require 'zlib'
require 'yajl'
gz = open(…
-
Hi,
I'm afraid I found another problem.
There are 2 different formats in file: 2012-03-11-0.json.gz
Sometimes the key for repository info has name repo. Like in this example:
{
"repo": {
…
-
Add preprocesor to parse all github events for specific repository and create CSV file with following fields:
- event create data
- event type
-
It would be very useful to be able to see the distribution of committers in every period. How many people committed in a certain period? How are the commits distributed between these people?
-
For example:
[501:5761] home/ubuntu/githubarchive/2012-07-06-5.json.gz
...so we know how many files are left.
okram updated
11 years ago
-
Hi there,
while analyzing the most active Go projects on github, using http://www.githubarchive.org/, I have mentioned that your project description has a typo. s/Plataform/Platform/
It's a descript…