Add a pga-create repack command. This downloads latest GHTorrent MySQL dump and repacks it on the fly to store a local copy without the files that are not needed. This is particularly useful during development of pga-create, since the repacked version can be processed 10-20 minutes, while processing the original GHTorrent MySQL dump takes hours.
Add a
pga-create repack
command. This downloads latest GHTorrent MySQL dump and repacks it on the fly to store a local copy without the files that are not needed. This is particularly useful during development ofpga-create
, since the repacked version can be processed 10-20 minutes, while processing the original GHTorrent MySQL dump takes hours.This PR depends on https://github.com/src-d/datasets/pull/78 (check only last commit) and do not merge.