repack command: it only repacks watchers.csv and projects.csv from
the ghtorrent's mysql dump.
discover command: it only generates one file named repositories.csv.gz
containing all the information needed in the next steps
(repository name, number of stars).
select command: it can only filter by number of stars. It generates
two files: a list on the stdout of one url repository per line to
feed borges and a file named repositories-index.csv.gz (filtered from
repositories.csv.gz) to be used by the index generation command.
index command: it will use the repositories.csv.gz by default. To give
it the filtered information in repositories-index.csv.gz the flag -r
must be used.
Changes:
repack command: it only repacks watchers.csv and projects.csv from the ghtorrent's mysql dump.
discover command: it only generates one file named repositories.csv.gz containing all the information needed in the next steps (repository name, number of stars).
select command: it can only filter by number of stars. It generates two files: a list on the stdout of one url repository per line to feed borges and a file named repositories-index.csv.gz (filtered from repositories.csv.gz) to be used by the index generation command.
index command: it will use the repositories.csv.gz by default. To give it the filtered information in repositories-index.csv.gz the flag -r must be used.
Signed-off-by: Manuel Carmona manu.carmona90@gmail.com