Closed andymeneely closed 9 years ago
Also, for 4 above, use this regexp:
([0-9]{3,6})\1
Then use the first capture group. Be sure to write a verify for this based on the copied one from our dev data (229611229611).
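A minimal sketch of how that regexp could be applied, assuming the bug id arrives as a string and that the whole value is the doubled id. The `\A`/`\z` anchors, the `DOUBLED_ID` constant and the `dedupe_bug_id` helper are my own additions for illustration, not names from the codebase:

```ruby
# Hypothetical helper: collapses a bug id that was pasted twice
# (e.g. 229611229611) back to the single id (229611).
# Assumption: anchors are added so only a fully doubled value matches.
DOUBLED_ID = /\A([0-9]{3,6})\1\z/

def dedupe_bug_id(raw)
  text = raw.to_s.strip
  match = DOUBLED_ID.match(text)
  # match[1] is the first capture group, i.e. one copy of the id.
  match ? match[1].to_i : text.to_i
end

dedupe_bug_id("229611229611") # => 229611
dedupe_bug_id("229611")       # => 229611
```

Note that a legitimate six-digit id such as 123123 would also be collapsed by this pattern, so the verify should probably flag how many ids get rewritten.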
Also do this:
issue
Working on verifying the reason for the dangling bugs.
The ~70k missing bugs from this relationship are key - we need to be scraping these ASAP.
I placed the recovered bugs in /tmp/recovered_bugs/; they can be added to the production build data.
A total of 4201 bug ids returned an error (403 Forbidden or 404 Not Found) when trying to fetch the data.
The error log is located at /tmp/recovered_bugs/error_log.csv
We need to handle the many-to-many relationship between a bug and a commit. We need:
Parse the BUG= field in the git log parser properly - see below
bug_commit table.
~axmvse/chromium/realdata/chromium-gitlog.txt
For parsing the BUG= field, do the following:
Remove the chromium: string from each field
to_i
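The steps above could be sketched roughly as follows. This assumes the BUG= value is a comma-separated list on a single line and that entries which do not convert to a positive integer (for example "none") should be dropped. The method names and the `commit_hash`/`bug_id` keys are hypothetical, chosen only to show one row per bug/commit pair for the bug_commit table:

```ruby
# Hypothetical sketch: extract integer bug ids from a "BUG=" line.
# Assumption: ids are comma-separated; anything that to_i turns into 0
# (e.g. "none") is discarded.
def parse_bug_ids(line)
  match = /^\s*BUG\s*=(.*)$/.match(line)
  return [] unless match

  match[1].split(',')
          .map { |field| field.strip.sub(/\Achromium:/i, '').to_i }
          .reject(&:zero?)
          .uniq
end

# One row per (commit, bug) pair, since a commit can reference many bugs
# and a bug can be referenced by many commits.
def bug_commit_rows(commit_hash, line)
  parse_bug_ids(line).map { |bug_id| { commit_hash: commit_hash, bug_id: bug_id } }
end

parse_bug_ids("BUG=chromium:12345, 678") # => [12345, 678]
```

Returning plain hashes keeps the sketch independent of whatever model or loader ends up writing the bug_commit table.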