Closed bmschmidt closed 10 years ago
Just to get some numbers down: with the LOC corpus (1.7m documents, which is very much shorter than it should be) the initial clean scan took about 11 hours.
This is not an issue.
Just to get some numbers down: with the LOC corpus (1.7m documents, which is very much shorter than it should be) the initial clean scan took about 11 hours.