laurentprudhon / nlptextdoc

Suite of tools to extract and annotate language resources for NLP applications
Other
1 stars 2 forks source link

Incorrect display of the % of unique text blocks #10

Closed laurentprudhon closed 5 years ago

laurentprudhon commented 5 years ago

Very often, the % of unique text blocks displayed on screen diverges :

Time | Pages | Errors | Unique | Download | Disk | Parsing | Convert | 0:01:00 | 299 | 28 | -2147483648 % | 67,9 Mb | 0,1 Mb | 0:03:11 | 0:00:00 |

Example website to reproduce this bug : https://www.lolivier.fr/ - after at most 299 pages

laurentprudhon commented 5 years ago

Fixed