zeeguu / api

API for tracking a learner's progress when reading materials in a foreign language and recommending further personalized exercises and readings.
https://zeeguu.org
MIT License
8 stars 24 forks source link

why can't we parse this article? #285

Closed mircealungu closed 2 weeks ago

mircealungu commented 2 weeks ago

https://journalisten.dk/7-ud-af-10-af-os-arbejder-i-storrumskontor-og-rigtig-mange-hader-det/

i tried to read it with the extension, the waiting animation is very long

image

followed by

image
tfnribeiro commented 2 weeks ago

There seems to be something wrong with the find_or_create_article:

image

mircealungu commented 2 weeks ago

/var/www/nltk_data was not present in the API container, and the article parser was throwing an exception.

somewhat surprisingly, issue was fixed by changing the order of installing apache and installing punkt in setup.py during the building of the API image: https://github.com/zeeguu/api/commit/fbc85d3a0484c97feed5fbd7578c45a101ac3fa1