granawkins / latent-dictionary

A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.
MIT License
69 stars 7 forks source link

Add script to fetch word lists in 10 languages using NLTK #15

Open mentatbot[bot] opened 1 month ago

mentatbot[bot] commented 1 month ago

This update addresses the issue of obtaining the most common 10,000 words in 10 different languages using NLTK. A new script, fetch_word_lists.py, has been added to handle this task. The script downloads the necessary NLTK data and saves the word lists in a directory named wordlist. Additionally, the startup sequence in app.py has been modified to run this script. The NLTK library has also been added to requirements.txt.

Closes #14