aau-network-security / richkit

Domain Enrichment Toolkit $ pip install richkit
https://pypi.org/project/richkit/
MIT License
11 stars 3 forks source link

ngram analysis test with dummy data #111

Closed kidmose closed 4 years ago

kidmose commented 4 years ago

After reactivating richkit.test.analyse.test_analyse.TestAnalyse.test_get_grams_alexa_2ld the test running time gone up, likely because it relies on downloading the Alexa Top-1M.

Goal: 1) Establish if downloading of the Top-1M has a significant impact on the test running time (share findings here and if necessary they can also be discussed here) and 2) and, if found relevant, modify test to run with a dummy file (e.g. the alexa Top-100)