datasciencecampus / pygrams

Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
https://datasciencecampus.github.io/pygrams

Broken tests in test_tfidf_reduce.py #383

Open l-bailey-fnc opened 1 year ago

l-bailey-fnc commented 1 year ago

test_terms() and test_scores() need to be updated so that they iterate over the accepted group only:

for x, _ in term_score_tuples -> for x, _ in term_score_tuples['accept']

This is because term_score_tuples is split into an accept group and a reject group by the utils.remove_stop_words() method in Pipeline.py, line 180.
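
A minimal sketch of why the old loop breaks and what the proposed change does. It does not use the pygrams code itself: split_accept_reject is a hypothetical stand-in for the accept/reject split, and the (score, term) tuple order, the sample terms and the stop-word set are all assumptions made for illustration.

```python
# Hypothetical stand-in for the accept/reject split described in the issue.
# This is NOT the pygrams implementation; names and tuple order are assumed.
def split_accept_reject(term_score_tuples, stop_words):
    grouped = {'accept': [], 'reject': []}
    for score, term in term_score_tuples:
        key = 'reject' if term in stop_words else 'accept'
        grouped[key].append((score, term))
    return grouped


# Made-up sample data: (score, term) pairs.
term_score_tuples = split_accept_reject(
    [(0.9, 'fuel cell'), (0.7, 'said'), (0.5, 'heat pump')],
    stop_words={'said'},
)

# Old loop: iterating a dict yields its keys ('accept', 'reject'),
# so unpacking each key into two names raises ValueError.
try:
    old_scores = [x for x, _ in term_score_tuples]
except ValueError:
    old_scores = None

# Updated loop, as proposed in the issue: iterate over the accepted group.
new_scores = [x for x, _ in term_score_tuples['accept']]
```

If the real structure matches this assumption, the same one-line change would apply to both test_terms() and test_scores().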