krinkv / information-ranking

The Unlicense

Clear inverse index of invalid terms #25

Closed: KristiyanCvetanov closed this issue 9 months ago

KristiyanCvetanov commented 9 months ago

We need further preprocessing when reading documents. Some invalid terms are still left over, and they end up as keys in the initialized inverse index. For example: "!!", "!!!", "!!!!", "!)", "$$$", "$)", etc.
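One possible way to do this filtering is sketched below. It is not the project's actual code: the function names (`is_valid_term`, `build_inverse_index`, `clear_invalid_terms`), the whitespace tokenizer, and the rule "a term is valid only if it contains at least one letter or digit" are all assumptions made for illustration.

```python
import re
from collections import defaultdict
from typing import Dict, Set

# Assumption: a term is valid if it contains at least one letter or digit.
# [^\W_] matches any word character except the underscore.
_HAS_ALNUM = re.compile(r"[^\W_]")


def is_valid_term(term: str) -> bool:
    """Return True if the term has at least one alphanumeric character."""
    return bool(_HAS_ALNUM.search(term))


def build_inverse_index(docs: Dict[int, str]) -> Dict[str, Set[int]]:
    """Build term -> set of document ids, skipping invalid terms up front."""
    index: Dict[str, Set[int]] = defaultdict(set)
    for doc_id, text in docs.items():
        # Hypothetical tokenizer: lowercase and split on whitespace.
        for token in text.lower().split():
            if is_valid_term(token):
                index[token].add(doc_id)
    return dict(index)


def clear_invalid_terms(index: Dict[str, Set[int]]) -> Dict[str, Set[int]]:
    """Drop invalid keys from an index that has already been built."""
    return {term: ids for term, ids in index.items() if is_valid_term(term)}


if __name__ == "__main__":
    docs = {1: "great deal !!! $$$", 2: "great !) price $)"}
    print(sorted(build_inverse_index(docs)))
```

Filtering while reading the documents keeps the bad keys out of the index from the start, while `clear_invalid_terms` covers an index that was already initialized. Note that this rule only drops tokens made up entirely of punctuation; a token such as "price!" would still be kept as is, so stripping punctuation from the ends of tokens would be a separate step.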