Using windows, ran VSM prep on a corpus of Indiana court cases. After selecting a value for the number of words for the upper threshold, the program went into loop printing repeatedly "FIlter will remove 1571 occurrences of these 21 words"
Problem arises for any value less than or equal to 27.
Using windows, ran VSM prep on a corpus of Indiana court cases. After selecting a value for the number of words for the upper threshold, the program went into loop printing repeatedly "FIlter will remove 1571 occurrences of these 21 words"
Problem arises for any value less than or equal to 27.
Screenshot attached