Instead of finding the intersection of two unique word sets, we can simply keep track of "missed" words, and exit when we reach a certain threshold (determined by the comparison threshold.) This may or may not be a performance improvement; we have to test it to verify.
Instead of finding the intersection of two unique word sets, we can simply keep track of "missed" words, and exit when we reach a certain threshold (determined by the comparison threshold.) This may or may not be a performance improvement; we have to test it to verify.