theSage21 / WordHashDocSearch

Word hash based document search.
0 stars 1 forks source link

split after applying tokenizer #1

Open dchatterjee172 opened 5 years ago

dchatterjee172 commented 5 years ago

https://github.com/theSage21/WordHashDocSearch/blob/dev/word_hash.py#L43

https://github.com/theSage21/WordHashDocSearch/blob/dev/word_hash.py#L51

text_to_words returns a string, not a list. You have to split it.

theSage21 commented 5 years ago

what! my world is a lie. :frowning:

dchatterjee172 commented 5 years ago

it is more like, test_to_space_separated_tokens. -_-