Open stroncod opened 6 years ago
Thanks for using ESA, and even more for your feedback!
text
is expected to be plain text, without control characters (such as quotes to combine multiple words into a single token), so I think your solution is correct.
Do you want to issue a pull request with the change and a unit test or two? Then your contribution will be carved into stone.
I got a problem when trying to read a dataset with special characters and trying to get the concept vector. This is easily solve by adding the escape function in the Vectorizer class
Great implementation by the way! Thanks
Source: https://stackoverflow.com/questions/10259907/lucene-exception-query-parser-encountered-eof-after-some-word/10259944