Hi, I got an issue with watson when trying to search unicode.
eg.When I searched ផលិត, it turns out that watson removed to be ផលត. The problem is watson did removing character it thought non-word characters. Is there a workaround to fix this? Thanks.
RE_NON_WORD = re.compile(r"[^ \w\-\.']", re.UNICODE)
def escape_query(text):
...
text = RE_NON_WORD.sub("", text) # Remove non-word characters.
return text
Hi, I got an issue with watson when trying to search unicode. eg.When I searched ផលិត, it turns out that watson removed to be ផលត. The problem is watson did removing character it thought non-word characters. Is there a workaround to fix this? Thanks.