Closed frreiss closed 3 years ago
@BryanCutler would you mind giving these changes a quick review?
Thanks!
Sorry, slipped under the radar. Looking at it now.
Thanks for the review! Pushed some corrections. Will merge once this branch passes tests.
This PR includes some fixes for issues with our dictionary matching that I encountered while working on a market intelligence use case for a blog post.
The specific problems addressed here are:
create_dict()
.simple_tokenizer()
that returns a tokenizer that splits on every run of whitespace and on every punctuation character. Dictionary creation uses that tokenizer by default now.
I also fixed a minor bug in the handling of the warnings element of responses from Watson Natural Language Understanding.
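For reviewers who want a concrete picture of the tokenization rule described above (split on every run of whitespace and on every punctuation character), here is a minimal standalone sketch using only Python's `re` module. It is not the code from this PR: the name `split_tokens` and the regular expression are assumptions made for illustration, and the real tokenizer may treat edge cases differently.

```python
import re

# Hypothetical pattern, not taken from the PR. A token is one of:
#   - a run of letters/digits (word characters other than underscore)
#   - a single punctuation character (neither whitespace nor a word character)
#   - a single underscore, treated here as punctuation
# Whitespace never appears in a token; it only separates tokens.
_TOKEN_RE = re.compile(r"[^\W_]+|[^\w\s]|_")


def split_tokens(text):
    """Return a list of (begin, end, token) tuples with character offsets."""
    return [(m.start(), m.end(), m.group()) for m in _TOKEN_RE.finditer(text)]


if __name__ == "__main__":
    for begin, end, token in split_tokens("Acme Corp., Inc."):
        print(begin, end, repr(token))
```

Under this rule each punctuation character is its own token, so a dictionary entry such as "Corp." is tokenized into two tokens, and it matches document text that was tokenized the same way.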