Closed cli0 closed 4 years ago
I have this same error? Do y'all know what is going on?
The following workaround worked for me:
import HTMLSentenceTokenizer sentence = HTMLSentenceTokenizer.HTMLSentenceTokenizer() example_html_one = open('example_html_one.html', 'r').read() parsed_sentences = sentence.feed(example_html_one) print(parsed_sentences)
Without the download method of @conorosully:
from htmlst import HTMLSentenceTokenizer
sentence = HTMLSentenceTokenizer.HTMLSentenceTokenizer()
example_html_one = open('example_html_one.html', 'r').read()
parsed_sentences = sentence.feed(example_html_one)
print(parsed_sentences)
any other users seeing this tool missing large chunks of text? looks great though :)
While trying to run your example code I get the error:
And it ultimately stems from this:
HTMLSentenceTokenizer' is not callable
that I get directly from the IDE (Pycharm).