Closed mejo closed 10 years ago
I think train_from_htmlpage
should work. could you check if you pass HtmlPage
or raw data?
BTW you can convert the raw html data to HtmlPage simply with HtmlPage(body=raw_body)
@mejo did you manage to solve your issue with @tpeng suggestion?. Can you close this ticket if so, thanks.
Did not verify, pursuing other projects. Closing anyway.
Older issue mentions 'train_from_htmlpage' method but its not working anymore? What I try to do is provide preprocessed html data (utf8 conversion done to make scrapely work) for scrapely.