edevil closed this issue 10 months ago
The link is no longer available. Do you have another example?
iex> Readability.summarize("http://www.techhive.com/article/3158435/home-tech/43-off-tp-link-smart-led-wi-fi-light-bulb-dimmable-and-alexa-compatible-deal-alert.html#tk.rss_smartappliance")
%Readability.Summary{
  title: "301 Moved Permanently",
  authors: nil,
  article_html: "<div></div>",
  article_text: ""
}
That's because the options differ: Readability.article uses the default options, while Readability.summarize does not. I think it's worth unifying them.
Nevertheless, the original bug shouldn't happen anymore, because we now use a different helper for calculating text_length (#53).
I'll close this issue in favor of #57
Example URL: "http://www.techhive.com/article/3158435/home-tech/43-off-tp-link-smart-led-wi-fi-light-bulb-dimmable-and-alexa-compatible-deal-alert.html#tk.rss_smartappliance"
Summarizing it works:
Extracting article directly from HTML fails:
This is because, in the second case, the "clean_conditionally: true" option is passed to the Sanitizer. Using different options for the two calls is a bit surprising; nevertheless, it should not crash.