metalwarrior665 / actor-article-extractor-smart

Combines Apify's crawling system and article parsing with unfluff library.
https://apify.com/lukaskrivka/article-extractor-smart
11 stars 5 forks source link

Perhaps use Mozilla Readability #8

Open mnmkng opened 2 years ago

mnmkng commented 2 years ago

https://github.com/mozilla/readability might be better than unfluff.

metalwarrior665 commented 2 years ago

Cool, will check if we could implement 2 backends with the same API and then we can compare.