Open Miserlou opened 9 years ago
This looks interesting: https://github.com/grangier/python-goose
Unfortunately, goose is not as good as Diffbot. It's certainly faster, but in many of my example tests it failed to find an article body that Diffbot found.
Still, it is provided in the -goose branch if you think you can make improvements.
It would be way better to do the article extraction locally.