Miserlou / Seek

Search the internet from your terminal. Speed read your results. Terminal nirvana.
MIT License
20 stars 3 forks source link

Diffbot Is Pretty Slow #1

Open Miserlou opened 9 years ago

Miserlou commented 9 years ago

Would be way better to do the article extraction locally..

Miserlou commented 9 years ago

This looks interesting.. https://github.com/grangier/python-goose

Miserlou commented 9 years ago

Unfortunately, goose is not as good as Diffbot. It's certainly faster, but is unable to find article body that DB could in many of my example tests.

Still, it is provided in the -goose branch if you think you can make improvements.