GravityLabs / goose

Html Content / Article Extractor in Scala - open sourced from Gravity Labs
http://gravity.com
Apache License 2.0
1.53k stars 322 forks source link

ImportError: cannot import name StandardContentExtractor #106

Open jajana opened 8 years ago

jajana commented 8 years ago

Hi, I'm using Ubuntu 14.04 and I am trying to run a script within scrapy but get this error:

File "/home/anjo/anaconda2/lib/python2.7/site-packages/goose/crawler.py", line 28, in from goose.extractors import StandardContentExtractor ImportError: cannot import name StandardContentExtractor

What can this be..?

leakydata commented 8 years ago

Installing Goose Article Extractor following the instructions below, worked for me: git clone https://github.com/grangier/python-goose.git cd python-goose pip install -r requirements.txt python setup.py install