openaustralia / morph

Take the hassle out of web scraping
https://morph.io
GNU Affero General Public License v3.0
461 stars 74 forks source link

Container fails on Standard import and undeletable #1097

Open psychemedia opened 8 years ago

psychemedia commented 8 years ago

Hi I've had a scraper that did used to work [https://morph.io/psychemedia/PlanningEnforcementAppeals] erroring after a morph.io update some time ago:

Traceback (most recent call last):
   File "scraper.py", line 21, in <module>
     from bs4 import BeautifulSoup
   File "/app/.heroku/python/lib/python2.7/site-packages/bs4/__init__.py", line 29, in <module>
     from .builder import builder_registry
   File "/app/.heroku/python/lib/python2.7/site-packages/bs4/builder/__init__.py", line 306, in <module>
     from . import _html5lib
   File "/app/.heroku/python/lib/python2.7/site-packages/bs4/builder/_html5lib.py", line 57, in <module>
     class TreeBuilderForHtml5lib(html5lib.treebuilders._base.TreeBuilder):
 AttributeError: 'module' object has no attribute '_base'

The same error occurs if I rebuild the container by updating the original scraper.py file. bs4 loads fine for me in another scraper?

The scraper also throws an error if I try to delete it.