algolia / docsearch-scraper

DocSearch - Scraper
https://docsearch.algolia.com/
Other
306 stars 106 forks source link

TypeError: list indices must be integers, not str #20

Closed dustincoates closed 8 years ago

dustincoates commented 8 years ago

Got this on Chef, Lodash, and one I was using for Go.

2015-12-31 15:49:37 [scrapy] ERROR: Spider error processing <GET https://docs.chef.io/resources.html> (referer: https://docs.chef.io/)
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/site-packages/scrapy/utils/defer.py", line 102, in iter_errback
    yield next(it)
  File "/usr/local/lib/python2.7/site-packages/scrapy/spidermiddlewares/offsite.py", line 28, in process_spider_output
    for x in result:
  File "/usr/local/lib/python2.7/site-packages/scrapy/spidermiddlewares/referer.py", line 22, in <genexpr>
    return (_set_referer(r) for r in result or ())
  File "/usr/local/lib/python2.7/site-packages/scrapy/spidermiddlewares/urllength.py", line 37, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/usr/local/lib/python2.7/site-packages/scrapy/spidermiddlewares/depth.py", line 54, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/usr/local/lib/python2.7/site-packages/scrapy/spiders/crawl.py", line 67, in _parse_response
    cb_res = callback(response, **cb_kwargs) or ()
  File "/Users/dustin/Documents/code/documentation-scrapper/src/documentation_spider.py", line 51, in callback
    records = self.strategy.get_records_from_response(response)
  File "/Users/dustin/Documents/code/documentation-scrapper/src/strategies/default_strategy.py", line 25, in get_records_from_response
    records = self.get_records_from_dom()
  File "/Users/dustin/Documents/code/documentation-scrapper/src/strategies/default_strategy.py", line 47, in get_records_from_dom
    level_selector = self.config.selectors[level]
TypeError: list indices must be integers, not str```
pixelastic commented 8 years ago

Is this still an issue? Could you check with the latest master and post you config if it still occurs?

dustincoates commented 8 years ago

@pixelastic It's not--thought I had closed it. Closing.