Open alexmiller00 opened 7 years ago
Hey @alexmiller00
Unfortunately, caranddriver went from static HTML to Angular components. Since they changed the structure of their HTML, the script no longer works. The project is not being actively maintained, and fixing it would require a bit of an overhaul and some time. If you need the data, send me your email and I'll send the files over.
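To illustrate why a markup change like this leaves the output empty, here is a minimal standard-library sketch. It is not the project's spider code: the `model-name` class, the `<app-root>` element, and both HTML snippets are made up for the example. The point is only that a selector written for the old static markup matches nothing in a client-rendered shell, so the page downloads fine but nothing is extracted.

```python
from html.parser import HTMLParser


class ClassCounter(HTMLParser):
    """Count start tags whose class attribute contains a given class name."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value can be None.
        classes = (dict(attrs).get("class") or "").split()
        if self.class_name in classes:
            self.count += 1


def count_class(html, class_name):
    """Return how many elements in the raw HTML carry class_name."""
    parser = ClassCounter(class_name)
    parser.feed(html)
    parser.close()
    return parser.count


# Hypothetical markup: roughly what a static page could have served.
STATIC_HTML = (
    '<ul><li class="model-name">Chiron</li>'
    '<li class="model-name">Veyron</li></ul>'
)
# Hypothetical client-rendered shell: the content is filled in by JavaScript,
# which a plain HTTP download never executes.
ANGULAR_SHELL = '<body><app-root></app-root><script src="main.js"></script></body>'

print(count_class(STATIC_HTML, "model-name"))    # 2
print(count_class(ANGULAR_SHELL, "model-name"))  # 0
```

Both downloads would come back as HTTP 200, but only the first contains anything for the old selectors to find.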
I am not getting any data returned in my JSON file. I ran your command "scrapy crawl automotive -o mybetternameforafile.json"; it creates the file, but the file is empty. This is my shell output:
2017-03-08 11:12:49 [scrapy.utils.log] INFO: Scrapy 1.3.2 started (bot: automotive)
2017-03-08 11:12:49 [scrapy.utils.log] INFO: Overridden settings: {'NEWSPIDER_MODULE': 'automotive.spiders', 'FEED_URI': 'mybetternameforafile.json', 'SPIDER_MODULES': ['automotive.spiders'], 'BOT_NAME': 'automotive', 'ROBOTSTXT_OBEY': True, 'FEED_FORMAT': 'json'}
2017-03-08 11:12:49 [scrapy.middleware] INFO: Enabled extensions: ['scrapy.extensions.feedexport.FeedExporter', 'scrapy.extensions.logstats.LogStats', 'scrapy.extensions.telnet.TelnetConsole', 'scrapy.extensions.corestats.CoreStats']
2017-03-08 11:12:49 [scrapy.middleware] INFO: Enabled downloader middlewares: ['scrapy.downloadermiddlewares.robotstxt.RobotsTxtMiddleware', 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware', 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware', 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware', 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware', 'scrapy.downloadermiddlewares.retry.RetryMiddleware', 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware', 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware', 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware', 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware', 'scrapy.downloadermiddlewares.stats.DownloaderStats']
2017-03-08 11:12:49 [scrapy.middleware] INFO: Enabled spider middlewares: ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware', 'scrapy.spidermiddlewares.offsite.OffsiteMiddleware', 'scrapy.spidermiddlewares.referer.RefererMiddleware', 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware', 'scrapy.spidermiddlewares.depth.DepthMiddleware']
2017-03-08 11:12:49 [scrapy.middleware] INFO: Enabled item pipelines: ['automotive.pipelines.AutomotivePipeline']
2017-03-08 11:12:49 [scrapy.core.engine] INFO: Spider opened
2017-03-08 11:12:49 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2017-03-08 11:12:49 [scrapy.extensions.telnet] DEBUG: Telnet console listening on 127.0.0.1:6023
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/robots.txt> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/bugatti> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/acura> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/alfa-romeo> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/bentley> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/buick> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/bmw> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/audi> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/chrysler> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/aston-martin> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/cadillac> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/dodge> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/chevrolet> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/ferrari> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/fiat> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/genesis> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/gmc> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/honda> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/infiniti> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/jaguar> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/hyundai> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/koenigsegg> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/lamborghini> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/ford> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/jeep> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/lexus> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/lincoln> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/kia> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/land-rover> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/lotus> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/maserati> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mclaren> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mercedes-amg> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mercedes-benz> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mercedes-maybach> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mini> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mazda> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/mitsubishi> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/nissan> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/pagani> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/porsche> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/ram> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/rolls-royce> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/spyker> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/subaru> (referer: None)
2017-03-08 11:12:49 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/smart> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/tesla> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/volvo> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/scion> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/volkswagen> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.caranddriver.com/toyota> (referer: None)
2017-03-08 11:12:50 [scrapy.core.engine] INFO: Closing spider (finished)
2017-03-08 11:12:50 [scrapy.statscollectors] INFO: Dumping Scrapy stats: {'downloader/request_bytes': 11473, 'downloader/request_count': 51, 'downloader/request_method_count/GET': 51, 'downloader/response_bytes': 1285842, 'downloader/response_count': 51, 'downloader/response_status_count/200': 51, 'finish_reason': 'finished', 'finish_time': datetime.datetime(2017, 3, 8, 10, 12, 50, 178655), 'log_count/DEBUG': 52, 'log_count/INFO': 7, 'response_received_count': 51, 'scheduler/dequeued': 50, 'scheduler/dequeued/memory': 50, 'scheduler/enqueued': 50, 'scheduler/enqueued/memory': 50, 'start_time': datetime.datetime(2017, 3, 8, 10, 12, 49, 239611)}
2017-03-08 11:12:50 [scrapy.core.engine] INFO: Spider closed (finished)
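The stats dump at the end of the log already points at the cause: 51 responses came back with status 200, but there is no `item_scraped_count` entry, which Scrapy only records once at least one item has been scraped. A small sketch of reading the dump that way follows. The `diagnose` helper and the `LOG_STATS` name are hypothetical; the dict is a hand-copied subset of the values above.

```python
def diagnose(stats):
    """Summarise a Scrapy stats dict: did pages download, and were items scraped?"""
    responses = stats.get("response_received_count", 0)
    ok = stats.get("downloader/response_status_count/200", 0)
    # The key is missing entirely when no item was ever scraped.
    items = stats.get("item_scraped_count", 0)

    if responses == 0:
        return "no responses received"
    if items == 0:
        return f"{ok}/{responses} responses were HTTP 200, but no items were scraped"
    return f"{items} items scraped from {responses} responses"


# Subset of the stats from the log above, copied by hand.
LOG_STATS = {
    "downloader/request_count": 51,
    "downloader/response_status_count/200": 51,
    "response_received_count": 51,
    "finish_reason": "finished",
}

print(diagnose(LOG_STATS))
# 51/51 responses were HTTP 200, but no items were scraped
```

In other words, the crawl itself succeeded and the pages were fetched; the parse callbacks simply found nothing to yield, which fits a spider whose selectors no longer match the site's markup.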