Open knirbhay opened 6 years ago
A custom feed request will look like this:
python kafka_monitor.py feed '{ "url": "http://dmoztools.net", "appid": "testapp", "crawlid": "ABC123", "spiderid": "myspiderid", "headers": { "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8", "Accept-Encoding": "gzip, deflate", "X-Requested-With": "dmoztools.net", "User-Agent": "My Custom User Agent" }, "cookies": { "device_id": "1", "app_token": "guid" } }'
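For anyone who would rather build that request programmatically than hand-write the JSON, here is a minimal sketch using only the standard library. The helper name `build_feed_command` is hypothetical and not part of this PR. The `Accept` value is assumed to be the usual browser string ending in `image/webp,*/*;q=0.8`. Serializing with `json.dumps` and quoting with `shlex.quote` keeps the shell from splitting the payload.

```python
import json
import shlex

# Feed request mirroring the example above. The "headers" and "cookies"
# keys are the new optional fields this PR adds to the feed.
feed_request = {
    "url": "http://dmoztools.net",
    "appid": "testapp",
    "crawlid": "ABC123",
    "spiderid": "myspiderid",
    "headers": {
        # Assumed intended value (standard browser Accept string).
        "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8",
        "Accept-Encoding": "gzip, deflate",
        "X-Requested-With": "dmoztools.net",
        "User-Agent": "My Custom User Agent",
    },
    "cookies": {"device_id": "1", "app_token": "guid"},
}


def build_feed_command(request):
    """Hypothetical helper: return a shell-safe kafka_monitor feed command."""
    payload = json.dumps(request)
    # shlex.quote wraps the JSON so the shell passes it as one argument.
    return "python kafka_monitor.py feed " + shlex.quote(payload)


if __name__ == "__main__":
    print(build_feed_command(feed_request))
```

Splitting the resulting string with `shlex.split` yields exactly four arguments, the last of which parses back into the original dictionary.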
Because of the shared cookies middleware, coverage has decreased by 0.4%. @madisonb, do you think this is manageable? I improved coverage by a few percentage points in distributed_scheduler.
Adding support for:
1. Custom headers and cookies with the initial request
2. Shared cookies middleware to share cookies between crawl nodes
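To illustrate the idea behind the second item, here is a rough sketch of cookie sharing across nodes. It is not the middleware code in this PR: `SharedCookieStore` is a hypothetical class, and its in-memory dict stands in for whatever external store the crawl nodes would actually share. Keying the jar on spider, app, and crawl ID is also an assumption.

```python
class SharedCookieStore:
    """Hypothetical stand-in for a cookie store shared by all crawl nodes."""

    def __init__(self):
        # Assumption: a real deployment would back this with an external
        # service reachable from every node, not a local dict.
        self._jars = {}

    @staticmethod
    def _key(appid, crawlid, spiderid):
        # One cookie jar per (spider, app, crawl) combination.
        return f"{spiderid}:{appid}:{crawlid}"

    def update(self, appid, crawlid, spiderid, cookies):
        """Merge cookies seen by one node into the shared jar."""
        jar = self._jars.setdefault(self._key(appid, crawlid, spiderid), {})
        jar.update(cookies)

    def get(self, appid, crawlid, spiderid):
        """Return a copy of the jar for a node to attach to its next request."""
        return dict(self._jars.get(self._key(appid, crawlid, spiderid), {}))


if __name__ == "__main__":
    store = SharedCookieStore()
    # Node A seeds the jar with the cookies from the initial feed request.
    store.update("testapp", "ABC123", "myspiderid",
                 {"device_id": "1", "app_token": "guid"})
    # Node B later records a cookie set by a response it handled.
    store.update("testapp", "ABC123", "myspiderid", {"session": "xyz"})
    # Any node picking up the next request for this crawl sees all three.
    print(store.get("testapp", "ABC123", "myspiderid"))
```

Returning a copy from `get` means a node cannot change the shared jar by accident. Changes only go in through `update`.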
Linked Issue #182