Closed krtek4 closed 8 years ago
Small hint, it could be possible to move our bucket implementation to replace Crawler.prototype.queueURL
so that we decide in which order SimpleCrawler fetches URLs.
Then, once a URL is fetched and validated, we can pass it directly to pa11y
because it will be in the right "order" already.
The tester part of the bucket systems works great, but the crawling is not 100%.
The crawler fetches (and de factor add) URLs based on the order they appear in the document, this means if the first thousand URLs of a document all go to the same bucket, we will first fetch those 1000 URLs before filling any other bucket.
This is cumbersome when you want to quickly have a panel of different URLs tested.
I can imagine two solutions :