TorSpider / TorSpider-Backend

The database backend with which the spiders share their discoveries.
BSD 3-Clause "New" or "Revised" License
1 stars 0 forks source link

Move to a backend populated queue for next_url #16

Closed artagel closed 6 years ago

artagel commented 6 years ago

The next_url api is an expensive join query that needs to be reworked into a queuing system that is generating the url list periodically. We could probably run this every 5-10 min, as currently each time we run the query we get a full list of urls and then grab a random one that need scanning. If we simply populate the queue and pop one off, it'll be way more efficient.