The general idea is to generate a queue in the database of links to be crawled. The primary source of those links will be recent publications in relevant journals. Which journals are supposed to be relevant will probably be encoded in the journal ranking.
[x] Generate a queue table of relevant journals.
[x] Download the first page of article links for queued journals.
The general idea is to generate a queue in the database of links to be crawled. The primary source of those links will be recent publications in relevant journals. Which journals are supposed to be relevant will probably be encoded in the journal ranking.