Closed harsham05 closed 8 years ago
TODO: Add second pass of parser-indexer (to run through nutch segments) to wrangler_cron.sh to ingest atomic updates of crawl metadata lost during File Dump
Echo incremental status updates of the cron job
refactor MikeJ dump scripts, only want full_dump
fixed in https://github.com/memex-explorer/weapons/pull/15
TODO: Add second pass of parser-indexer (to run through nutch segments) to wrangler_cron.sh to ingest atomic updates of crawl metadata lost during File Dump