ukwa / ukwa-heritrix

The UKWA Heritrix3 custom modules and Docker builder.
9 stars 7 forks source link

Broken checkpoints #74

Closed anjackson closed 2 years ago

anjackson commented 2 years ago
SEVERE: org.archive.crawler.framework.CrawlJob beansException Failed to start bean 'warcWriterViralOld'; nested exception is java.lang.RuntimeException: java.io.FileNotFoundException: File '/heritrix/state/craw
l-frequent-npld.host-npld-heritrix3-worker-1/checkpoints/cp00001-20210928094133/warcWriterViralOld' does not exist [Tue Sep 28 09:48:40 GMT 2021]
org.springframework.context.ApplicationContextException: Failed to start bean 'warcWriterViralOld'; nested exception is java.lang.RuntimeException: java.io.FileNotFoundException: File '/heritrix/state/crawl-fre
quent-npld.host-npld-heritrix3-worker-1/checkpoints/cp00001-20210928094133/warcWriterViralOld' does not exist

i.e. it seems I messed up the warc writer somehow.

anjackson commented 2 years ago

Ah, this was just a mistake due to switching from ukwa/heritrix-worker to ukwa/heritrix Docker images.