-
Monitrix has seemingly stopped following logs. There was a Cassandra failure as per #17, but the Monitrix interface became available after Cassandra was restarted.
The admin. page shows the various l…
-
hello ,
When i run a new job , i got this error when the job is in progress :
dk.netarkivet.common.exceptions.IOFailure: Crawl probably interrupted by shutdown of HarvestController
i found this…
-
WAIL - Heritrix User Interface (UI) Basic Requirements
1. The UI must list all (potentially many) Heritrix jobs currently residing in the jobs directory.
2. On selecting a job in the UI listing (1), …
-
I was listing all the classes, one-by-one, in my Jdee Global Classpath.
Since it's very difficult to maintain, I decided to switch to "short"ened version when specifying the classpath.
Now, jdee is…
-
Having a subdirectory for crawl/capture artifacts (configuration files, logs, reports etc) would be useful for the use case of storing or transporting an entire crawl job in a form that's also immedia…
-
### Are you submitting a **bug report** or a **feature request**?
Feature request
### What is the current behavior?
I am using a fairly fresh macOS 10.14 that does not have Java installed. Fo…
-
I'm curious about teaching warcprox to record SSL certificates. Has there been any internal work or discussion you can share? Do any other crawlers (Heritrix) currently record certificates?
Cf. this …
-
After working through the process with a guide ( #16) it looks like the [template](https://raw.githubusercontent.com/edgi-govdata-archiving/guides/master/guide-template.md) needs to be updated... this…
-
I remember having this issue when I worked for DoD and now at LANL. We had to set a proxy server for all HTTP and HTTPS traffic. Right now it's making it difficult for me to try out WAIL at LANL.
I k…
-
Likely critical but might not be available via Chrome's webRequest API.
**Heritrix 3.2.0**
``` sh
WARC/1.0
WARC-Type: request
WARC-Target-URI: http://matkelly.com/
WARC-Date: 2015-12-11T13:25:07Z
WA…