-
This has been requested a few times, most recently by Beaudry Allen, Digital Archivist at Villanova, but there is currently no way to do this in the WAIL interface.
Q: What needs to be included in a…
-
Rather than our own `webrender-api`, consider switching to https://github.com/webrecorder/browsertrix-crawler
The integration pattern is somewhat different to Browsertrix's primary use case, but it…
-
Hi,
I've observed in the code that the value "${launchId}" is expected to be replaced with some value, but I'm not sure what that value is. Anyway, I'm trying to understand the configuration file and I found that the…
-
After noticing a small crawl gap:
```
2019-12-05 10:00:12,826 INFO: Worker Worker(salt=769972995, workers=1, host=ingest, username=root, pid=28327) was stopped. Shutting down Keep-Alive thread
20…
```
-
We are testing OpenWayback using a .warc file generated by Heritrix.
We run OpenWayback on CentOS 7 with Tomcat 7. OWB seems capable of indexing the URLs in the .warc file. However, when we click the version (da…
-
These would be existing locations that they want to replay from, not necessarily ones that Heritrix writes to. As an example, they may want to specify a folder in their local Dropbox directory. This …
-
Tested in both the basic and advanced interfaces. I tried crawling https://matkelly.com and the default https://matkelly.com/wail; both resulting WARCs contain only the DNS record.
Other URIs seem to …
-
We installed the latest OpenWayback version, reindexed all crawled content using CDX, and started searching.
Reviewing the results table after querying for a URL, some of the results have more than one entry for a d…
-
It could be clearer that the Yes or No buttons refer to installing Java. We may also want to disclose to the user from where it is being downloaded and what is being downloaded and installed.
![scr…