-
The introductory Arquillian example in Chapter 03 uses a remote JBoss container.
The test case deploys the WAR file built from the main Java resources, which probably do not contain the test case itself:
## [s…
-
Hi @anjackson: could you confirm how the Website Title facet field is derived? Is it that everything contained within the host news.bbc.co.uk probably has Website Title = BBC News?
If so, I think it is …
-
Fuck link rot.
Can I make an Action which:
- Parses all markdown in the content folder for links.
- Downloads a list of previously known links.
- Filters for only new links.
- Screenshots, weba…
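
A rough sketch of the first three steps (parse, compare against known links, keep only new ones), not a finished Action. The function names, the `*.md` layout of the content folder, and the idea that the known links arrive as a plain list are all assumptions made for illustration. The regexes cover inline links and `<...>` autolinks only, and will cut off URLs that themselves contain parentheses.

```python
import re
from pathlib import Path

# Inline markdown links and images: [text](url) or ![alt](url "title")
INLINE_LINK = re.compile(r"\[[^\]]*\]\((https?://[^\s)]+)")
# Bare autolinks: <https://example.com>
AUTOLINK = re.compile(r"<(https?://[^>\s]+)>")


def extract_links(markdown_text):
    """Return the set of http(s) URLs found in one markdown document."""
    links = set(INLINE_LINK.findall(markdown_text))
    links.update(AUTOLINK.findall(markdown_text))
    return links


def links_in_folder(content_dir):
    """Collect links from every .md file under content_dir (assumed layout)."""
    found = set()
    for path in Path(content_dir).rglob("*.md"):
        found |= extract_links(path.read_text(encoding="utf-8"))
    return found


def filter_new(found, known):
    """Keep only links absent from the previously known list, sorted for stable output."""
    return sorted(set(found) - set(known))
```

In a workflow, `filter_new(links_in_folder("content"), known_links)` would give the list to hand to whatever archiving step comes next; how `known_links` is downloaded and stored is left open here.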
-
I get
"mmap() failed: [12] Cannot allocate memory
PHP Fatal error: Out of memory (allocated 3105890304) (tried to allocate 4096 bytes) in phar:///usr/local/bin/composer/src/Composer/DependencyRes…
-
webarchive-commons uses GPL v2 code in at least two places.
[OpenJDK7GZIPInputStream](https://github.com/iipc/webarchive-commons/blob/24846d0d8870e8c6f4d35901a83cda593544dc97/src/main/java/org/archiv…
-
I originally asked this question about a feature request here: https://developer.jboss.org/thread/275654
So this is a repeat of that. I was testing a new WFSwarm fraction that needed to add an Und…
-
wayback_machine_downloader www.gylymordasy.kz
Downloading www.gylymordasy.kz to websites/www.gylymordasy.kz/ from Wayback Machine archives.
Getting snapshot pagesTraceback (most recent call last):…
-
Try webarchive in that case. You will be surprised.
-
For example:
https://fatcat.wiki/file/rcbebk4ox5esbnnpipbnegy7si
Some file entities have two wayback URLs, one with 12 digits and one with the full 14. In the majority of cases, however, there i…
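
One way to spot such pairs, as a sketch only: it assumes the usual `https://web.archive.org/web/<timestamp>/<original URL>` shape and treats a 12-digit timestamp as redundant when a 14-digit timestamp for the same original URL starts with it. The function names are made up for this example and are not part of any fatcat tooling.

```python
import re

# Assumed Wayback URL shape: /web/<digits><optional modifier such as id_>/<original URL>
WAYBACK = re.compile(r"^https?://web\.archive\.org/web/(\d+)[a-z_]*/(.+)$")


def parse_wayback(url):
    """Return (timestamp, original_url), or None if the URL does not match."""
    match = WAYBACK.match(url)
    if match is None:
        return None
    return match.group(1), match.group(2)


def redundant_short(urls):
    """Map each original URL to a (12-digit, 14-digit) timestamp pair, if one exists."""
    by_target = {}
    for url in urls:
        parsed = parse_wayback(url)
        if parsed is not None:
            timestamp, target = parsed
            by_target.setdefault(target, []).append(timestamp)

    pairs = {}
    for target, stamps in by_target.items():
        shorts = [s for s in stamps if len(s) == 12]
        longs = [s for s in stamps if len(s) == 14]
        for short in shorts:
            # The short form counts as a duplicate only if a full timestamp extends it.
            full = next((l for l in longs if l.startswith(short)), None)
            if full is not None:
                pairs[target] = (short, full)
                break
    return pairs
```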
-
The extraction rules for links to resources in https://github.com/netarchivesuite/webarchive-discovery/blob/some/warc-indexer/src/main/java/uk/bl/wa/analyser/payload/TwitterAnalyser.java should be syn…
tokee updated
2 years ago