machawk1 / wail

:whale2: Web Archiving Integration Layer: One-Click User Instigated Preservation
https://matkelly.com/wail
MIT License

Integrate SolrWayback into WAIL #489

Open machawk1 opened 3 years ago

machawk1 commented 3 years ago

https://github.com/netarchivesuite/solrwayback

SolrWayback could supply full-text search, visualization of WARCs, etc. Its license (Apache) is compatible, but WAIL would also have to bundle Solr, which might be able to reuse the Java that WAIL already bundles.

-- edit --

machawk1 commented 3 years ago

The question was reiterated by @ymaurer at the IIPC WAC 2021 presentation on WAIL. The feasibility ought to be investigated further. Details to come.

machawk1 commented 3 years ago

A barrier on WAIL's end is that it still relies on Java 7.

#345 describes efforts to update Heritrix, including updating the bundled Java to version 11.

Updating to Java > 7 is a requirement before integration with SolrWayback can proceed.
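
As a rough illustration of that requirement, here is a minimal sketch of how the major version could be read from the text that `java -version` prints. The function names and the default minimum of 8 (the lowest version satisfying "> 7") are assumptions for illustration, not existing WAIL code.

```python
import re


def java_major_version(version_output: str) -> int:
    """Extract the major Java version from `java -version` style output."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        raise ValueError("no Java version string found")
    first, second = match.group(1), match.group(2)
    # Before Java 9, versions were reported as 1.x (e.g. "1.7.0_79" is Java 7).
    if first == "1" and second is not None:
        return int(second)
    return int(first)


def meets_minimum(version_output: str, minimum: int = 8) -> bool:
    """Return True if the reported Java version is at least `minimum` (assumed threshold)."""
    return java_major_version(version_output) >= minimum
```

Note that `java -version` writes to stderr rather than stdout, so a caller would need to capture that stream from the bundled Java before passing the text to `meets_minimum`.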