Islandora-Collaboration-Group / ISLE

Islandora Enterprise (ISLE) is a community project that addresses two of the most significant pain-points in Islandora: installation and maintenance.
https://Islandora-Collaboration-Group.github.io/ISLE/
GNU General Public License v3.0
32 stars 34 forks source link

WARC tools missing from ISLE-apache Dockerfile #402

Closed bondjimbond closed 3 years ago

bondjimbond commented 3 years ago

Issue description

The WARC tools from https://github.com/internetarchive/warctools need to be added to the Dockerfile in ISLE-apache (https://github.com/Islandora-Collaboration-Group/isle-apache/blob/master/Dockerfile).

These tools are required to support islandora_solution_pack_web_archive, which depends on warcfilter and warcindex. These are the default paths to those files:

/usr/local/bin/warcfilter
/usr/local/bin/warcindex

For an issue, describe steps to reproduce the issue

Within the apache container, search for warcfilter and warcindex; you won't find them. Ingesting an item with the Islandora Web ARChive content model therefore results in -1 Byte derivatives.

Additional details / screenshots

Screen Shot 2020-12-18 at 2 57 27 PM

noahwsmith commented 3 years ago

@g7morris @bondjimbond I had a few minutes while something else was compiling so I figured out what we needed to add. Gavin had warned me about an unrelated OpenJPEG issue which is preventing this from building fully, but my local testing shows the WarcTools step succeeds.

g7morris commented 3 years ago

Hi @bondjimbond

Thanks for this ticket.

Just an update that I merged in @noahwsmith 's PR https://github.com/Islandora-Collaboration-Group/isle-apache/commit/36f9cdf3aca1d91b628f6d2040647b5b4a468bea to add the WARCTools as directed.

Recommend that you docker-compose down and then docker-compose pull to get the newly built 1.5.3 images when you have a chance to review and test. You might also have to update your associated docker-compose.*.yml files as directed in the docs for upgrading ISLE.

Also @McFateM this might interest you too as I think you are a WARC user.

Cheers, Gavin

bondjimbond commented 3 years ago

Thanks!

g7morris commented 3 years ago

Closing ticket as resolved.