samvera / hydra-works

A ruby gem implementation of the PCDM Works domain model based on the Samvera software stack
Other
24 stars 14 forks source link

move full text extraction from curation concerns to works. #195

Closed blancoj closed 9 years ago

blancoj commented 9 years ago

Move full text extraction from curation concerns to works.

mjgiarlo commented 9 years ago

Thanks much for the PR, @blancoj! I have just a few comments and requests for you.

grosscol commented 9 years ago

In order to support the full text extraction functionality, solr needs additional libraries. Obtaining those libraries was done by an install generator in curation concerns. The issue of installing the libraries for full text extraction, and modifying the solr config were eschewed here, as those changes have already been made in umich's shared development environment.

So the issue of getting the libraries and modifying the solrconfig.xml remains. I created an issue to gauge interest in putting as an option in jettywrapper with projecthydra/jettywrapper#46.

A number of libraries need to be added under lib/contrib/extraction/lib, and the needs to be modified to point to them solrconfig.xml

<!-- for full-text indexing -->
<lib dir="../lib/contrib/extraction/lib" regex=".*\.jar" />
mjgiarlo commented 9 years ago

Left some comments that shouldn't add too much work, @blancoj. Thanks!