emory-libraries / dlp-curate

Digital curation and preservation workbench for the Emory Preservation Repository.
11 stars 4 forks source link

Implement fulltext indexing and search across works in Curate and Lux #2185

Open eporter23 opened 1 year ago

eporter23 commented 1 year ago

Story

As a repository end user, I want to be alerted to matches in full-text in my search results, so that I can find material that matches my interests more quickly

As a repository administrator, I want a user interface to manage full-text indexing of specific works, so that I can ensure only materials with OCR data undergo the additional processing

See also the enhancement request from the Rose Library.

Acceptance Criteria

Use one or more of the following options to provide acceptance criteria.

Notes

This epic follows work done previously to research and prototype options for enabling full text works. This epic seeks to actually implement a solution. The scope of this epic would include enhancements to the full text indexing process in Curate as well as enabling search and display of full text matches within Lux (https://digital.library.emory.edu). Searching within an individual work in the Universal Viewer will be a separate effort.

As background, our digitized text material which has already had OCR processing consists of 2 main types of outputs, "Kirtas" and "LIMB". Kirtas is the legacy output, and some examples include the Yellowbacks. LIMB is the newer, current output and includes collections such as the Yearbooks.

Both will always contain:

Kirtas volumes also contain:

LIMB volumes also contain:

Links to Additional Information

Add links here

Checklist

Optional if time permits

Given/When/Then

eporter23 commented 1 year ago

Re-opening because this is an Epic.