Islandora-Labs / islandora_solution_pack_oralhistories

Adds all required Fedora objects to allow users to ingest and retrieve Oral Histories (video/audio) files through the Islandora interface
GNU General Public License v3.0

Option for capturing Table of Contents in addition to Transcript #132

Open wpwentzell opened 6 years ago

wpwentzell commented 6 years ago

The Briscoe Center for American History at the University of Texas is really enjoying this module! There is one major setback that prevents us from fully embracing it as it stands:

Currently, there is no way to capture both a table of contents and a transcript and to distinguish between them. The only feasible workaround we found was to create a separate speaker named "Table of Contents" and provide timecoded text cues that overlap with the transcription cues, using the same transcription tier. This also creates overlapping timecode text in the captions. The obvious additional drawbacks of this approach are that all "Speakers" are still defined as such, are not filterable, and still fall under the umbrella of "Transcript" rather than having their own distinct hierarchy/tier.

A more elegant solution would be to allow multiple tiers to display distinctly, so that they can be filtered independently of each other. My initial thought was that the tiers would allow for this behavior, but they do not. The following XML cues validate, but do not behave as expected. The timecode text mixes together due to the overlap, and it is not possible to exclude one tier or the other without the timecode play marker remaining.

    <cue>
        <speaker>Table of Contents</speaker>
        <start>1</start>
        <end>105</end>
        <toc>part 1</toc>
    </cue>
    <cue>
        <speaker>Table of Contents</speaker>
        <start>106</start>
        <end>200</end>
        <toc>part 2</toc>
    </cue>
    <cue>
        <speaker>Test Speaker One</speaker>
        <start>1</start>
        <end>5</end>
        <transcript>Hello, my name is Fred</transcript>
    </cue>
    <cue>
        <speaker>Test Speaker Two</speaker>
        <start>7</start>
        <end>15</end>
        <transcript>Hello Fred, it's nice to meet you</transcript>
    </cue>
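
To illustrate the separation being requested, here is a minimal Python sketch (not part of the module) that groups cues by their tier element (`<toc>` or `<transcript>`) rather than by speaker. The wrapping `<cues>` root element and the function name `split_tiers` are assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Sample cues from this issue, wrapped in a hypothetical <cues> root element.
SAMPLE = """<cues>
    <cue>
        <speaker>Table of Contents</speaker>
        <start>1</start>
        <end>105</end>
        <toc>part 1</toc>
    </cue>
    <cue>
        <speaker>Table of Contents</speaker>
        <start>106</start>
        <end>200</end>
        <toc>part 2</toc>
    </cue>
    <cue>
        <speaker>Test Speaker One</speaker>
        <start>1</start>
        <end>5</end>
        <transcript>Hello, my name is Fred</transcript>
    </cue>
    <cue>
        <speaker>Test Speaker Two</speaker>
        <start>7</start>
        <end>15</end>
        <transcript>Hello Fred, it's nice to meet you</transcript>
    </cue>
</cues>"""


def split_tiers(xml_text, tier_tags=("toc", "transcript")):
    """Group cues into one list per tier, keyed by tier element name."""
    root = ET.fromstring(xml_text)
    tiers = defaultdict(list)
    for cue in root.iter("cue"):
        start = float(cue.findtext("start"))
        end = float(cue.findtext("end"))
        for tag in tier_tags:
            text = cue.findtext(tag)
            # Only record a tier entry when the cue has non-empty text for it.
            if text and text.strip():
                tiers[tag].append((start, end, text.strip()))
    return dict(tiers)


tiers = split_tiers(SAMPLE)
```

With the tiers held in separate lists, overlapping time ranges no longer mix, and each tier could in principle be shown or hidden on its own.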

My apologies if this request is unclear, or if this is not formatted to standards. This is my first request submitted for a feature implementation, so please let me know how I can be more helpful!

MarcusBarnes commented 6 years ago

@wpwentzell Do you have an example of an existing oral history in another system that has a table of contents which I can take a look at to get a better idea of what you might need and how we might implement this? We'll review this feature request as we work on the 7.x-1.11 release of the Oral Histories Solution Pack. Thanks!

Natkeeran commented 6 years ago

@wpwentzell

Does your TOC timecode correspond to cue timecodes? Something like this:

    <cue>
        <speaker>Test Speaker One</speaker>
        <start>1</start>
        <end>5</end>
        <transcript>Hello, my name is Fred</transcript>
        <toc>Hello, my name is Fred</toc>
    </cue>

If so, that functionality could be achieved. We can possibly hide the tier if the content is empty.
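
The "hide the tier if the content is empty" idea could be sketched roughly as follows. This is only an illustration in Python, not the module's actual code; the dictionary representation of a cue and the function name `visible_tiers` are hypothetical.

```python
def visible_tiers(cues, tier_names=("transcript", "toc")):
    """Return the tier names that have non-empty text in at least one cue."""
    return [
        name
        for name in tier_names
        if any((cue.get(name) or "").strip() for cue in cues)
    ]


# Hypothetical in-memory form of two cues; the second has no TOC entry.
cues = [
    {"speaker": "Test Speaker One", "start": 1, "end": 5,
     "transcript": "Hello, my name is Fred", "toc": "Hello, my name is Fred"},
    {"speaker": "Test Speaker Two", "start": 7, "end": 15,
     "transcript": "Hello Fred, it's nice to meet you", "toc": ""},
]

shown = visible_tiers(cues)
```

A tier is kept as long as any cue carries text for it, so a recording with no `<toc>` content at all would show only the transcript tier.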

wpwentzell commented 6 years ago

@Natkeeran

Hi Natkeeran, yes, that is the idea. We would ultimately need a way to distinguish between the two timecodes, toggling them off and on.

@MarcusBarnes

Here is an example of the OHMS solution pack, a different project that has the functionality we're looking for: http://digital.lib.sfu.ca/techbc-71/dianne-cyr-2014-11-26-sfu-surrey

This project has a toggle between index and transcript, as well as built-in keyword search.