IIIF / iiif-stories

Community repository for documenting stories and use cases related to uses of the International Image Interoperability Framework.
21 stars 0 forks source link

I want to identify annotations that make up a transcript for an AV item #142

Open elynema opened 5 months ago

elynema commented 5 months ago

Description

For an audio or video item, I may have one annotation, or a series of annotations, that represent a transcription of the spoken text. For audio only items, this transcription may be required to be available to the user to meet accessibility requirements. In order to allow users to see just the annotation(s) that contain the transcription text, the IIIF viewer must be able to filter out just those annotations when presenting different types of annotations to the user.

Transcription annotation(s) will likely be presented to the user via a component or div in the IIIF viewer and should be able to be downloaded by the user. They should not be rendered on top of the video, as caption/subtitle annotations would be, and they may not be timed text, so they may not be possible to render as a caption/subtitle.

Variation(s)

Proposed Solutions

Introduce annotation motivation of transcribing to declare the nature of these annotations.

Proposed snippet that utilizes a transcribing motivation for an annotation that points out to a webvtt file.

"annotations": [
{
"type": "AnnotationPage",
"id": "https://avalon-dev.dlib.indiana.edu/media_objects/gf06g267d/manifest/canvas/d217qp50k/annotation_page/1005c73e-4b3d-4786-a0b0-49c7f1ecc43d",
"items": [
{
"type": "Annotation",
"motivation": "transcribing",
"body": {
"id": "https://avalon-dev.dlib.indiana.edu/master_files/d217qp50k/supplemental_files/95",
"type": "Text",
"format": "text/vtt",
"label": {
"none": [
"The Legislative Organization of Congress.vtt"
]
},
"language": "eng"
},
"id": "https://avalon-dev.dlib.indiana.edu/media_objects/gf06g267d/manifest/canvas/d217qp50k/annotation_page/1005c73e-4b3d-4786-a0b0-49c7f1ecc43d/supplementing/83695573-ad20-47ca-8444-c12ac90c6a13",
"target": "https://avalon-dev.dlib.indiana.edu/media_objects/gf06g267d/manifest/canvas/d217qp50k"
}
]

Additional Background

Transcriptions can be represented as a single annotation that points out to an external file, which may be timed text (such as a webvtt file) or non-timed text (such as a word document). They may also be represented one or a series of annotations that contain the actual text of the transcription. In either case, the transcribing motivation would be relevant.

kevinglickavp commented 5 months ago

Transcripts of audio files (as well as video files) are displayed in Aviary, beside the media player. Se the screenshot as an example. It may be confusing, but this example is not a video, but instead an audio with a thumbnail display image. TranscriptionOfAudioFilewithTranscript

elynema commented 5 months ago

Aviary has been employing a transcribing motivation to explicitly identify transcripts alongside a subtitling motivation for quite some time. Here is an example of an Aviary manifest that utilizes both motivations: https://weareavp.aviaryplatform.com/iiif/mc8rb6wr0j/manifest

One of the transcribing annotation snippets follows. Although this is technically timed text since it refers to a time fragment of the target, it is far too long of a string of text to be utilized as a caption, and so must be displayed outside of the video in a separate component in the player.

"id": "https://weareavp.aviaryplatform.com/collections/41/collection_resources/42190/file/114198/transcript/29336/annotation/1",
"type": "Annotation",
"motivation": "transcribing",
"body": {
"type": "TextualBody",
"value": "I see you're curious about Aviary, come with me on a quick flight around and let me show you how to Aviary. There are a few different types of users of Aviary. The first is an anonymous, unregistered user and anonymous unregistered user can come into Aviary and search for audiovisual content and its resources. To do so, I show an example. We'll go to Queen's Public Libraries page and I'll search Solange. Upon doing so, I'll get a list of results that match my example. Now, when you have a free and anonymous account, you can only see results of things that are said as public by the publisher, if you'd like to see things that aren't set as public. You'll have to go in and create an account. So we see here I have one result from my search. Now to create an account. I'd go here and log in and I'll go down and I'll hit join now, and upon doing so you can make a free account of Aviary and from here you'll be a registered user. Registered users are also free, and once they log in, they can now ask for access to restricted resources in their searches. I'm going to log in and demonstrate that now. And once I'm logged in, you'll see my original search request was here, which is public, and then now I see another result from my search and when I click on it, I need to request access to view this from the publisher. And again, in order to request access from the publisher, we need to have a general log in and create an account which is free. Finally, the last type of user of Aviary is the publisher. Publishers are people, organizations that have audio or video that they want to share, either publicly or privately.",
"format": "text/plain"
},
"target": "https://weareavp.aviaryplatform.com/collections/41/collection_resources/42190/file/114198#t=7.22,131.679"
elynema commented 5 months ago

Explicit display of transcript in Ramp (as embedded in Avalon) in a separate ta:

image