internetarchive / iiif

The official Internet Archive IIIF service
GNU General Public License v3.0

v3 support for A/V captions as annotations #49

saracarl closed this issue 4 months ago

saracarl commented 8 months ago

Moved from https://github.com/ArchiveLabs/iiif.archivelab.org/issues/79.

If captions or transcripts are available for A/V material in the Internet Archive, manifests should expose them as annotations so that players can display them.

Example items: https://blog.archive.org/2018/03/07/archive-video-now-supports-webvtt-for-captions/. We can also add some fixtures to https://github.com/ArchiveLabs/iiif.archivelab.org/issues/73

benwbrum commented 5 months ago

If we have derivative VTT files, it should be straightforward to attach them as annotations.

saracarl commented 5 months ago

Cookbook recipe example: https://iiif.io/api/cookbook/recipe/0219-using-caption-file/. It can be tested in Ramp: https://ramp.avalonmediasystem.org/
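
For reference, here is a rough sketch of the annotation shape that recipe uses, as I understand it: a `supplementing` annotation whose body is the VTT file, wrapped in an AnnotationPage. The function name, the example URLs, the `/annotations/captions` id suffix, and the default label text are all assumptions for illustration, not what the IA service actually emits.

```python
def caption_annotation_page(canvas_id, vtt_url, language=None,
                            label="Captions in WebVTT format"):
    """Build an AnnotationPage holding one caption annotation for a canvas.

    The id patterns below are hypothetical; any stable, unique URIs would do.
    """
    body = {
        "id": vtt_url,
        "type": "Text",
        "format": "text/vtt",
        # IIIF language maps use "none" when the language is unknown.
        "label": {language or "none": [label]},
    }
    if language:
        body["language"] = language

    annotation = {
        "id": f"{canvas_id}/annotations/captions/1",
        "type": "Annotation",
        "motivation": "supplementing",
        "body": body,
        "target": canvas_id,
    }
    return {
        "id": f"{canvas_id}/annotations/captions",
        "type": "AnnotationPage",
        "items": [annotation],
    }


if __name__ == "__main__":
    import json

    page = caption_annotation_page(
        "https://example.org/iiif/3/some-item/canvas",  # hypothetical canvas id
        "https://example.org/download/some-item/captions.en.vtt",  # hypothetical
        language="en",
    )
    print(json.dumps(page, indent=2))
```

In the recipe the page is listed under the canvas's `annotations` property, separate from the painting annotations in `items`.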

glenrobson commented 5 months ago

Example item from blog:

    {
          "name": "cruz-test.en.vtt",
          "source": "original",
          "format": "Web Video Text Tracks",
          "mtime": "1477931243",
          "size": "6223",
          "md5": "ed675b8fd0e0dcfced5e48e81a6aae21",
          "crc32": "0397fd6d",
          "sha1": "6327cd74bcbabd501931cefa479b2309b96b3639"
    },
    {
          "name": "cruz-test.cy.vtt",
          "source": "original",
          "format": "Web Video Text Tracks",
          "mtime": "1477931247",
          "size": "831",
          "md5": "1dfacb0563da71c4e3f9e856e1b2a2d4",
          "crc32": "0cc01233",
          "sha1": "4758e5c47272c1fe58882a5e79a456350b834eb3"
    }

glenrobson commented 5 months ago

This one has a single autogenerated VTT file:

https://archive.org/metadata/youtube-SvH4fbjOT0A

    {
          "name": "34C3_-_International_Image_Interoperability_Framework_IIIF_Kulturinstitutionen_schaffen_interop-SvH4fbjOT0A.autogenerated.vtt",
          "source": "original",
          "mtime": "1670021248",
          "size": "18376",
          "md5": "b5fbddfb39f77a9106418d867894c436",
          "crc32": "1eb49e29",
          "sha1": "a4e0038ab3fd1904e025bcf990e91f537894975b",
          "format": "Web Video Text Tracks"
    }
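Going by these entries, caption files could probably be picked out of the item's file list by their `format` value. A minimal sketch, assuming the list is the `files` array from the metadata response and that a two- or three-letter suffix before `.vtt` (as in `cruz-test.en.vtt`) is a language code. That naming convention is a guess from the examples here, and `find_caption_files` is a made-up helper name.

```python
import re

VTT_FORMAT = "Web Video Text Tracks"
# Assumption: "<name>.<lang>.vtt" carries a 2-3 letter language code.
LANG_SUFFIX = re.compile(r"\.([a-z]{2,3})\.vtt$")


def find_caption_files(files):
    """Return name and guessed language for each WebVTT entry in a file list."""
    captions = []
    for entry in files:
        if entry.get("format") != VTT_FORMAT:
            continue
        match = LANG_SUFFIX.search(entry["name"])
        captions.append({
            "name": entry["name"],
            # None when the filename gives no usable hint,
            # e.g. "...autogenerated.vtt".
            "language": match.group(1) if match else None,
        })
    return captions


if __name__ == "__main__":
    sample = [
        {"name": "cruz-test.en.vtt", "format": "Web Video Text Tracks"},
        {"name": "cruz-test.cy.vtt", "format": "Web Video Text Tracks"},
        {"name": "talk-SvH4fbjOT0A.autogenerated.vtt",
         "format": "Web Video Text Tracks"},
        {"name": "cruz-test.mp4", "format": "MPEG4"},
    ]
    for caption in find_caption_files(sample):
        print(caption)
```

An autogenerated file ends up with no language, so the manifest would need a fallback there (for example the item's own language metadata, if it has any).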

Manifest: https://iiif.archive.org/iiif/3/youtube-SvH4fbjOT0A/manifest.json

glenrobson commented 5 months ago

I've got the first VTT file linked, but it's not working because no CORS headers are set for the VTT file:

    < HTTP/2 200
    < server: nginx/1.25.1
    < date: Fri, 26 Apr 2024 11:53:34 GMT
    < content-type: text/plain;charset=UTF-8
    < strict-transport-security: max-age=15724800
    < onion-location: https://archive6zg5vrdwm4ljllgxleekeoj43lqayscd4d4kmhnyblq4h3ead.onion/download/youtube-SvH4fbjOT0A/34C3_-_International_Image_Interoperability_Framework_IIIF_Kulturinstitutionen_schaffen_interop-SvH4fbjOT0A.autogenerated.vtt
    < referrer-policy: no-referrer-when-downgrade

glenrobson commented 4 months ago

Next step: look at proxying the VTT file so that it can be served with CORS headers.
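
A minimal sketch of the header handling such a proxy might do, using only plain dicts so it is independent of any web framework. Both function names are hypothetical. Relabelling the content type as `text/vtt` is my own addition (the download endpoint returns `text/plain` in the response above), and allowing any origin with `*` is an assumption about what the service would want.

```python
def allows_cross_origin(headers):
    """True if the response headers include Access-Control-Allow-Origin."""
    return any(name.lower() == "access-control-allow-origin" for name in headers)


def proxied_vtt_headers(upstream_headers):
    """Build headers for a proxied VTT response from the upstream headers."""
    upstream = {name.lower(): value for name, value in upstream_headers.items()}

    # Pass through only a few caching/length headers; drop everything else.
    headers = {
        name: upstream[name]
        for name in ("content-length", "last-modified", "etag")
        if name in upstream
    }
    # Assumption: serve the captions with the WebVTT media type.
    headers["content-type"] = "text/vtt; charset=utf-8"
    # Assumption: captions are public, so any origin may fetch them.
    headers["access-control-allow-origin"] = "*"
    return headers


if __name__ == "__main__":
    upstream = {
        "server": "nginx/1.25.1",
        "content-type": "text/plain;charset=UTF-8",
        "referrer-policy": "no-referrer-when-downgrade",
    }
    print(allows_cross_origin(upstream))
    print(proxied_vtt_headers(upstream))
```

The proxy route itself would fetch the file from the download URL and return the body unchanged with these headers. The manifest's annotation body would then point at the proxy URL instead of the download URL.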