When using Whisper's auto-detected language, insert that language into the Cocina

When testing out captioning for Bengali language video, Whisper output text in what appears to be Bengali. (After checking with the curator for South Asian materials, it appears that we do not have anyone on staff who can read or speak Bengali.)

However, we are not yet applying a language tag to these caption files, which results in the display showing the language as English (the default). If we can get the language from Whisper, then we have a place to put it in the Cocina for the VTT file, like so:

"type": "https://cocina.sul.stanford.edu/models/file",
              "externalIdentifier": "https://cocina.sul.stanford.edu/file/9ae07267-1b89-40c3-a6b2-ad265894ab66",
              "label": "qf378nj5000_spa_cap.vtt",
              "filename": "qf378nj5000_spa_cap.vtt",
              "size": 54775,
              "version": 17,
              "hasMimeType": "text/vtt",
              "languageTag": "es",
              "use": "caption",

Note that even though "spa" (for Spanish) is in the filename of the vtt in that example, it's the languageTag field that makes the difference to the display.

Screenshot of current display:

sul-dlss / speech-to-text

When using Whisper's auto-detected language, insert that language into the Cocina #45