Open JeffersonBledsoe opened 4 weeks ago
A client of mine is testing out Whisper to provide transcriptions of audio and video files used as evidence in criminal trials. It supports transcriptions and translations of multiple spoken languages, but they've only tried English. It works very well when only one voice is speaking, but it breaks down with background noise or music, or when there are multiple voices speaking at once. If no caption file is provided, and if the audio falls within Whisper's technical ability, it could provide a default transcription or translation.
As far as providing i18n support for transcriptions, I have tried Weblate with Plone core, and I am thoroughly impressed with its ease of use. See https://6.docs.plone.org/i18n-l10n/contributing-translations.html#weblate-for-translations. I'm not sure of the workflow for content.
PLIP (Plone Improvement Proposal)
Responsible Persons
Proposer: Jefferson Bledsoe
Seconder:
Abstract
This PLIP aims to deliver a new media uploading and viewing experience to improve the out-the-box accessibility of video when compared to the existing video block, whilst also providing the same level of functionality for audio which is not catered to by Volto currently.
Motivation
The current video block has a number of missing accessibility features such as the ability to add transcriptions, captions and alternative versions of the video for things like Audio Description. The block is based on Semantic UI React's Embed component, which also has its own accessibility issues. We do not have an 'audio' block in Volto and so the same accessibility issues that impact video also impact audio.
Assumptions
Proposal & Implementation
Which media player to use
We evaluated AblePlayer, plyr, MediaElement and video.js.
We are proposing to use video.js for this improvement.
Editing experience
Viewing experience
Implementation
Deliverables
Risks
Participants
Jefferson Bledsoe