Generates VTT files for a video using FFMPEG and Azure Cognitive Services.
pip3 install ffmpeg-python
pip3 install azure-cognitiveservices-speech
Copy config.example.yml to config.yml and fill with your Azure Information:
key: azurekey
region: azureregion
./vtt-gen.py --input path/to/video --output path/to/subtiles.vtt
VTT file tested with VLC and GNOME Videos
The output of this script is very basic, but can be customized, please see the following links for working with the vtt file.
Links to web pages for helping upload caption files: