cr08 / TwitchVault

Simplified tool to automatically download/archive VODs, clips, highlights, including associated chat logs for specified Twitch channels
GNU General Public License v3.0
18 stars 1 forks source link

VTT generation produces files that show one word at a time (as viewed via VLC) #3

Open cr08 opened 2 years ago

cr08 commented 2 years ago

Initial testing of the VTT render code and playing back in VLC shows that it displays a single word at a time on screen. Accuracy at first blush seems good and timing is perfect otherwise.

More research needs to be done here. Plan is to either find out how to fix the VTT file generation or skip it and just have Vosk write out an SRT file instead.

This is low priority.

cr08 commented 2 years ago

Looks like vosk-api has a good example of SRT output. Will be testing to see if the same single word output issue exists and further research, however this looks promising and will test this soon...

https://github.com/alphacep/vosk-api/blob/master/python/example/test_srt.py

cr08 commented 2 years ago

The basis of this has been sorted and ironically the code streamlines quite a bit. In brief testing this works. Will continue to test but the change has been committed to the repo.

Still need to change all 'VTT' references/variables to SRT now that we are outputting that format. This does not affect the functionality in any way, just a final cleanup step that needs to be done.

cr08 commented 2 years ago

One additional side addition I want to do here is see about adding the option to use the 'full' English speech model (vosk-model-en-us-0.22) in addition to the current 'small' model as included in this repo. More testing needs to be done to see how resource intensive this is on an 'average' test system.

We'll probably do like TDCLI and remove the models entirely from the repo and rely on the user to download and place it in their working directory. This will especially be needed for the full model which runs at a whopping 1.8GB. Pretty sure Github won't be happy hosting that in a code repo. 😛

cr08 commented 2 years ago

Work is largely completed on this and functional. The only thing remaining is changing calls from VTT/WebVTT to something more 'current' with the new code. All of it is merely cosmetic work. Keeping this open as a TODO until that is completed.