This app was created specifically for my use case and not optimized.
It combines processing from several AI models into one app to automate the transcription of .wav and .mp4 files.
A .json file will be created for each step, this is for my own troubleshooting and possible to restart from any step
This program has been created to be modular, not with efficiency in mind.
For installation please install the necessary dependencies in requirements.txt and also refer to https://pytorch.org/get-started/previous-versions/ for installation of the necessary cuda support.
Please also obtain your own pyannote token
I have used python 3.8 as some of the libraries does not support 3.10 yet.