AaltoRSE / speech2text

Instructions to setup and use Aalto speech2text app on Triton.
Other
1 stars 1 forks source link

whisperX Integration #3

Closed hsnfirooz closed 9 months ago

hsnfirooz commented 9 months ago

This PR integrates WhisperX to our current pipeline. WhisperX enables batch computation which leads to faster inference. The current status is x25 realtime speed.

Major changes:

Required Files WhisperX requires this VAD model to be available in TORCH_HOME directory. TORCH_HOME can be accessed via torch.hub._get_torch_home() and in speech2text module is located at /scratch/shareddata/speech2text

Environment Variables For SPEECH2TEXT_CPUS_PER_TASK, 6 is enough as we are using

ruokolt commented 9 months ago

Looks good to me! I ran black + isort and I think you can squash+merge this.