morsczx closed this 2 months ago
@morsczx did you resolve this issue? I was also trying to fine-tune my segmentation performance and was wondering whether chunk size would have an impact. I agree that these could be moved to a config file.
@kevdawg94 - no, I did not resolve the issue. I thought I might have been wrong in the suggestion. I had a lot of audio clips with languages in 10-second intervals, which were being incorrectly identified, hence I wanted control over the chunk size. Will take this up.
I found you can already adjust chunk size as follows:
transcription = model.transcribe(audio, batch_size=batch_size, chunk_size=chunk_size)
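To check whether chunk size actually changes the output, one option is to run the same call over a few values and compare the results. This is only a rough sketch: `compare_chunk_sizes` is a hypothetical helper (not part of whisperx), and it assumes nothing about `model` beyond the `transcribe(audio, batch_size=..., chunk_size=...)` call shown above.

```python
from typing import Any, Dict, Iterable


def compare_chunk_sizes(
    model: Any,
    audio: Any,
    chunk_sizes: Iterable[int],
    batch_size: int = 16,
) -> Dict[int, Any]:
    """Transcribe the same audio once per chunk size (hypothetical helper).

    `model` only needs a transcribe(audio, batch_size=..., chunk_size=...)
    method, as in the call quoted in this thread.
    """
    results: Dict[int, Any] = {}
    for chunk_size in chunk_sizes:
        # Same call as in the thread, with only chunk_size varied.
        results[chunk_size] = model.transcribe(
            audio, batch_size=batch_size, chunk_size=chunk_size
        )
    return results
```

With whisperx, `model` would presumably come from `whisperx.load_model(...)` and `audio` from `whisperx.load_audio(...)`; something like `compare_chunk_sizes(model, audio, [10, 20, 30])` would then let you compare the detected language and segments for each setting.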
gotcha, thanks a lot!
Disclaimer: I'm new to GitHub and have a question. In whisperx/audio.py, the hyperparameters are hardcoded. Can or should we move these to a config file so that we can edit them as required, specifically chunk_length? If so, what file should be created? I can raise a PR based on suggestions.
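For illustration, the config-file idea could look roughly like the sketch below. Everything here is an assumption for discussion: `AudioConfig`, `load_audio_config`, and the JSON file are hypothetical names that do not exist in whisperx, and the default values (16000 Hz sample rate, 30-second chunks) are what I believe audio.py currently hardcodes and should be checked against the source.

```python
import json
from dataclasses import dataclass, fields
from pathlib import Path
from typing import Optional


@dataclass(frozen=True)
class AudioConfig:
    """Hypothetical container for values currently hardcoded in audio.py."""

    sample_rate: int = 16000  # assumed default, verify against audio.py
    chunk_length: int = 30    # seconds; assumed default, verify against audio.py

    @property
    def n_samples(self) -> int:
        # Number of samples in one chunk.
        return self.chunk_length * self.sample_rate


def load_audio_config(path: Optional[str] = None) -> AudioConfig:
    """Load overrides from a JSON file, falling back to the defaults."""
    if path is None or not Path(path).exists():
        return AudioConfig()
    raw = json.loads(Path(path).read_text())
    known = {f.name for f in fields(AudioConfig)}
    unknown = set(raw) - known
    if unknown:
        # Fail loudly on misspelled keys instead of silently ignoring them.
        raise ValueError(f"Unknown config keys: {sorted(unknown)}")
    return AudioConfig(**raw)
```

A JSON file containing `{"chunk_length": 10}` would then override only the chunk length and leave the other defaults untouched.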