appleboy / go-whisper

Speech to Text using docker image with ggerganov/whisper.cpp
MIT License
54 stars 6 forks

Fix auto language #7

Closed: ken0911208818 closed this issue 1 year ago

ken0911208818 commented 1 year ago

In Whisper, setting the language to "auto" can significantly slow down execution when using the ggml-large.bin model. Benchmarking a 10-second video with whisper.cpp shows the difference:

With the language set to "auto", the processing time is 423803.19 ms. With the language explicitly set to "zh", the processing time drops to 163633.48 ms, roughly 2.6 times faster. This behavior is demonstrated in the example provided here: https://github.com/ggerganov/whisper.cpp/blob/master/bindings/go/examples/go-whisper/flags.go#L96

However, the go-whisper command currently hangs for me, and I have not been able to resolve the problem.
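
To make the explicit-language idea concrete, here is a minimal sketch of passing a language code through only when it is not "auto". It is not the actual go-whisper or whisper.cpp code: `languageSetter`, `applyLanguage`, and `fakeContext` are hypothetical names, and the `SetLanguage(string) error` method is an assumption about what a whisper context exposes.

```go
package main

import (
	"errors"
	"strings"
)

// languageSetter is a hypothetical stand-in for the part of a whisper
// context that accepts a language code (assumed shape, not a real API).
type languageSetter interface {
	SetLanguage(lang string) error
}

// applyLanguage forwards an explicit language code such as "zh" to the
// context. For "" or "auto" it does nothing and leaves the context's
// default behaviour in place. It reports whether a language was set.
func applyLanguage(ctx languageSetter, lang string) (bool, error) {
	lang = strings.ToLower(strings.TrimSpace(lang))
	if lang == "" || lang == "auto" {
		return false, nil
	}
	if err := ctx.SetLanguage(lang); err != nil {
		return false, err
	}
	return true, nil
}

// fakeContext records the last language it was given, so the sketch can
// be exercised without loading a model.
type fakeContext struct {
	language string
}

// SetLanguage stores the language code and rejects an empty one.
func (f *fakeContext) SetLanguage(lang string) error {
	if lang == "" {
		return errors.New("empty language code")
	}
	f.language = lang
	return nil
}
```

Calling `applyLanguage(&fakeContext{}, "zh")` stores "zh" and returns `true`, while `applyLanguage(&fakeContext{}, "auto")` leaves the context untouched and returns `false`. Whether this avoids the slowdown depends on how the real context handles an unset language, which this sketch does not model.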