Open djthorpe opened 1 year ago
Great work!
Keen on realtime translation and a way of calling out/streaming the output to another app - gRPC seems the best option for this
Yeah thanks.
I'm doing the audio downsampling to 16KHz at the moment in a different repository (go-media)
The realtime transcription and translation should be pretty straightforward, but pretty experimental, even for whisper.cpp
I will take a while to get to the gPRC microservice :-(
Added a "stream" command for the start of real-time streaming, but:
There's also some issues with the segmenting in the main package (repeated segments come out!) needs fixing.
Coming back to this after some time!
Remaining tasks:
Lower priority:
Also:
Also:
Simplified Dockerfile and now uses the base images from here as a base:
https://github.com/mutablelogic/docker-llamacpp
This is still now working; Now I need to have the ffmpeg shared libraries included in the runtime image. Considering whether to just copy over the libraries from the build image, or to install ffmpeg libraries from source.
Create bindings for https://github.com/ggerganov/whisper.cpp