[FEAT]: Audio to Text - Whisper / LocalAi

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

https://anythingllm.com

MIT License

23.59k stars 2.38k forks source link

[FEAT]: Audio to Text - Whisper / LocalAi #2347

Open czerr opened 6 days ago

czerr commented 6 days ago

What would you like to see?

Hello everyone, First of all, thank you for this superb project. Would it be possible to use LocalAI for Whisper? Currently the model is Xenova Whisper which uses the CPU... I have a LocalAI WHISPER Large server optimized for NVIDIA GPU that I would like to use to speed up Audio transcription... Thanks for your feedback. Christophe

timothycarambat commented 6 days ago

Oh that is cool, i did not know LocalAI had whisper model support. We can indeed add it as a supported provider so you can get faster speeds

czerr commented 5 days ago

Hi Thimoty,

Yes ! You can !

Look here -> https://localai.io/features/audio-to-text/

As you did for LLM, the possibility of choosing your own Local AI Whisper server would be TOP! My Local AI Whisper server runs on Docker Desktop, which is optimized for Cuda. Thanks for your feedback ! Christophe.