Overview:

Coqui STT for Speech Recognition:

Coqui STT is an open-source speech recognition engine that can be run offline. We'll use it to capture speech from the microphone, convert it into text, and process the recognized commands.

Text-to-Speech (TTS):

Use Coqui TTS (or espeak for offline synthesis) to convert the text generated by Buzvis into spoken words.

Business Logic:

Capture microphone input.
Use Coqui STT to convert the spoken command into text.
Process the text, generate a response (either using AI or simple rules for now).
Use TTS to convert the response back to speech and play it back.

karan-aithal / buzvis-

Speech Recognition and Synthesis along with Offline support - Coqui STT and espeak #3

Overview:

Coqui STT for Speech Recognition:

Text-to-Speech (TTS):

Business Logic: