jeremyandrews / kakaia

Kakaia personal assistant
MIT License
13 stars 0 forks source link

accept stream of audio, convert to text in real-time #9

Open jeremyandrews opened 4 years ago

jeremyandrews commented 4 years ago

Currently we only accept a base64-encoded audio file, which we then convert to text with deepspeech. For very short commands this works, but it adds unnecessary delay as processing can't start until the complete command has arrived.

Instead, we need to accept an audio stream, allowing deepspeech to convert the text in near-real-time.

Look into whether to use raw sockets, web sockets, or something else, to accomplish this.