Your own voice personal assistant: Voice to Text to LLM to Speech, displayed in a web interface.
Python 3.8 - 3.11
Make sure you have the following API keys:
git clone https://github.com/AlexandreSajus/JARVIS.git
pip install -r requirements.txt
.env
file in the root directory and add the following variables:DEEPGRAM_API_KEY=XXX...XXX
OPENAI_API_KEY=sk-XXX...XXX
ELEVENLABS_API_KEY=XXX...XXX
display.py
to start the web interfacepython display.py
jarvis.py
to start the voice assistantpython main.py
Listening...
Stopped listening
Speaking...
Here is an example:
Listening...
Done listening
Finished transcribing in 1.21 seconds.
Finished generating response in 0.72 seconds.
Finished generating audio in 1.85 seconds.
Speaking...
--- USER: good morning jarvis
--- JARVIS: Good morning, Alex! How can I assist you today?
Listening...
...