I have a complete blog post explaining the project here.
The AI Assistant Automation is a Python application that uses Llama3, gTTS, OpenAI, Groq, and Faster-Whisper to create an intelligent assistant similar to Siri, with integrated image recognition support. This project allows users to interact with the assistant through voice commands and receive responses in audio format.
This application may have limitations based on the performance of the underlying AI models and available computing resources. Ensure that the necessary libraries are properly installed and the system is configured to handle audio and image processing efficiently.
siri-voice-llama3/
├── .git/
├── (gitignored) .venv/
├── logs/
├── data/
│   ├── ai_response/
│   │   ├── .gitkeep
│   │   └── (gitignored) ai_response_audio.mp3
│   ├── chat_history/
│   │   └── 2024/
│   │       └── 10/
│   │           ├── (gitignored) 04.log
│   │           └── (gitignored) 05.log
│   ├── .gitkeep
│   └── (gitignored) user_audio_prompt.wav
├── main.py
├── README.md
├── requirements.txt
└── src/
    ├── __pycache__/
    ├── setup.py
    ├── siri.py
    ├── utils.py
    └── webcam.py
💬 If you are using the HTTPS protocol instead of SSH, change the git clone command accordingly.
git clone git@github.com:shricodev/siri-voice-llama3.git
cd siri-voice-llama3
GROQ_API_KEY=
GOOGLE_GENERATIVE_AI_API_KEY=
# Optional
OPENAI_API_KEY=
You can use the .env.example file as a template.
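Before starting the assistant, it can help to confirm that the keys listed above are actually set. The following is a minimal sketch, not part of the project: `missing_keys` is a hypothetical helper, and treating the first two keys as required is an assumption based on the `# Optional` comment in the template. It checks any mapping, such as `os.environ` once the .env file has been loaded by whatever mechanism the project uses.

```python
# Key names come from the .env template above. The required/optional split
# is an assumption based on the "# Optional" comment in that template.
REQUIRED_KEYS = ("GROQ_API_KEY", "GOOGLE_GENERATIVE_AI_API_KEY")
OPTIONAL_KEYS = ("OPENAI_API_KEY",)


def missing_keys(env, keys=REQUIRED_KEYS):
    """Return the keys that are absent or blank in the given mapping."""
    return [key for key in keys if not (env.get(key) or "").strip()]


# Example with a hand-written mapping; pass os.environ to check the real one.
sample_env = {"GROQ_API_KEY": "example-value", "GOOGLE_GENERATIVE_AI_API_KEY": ""}
print(missing_keys(sample_env))  # ['GOOGLE_GENERATIVE_AI_API_KEY']
```

A blank value is reported as missing because the template ships with empty assignments such as `GROQ_API_KEY=`.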
{shell} src/scripts/start_siri_llama3.{sh/fish} main.py
Change the placeholder {shell} to the shell you are using. It can be either bash or fish.
python3 -m venv .venv
source .venv/bin/activate.fish # or .venv/bin/activate if you are not using the fish shell
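If you are unsure whether the activation step took effect, Python itself can tell you. This is a small sketch rather than project code; `in_virtualenv` is a hypothetical helper name, and it relies only on the standard `sys.prefix` and `sys.base_prefix` attributes.

```python
import sys


def in_virtualenv():
    """Return True when the interpreter runs inside a venv-style environment."""
    # Inside a venv, sys.prefix points at the environment directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


print("virtual environment active:", in_virtualenv())
```

Running this with the interpreter from .venv should report that the environment is active; running it with the system interpreter should not.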
Install Dependencies
pip3 install -r requirements.txt
To start the assistant, execute the following command:
python main.py
This command initializes the assistant, allowing you to interact via voice commands.
OR
{shell} src/scripts/start_siri_llama3.{sh/fish} main.py
Change the placeholder {shell} to the shell you are using. It can be either bash or fish.
The application logs all interactions in the data/chat_history/ directory. You can review past interactions in the log files to understand the context of your conversations.
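To look up the log for a particular day from a script, something like the sketch below may be enough. It assumes the layout data/chat_history/<YYYY>/<MM>/<DD>.log, which is inferred from the directory tree earlier in this README (2024/10/04.log) and not confirmed by the project code; `chat_log_path` and `read_chat_log` are hypothetical helpers.

```python
from datetime import date
from pathlib import Path

# Assumption: logs live at data/chat_history/<YYYY>/<MM>/<DD>.log, as the
# directory tree suggests. The project may organize them differently.
CHAT_HISTORY_DIR = Path("data") / "chat_history"


def chat_log_path(day, base=CHAT_HISTORY_DIR):
    """Build the expected log file path for a given calendar day."""
    return base / f"{day.year:04d}" / f"{day.month:02d}" / f"{day.day:02d}.log"


def read_chat_log(day, base=CHAT_HISTORY_DIR):
    """Return the log text for that day, or None when no log file exists."""
    path = chat_log_path(day, base)
    return path.read_text(encoding="utf-8") if path.is_file() else None


print(chat_log_path(date(2024, 10, 4)).as_posix())  # data/chat_history/2024/10/04.log
```

Run it from the repository root so that the relative data/ path resolves correctly.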
Any errors that occur when running with the shell script are logged in the logs/ directory.
Give a ⭐️ if this project helped you!