Voice Assisted Desktop Automation for Simple to Complex Tasks using ChatGPT

CommanderGPT

Welcome to the CommanderGPT repository! This project uses OpenAI's GPT-3.5 language model to automate desktop tasks from voice commands. With a single spoken instruction, you can control your desktop environment and run automation tasks ranging from simple to complex.

Key Features

Prerequisites

To use CommanderGPT, make sure the following dependencies are installed.

Linux:

Mac:

Windows:

You will also need an OpenAI API key.

Getting Started

  1. Clone this repository to your local machine.

  2. Install the required dependencies by running the following command:

    pip install -r requirements.txt
  3. Rename config.yml.example to config.yml and update the file with your OpenAI API key. You can customize other parameters if needed.

  4. Execute the script using the following command:

    python main.py
  5. The script will actively listen for voice commands. Alternatively, you can switch to interactive mode by running the script with the --interactive True argument and typing commands manually.

  6. Use the hotword "commander" to activate the script and provide voice commands. For example, say "commander, open the web browser" to launch the web browser.

  7. The script uses OpenAI's GPT-3.5 model to generate Bash scripts from your voice commands. The generated scripts are then executed, automating the desired tasks on your desktop environment.

  8. To exit the script, either type "quit" or "exit," or say the hotword followed by "quit" or "exit" (e.g., "commander, exit").
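
The command handling described in the steps above can be sketched roughly as follows. This is a minimal illustration, not the code in main.py: the helper names (`str_to_bool`, `parse_args`, `extract_command`, `is_exit`) are hypothetical, and only the hotword "commander", the exit words "quit" and "exit", and the `--interactive True` argument come from this README. Speech recognition and the OpenAI call are left out.

```python
import argparse

HOTWORD = "commander"          # hotword named in this README
EXIT_WORDS = {"quit", "exit"}  # exit commands named in this README


def str_to_bool(value: str) -> bool:
    """Interpret the value passed to --interactive (e.g. "True")."""
    return value.strip().lower() in {"true", "1", "yes"}


def parse_args(argv=None) -> argparse.Namespace:
    """Parse the command line; --interactive defaults to voice mode (False)."""
    parser = argparse.ArgumentParser(description="CommanderGPT sketch")
    parser.add_argument("--interactive", type=str_to_bool, default=False)
    return parser.parse_args(argv)


def extract_command(transcript: str):
    """Return the text after the hotword, or None if the hotword is absent."""
    text = transcript.strip()
    if not text.lower().startswith(HOTWORD):
        return None
    rest = text[len(HOTWORD):]
    # Require a word boundary so that "commanders ..." does not match.
    if rest and rest[0].isalnum():
        return None
    # Drop the separator that usually follows the hotword ("commander, ...").
    return rest.lstrip(" ,:.").strip()


def is_exit(command: str) -> bool:
    """True if the command is one of the exit words, ignoring case and a trailing period."""
    return command.strip().lower().rstrip(".!") in EXIT_WORDS
```

In voice mode, a transcript such as "commander, open the web browser" would yield the command "open the web browser", while a transcript without the hotword would be ignored. In interactive mode, the typed line could be passed to `is_exit` directly, since no hotword is needed there.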

Example Usage

Here are a few examples of voice commands you can use:

Feel free to experiment with different commands to see what CommanderGPT can automate!

Video Example

https://github.com/theonlyfoxy/CommanderGPT/assets/12250394/8bed8d6f-46bb-4444-87a9-f015bcb9fbb4

Click the link above to watch a video of CommanderGPT in action.

Todo

Limitations and Improvements

License

This code is licensed under the MIT License.