frdel / agent-zero

Agent Zero AI framework
Other
4.91k stars 1.1k forks source link

Agent Zero

Join our Skool Community Join our Discord Subscribe on YouTube Connect on LinkedIn Follow on X.com

Intro Video

Personal and organic AI framework

Now with Responsive UI, Auto-memory and Reflection:

Web UI

Key concepts

  1. General-purpose assistant

    • Agent Zero is not pre-programmed for specific tasks (but can be). It is meant to be a general-purpose personal assistant. Give it a task, and it will gather information, execute commands and code, cooperate with other agent instances, and do its best to accomplish it.
    • It has a persistent memory, allowing it to memorize previous solutions, code, facts, instructions, etc., to solve tasks faster and more reliably in the future.
  2. Computer as a tool

    • Agent Zero uses the operating system as a tool to accomplish its tasks. It has no single-purpose tools pre-programmed. Instead, it can write its own code and use the terminal to create and use its own tools as needed.
    • The only default tools in its arsenal are online search, memory features, communication (with the user and other agents), and code/terminal execution. Everything else is created by the agent itself or can be extended by the user.
    • Tool usage functionality has been developed from scratch to be the most compatible and reliable, even with very small models.
  1. Multi-agent cooperation

    • Every agent has a superior agent giving it tasks and instructions. Every agent then reports back to its superior.
    • In the case of the first agent in the chain (Agent 0), the superior is the human user; the agent sees no difference.
    • Every agent can create its subordinate agent to help break down and solve subtasks. This helps all agents keep their context clean and focused.
  2. Completely customizable and extensible

    • Almost nothing in this framework is hard-coded. Nothing is hidden. Everything can be extended or changed by the user.
    • The whole behavior is defined by a system prompt in the prompts/default/agent.system.md file. Change this prompt and change the framework dramatically.
    • The framework does not guide or limit the agent in any way. There are no hard-coded rails that agents have to follow.
    • Every prompt, every small message template sent to the agent in its communication loop, can be found in the prompts/ folder and changed.
    • Every default tool can be found in the python/tools/ folder and changed or copied to create new predefined tools.
    • Of course, it is open-source (except for some tools like Perplexity, but that will be replaced with an open-source alternative as well in the future).
  3. Communication is key

    • Give your agent a proper system prompt and instructions, and it can do miracles.
    • Agents can communicate with their superiors and subordinates, asking questions, giving instructions, and providing guidance. Instruct your agents in the system prompt on how to communicate effectively.
    • The terminal interface is real-time streamed and interactive. You can stop and intervene at any point. If you see your agent heading in the wrong direction, just stop and tell it right away.
    • There is a lot of freedom in this framework. You can instruct your agents to regularly report back to superiors asking for permission to continue. You can instruct them to use point-scoring systems when deciding when to delegate subtasks. Superiors can double-check subordinates' results and dispute. The possibilities are endless.

Agent Zero

Nice features to have

Agent 1 System Load

Keep in mind

  1. Agent Zero can be dangerous! With proper instruction, Agent Zero is capable of many things, even potentially dangerous to your computer, data, or accounts. Always run Agent Zero in an isolated environment (like the built in docker container) and be careful what you wish for.

  2. Agent Zero is not pre-programmed; it is prompt-based. The whole framework contains only a minimal amount of code and does not guide the agent in any way. Everything lies in the system prompt in the prompts/ folder. Here you can rewrite the whole framework behavior to your needs. If your agent fails to communicate properly, use tools, reason, use memory, find answers - just instruct it better.

  3. If you cannot provide the ideal environment, let your agent know. Agent Zero is made to be used in an isolated virtual environment (for safety) with some tools preinstalled and configured. If you cannot provide all the necessary conditions or API keys, just change the system prompt and tell your agent what operating system and tools are at its disposal. Nothing is hard-coded; if you do not tell your agent about a certain tool, it will not know about it and will not try to use it.

David Ondrej video

Known problems

  1. The system prompt sucks. You can do better. If you do, help me please :)
  2. The communication between agents and terminal in Docker Container via SSH can sometimes break and stop producing outputs. Sometimes it is because the agent runs something like "server.serve_forever()" which causes the terminal to hang, sometimes a random error can occur. Restarting the agent and/or the docker container helps.
  3. The agent can break his operating system. Sometimes the agent can deactivate virtual environment, uninstall packages, change config etc. Again, removing the docker container and cleaning up the work_dir/ is enough to fix that.

Ideal environment

Time example

Setup

A detailed setup guide for Windows, macOS and Linux with a video can be found in the new Agent Zero Documentation at this page.

Consult the Documentation

The documentation dives deep into the framework and its features. It is a good place to start if you are new to Agent Zero. Click here to see the Documentation.

Coming up

Changelog [since version 0.7]

v0.7.1

v0.7

[!NOTE]
Changes to launch files since v0.6:

  • main.py file has been replaced with run_ui.py (webui) and run_cli.py (terminal) launch files.
  • configuration has been moved to initialize.py for both webui and terminal launch files.