showlab / computer_use_ootb

An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS
MIT License
698 stars 68 forks source link

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

If you like our project, please give us a star ā­ on GitHub for the latest update.
[![arXiv](https://img.shields.io/badge/Arxiv-2411.10323-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2411.10323) [![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fshowlab%2Fcomputer_use_ootb&count_bg=%2379C83D&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=hits&edge_flat=false)](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fshowlab%2Fcomputer_use_ootb&count_bg=%2379C83D&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=hits&edge_flat=false)

šŸŒŸ Overview

This is an out-of-the-box (OOTB) solution for Claude's new Computer Use APIs.

No Docker is required, and it supports both Windows and macOS. This project provides a user-friendly interface based on Gradio. šŸŽØ

For more information, you can visit our project page. šŸŒ

Update

Demo Video

https://github.com/user-attachments/assets/333e536e-93d0-4a97-8c9b-8f0722c29d51

Watch the video

Watch the video

šŸš€ Getting Started

0. Prerequisites

1. Clone the Repository šŸ“‚

Open the Conda Terminal. (After installation Of Miniconda, it will appear in the Start menu.) Run the following command on Conda Terminal.

git clone https://github.com/showlab/computer_use_ootb.git
cd computer_use_ootb

2. Install Dependencies šŸ”§

pip install -r dev-requirements.txt

3. Start the Interface ā–¶ļø

python app.py

If you successfully start the interface, you will see two URLs in the terminal:

* Running on local URL:  http://127.0.0.1:7860
* Running on public URL: https://xxxxxxxxxxxxxxxx.gradio.live (Do not share this link with others, or they will be able to control your computer.)

4. Control Your Computer with Any Device can Access the Internet

Open the website at http://localhost:7860/ (if you're controlling the computer itself) or https://xxxxxxxxxxxxxxxxx.gradio.live in your mobile browser for remote control.

Enter the Anthropic API key (you can obtain it through this website), then give commands to let the AI perform your tasks.

Mobile Interface

gradio_interface

Desktop Interface gradio_interface

šŸ–„ļø Supported Systems

āš ļø Risks

šŸ“… Roadmap

Join Discussion

Welcome to discuss with us and continuously improve the user experience of Computer Use - OOTB. Reach us using this Discord Channel or the WeChat QR code below!

gradio_interface gradio_interface