OthersideAI / self-operating-computer

A framework to enable multimodal models to operate a computer.
https://www.hyperwriteai.com/self-operating-computer
MIT License
8.66k stars 1.14k forks source link

[BUG] -m gemini-pro-vision asking for OPENAI_API_KEY #170

Open FelipeLujan opened 7 months ago

FelipeLujan commented 7 months ago

Found a bug? Please fill out the sections below. 👍

Describe the bug

Ran operate -m gemini-pro-vision, entered my gemini API key from google AIstudio, but when I request something I always get

[Self-Operating Computer | gemini-pro-vision]
Hello, I can help you with anything. What would you like done?
[User]
turn on night mode
[Self-Operating Computer][Operate] That did not work. Trying another method 
[Self-Operating Computer][Error] -> The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable 

Steps to Reproduce

  1. Install with pip
  2. run operate -m gemini-pro-vision and enter API key
  3. insert any prompt
  4. also tried by setting up the GOOGLE_API_KEY env variable in a .env file as well as export GOOGLE_API_KEY=abc123

Expected Behavior

The machine, self-operates

Actual Behavior:

Error in the console

Environment

Screenshots

image

Additional context

Add any other context about the problem here.

ItosEO commented 4 months ago

thw same problem

shengjh commented 4 months ago

use --verbose, it seems that the execution of the instructions given by the gemini failed, so try to request the instructions of chatgpt.