J.A.R.V.I.S

An AI-RPA agent based on

Computer Vision to understand computer screens,
Selenium to operate such screens,
SSH interface to interact with other systems, and
GPT to natural language instructions

1. Getting Started

Install the library.

gem install blackstack-jarvis

Create an instance of Jarvis.

require 'blackstack-jarvis'

jarvis = BlackStack::Jarvis.new(
    # ths is to connect with your OpenAI account.
    # reference: https://platform.openai.com/docs/api-reference/authentication
    openai_api_key: '<your open AI api key here>',
    openai_model: 'gpt-4-1106-preview',

    # this is to operate browsers using AdsPower.
    # reference: https://github.com/leandrosardi/adspower-client    
    adspower_api_key: '<your adspower api key heere>',

    # this is to use dropbox as a cloud storage of screenshots, audios and text files.
    # reference: https://github.com/leandrosardi/my-dropbox-api
    dropbox_refresh_token: '<your dropbox refresh token here>',
)

2. Operating with your local computer

Create a text file with a command like this:

echo -e 'What is the most impressive invention of Leonardo Davinci?' > ~/some.text

Then, you can refer Jarvis to such a file to find an instruction.

p jarvis.q('I wrote some instructions in the file ~/jarvis.txt. Please read it and answer.')
# => "The most impressive invention of Leonardo Davinci is the the flying machine."

In the next sections, we'll store some information in files like:

passwords to access some web platforms,
step by step instructions to perform some operations in such web platforms,
ssh credentials to access remote computers.

leandrosardi / my.jarvis

readme

J.A.R.V.I.S

1. Getting Started

2. Operating with your local computer

3. Operating with other computers

4. Operating with browsers

5. Operating with websites