jasonjmcghee / plock

From anywhere you can type, query and stream the output of an LLM or any other script
MIT License
473 stars 31 forks source link
bash command context cross-platform gpt llm local ollama shell streaming tool utility

Plock

Use an LLM (or anything else that can stream to stdout) directly from literally anywhere you can type. Outputs in real time.

demo

Write a prompt, select it, and (by default) hit Cmd+Shift+.. It will replace your prompt with the output in a streaming fashion.

Also! You can first put something on your clipboard (as in copy some text) before writing / selecting your prompt, and it (by default) Cmd+Shift+/ and it will use the copied text as context to answer your prompt.

For Linux, use Ctrl instead of Cmd.

100% Local by default. (If you want to use an API or something, you can call any shell script you want specified in settings.json)

I show an example settings.json in Settings


🚨 Looking for contributions / help! 🚨

I would love to keep this project alive and growing, but can't do it alone.

If you're at all interested in contributing, please feel free to reach out, start a discussion, open a PR, look at issues, look at roadmap below, etc.

Something not working properly? There's no telemtry or tracking, so I won't know! Please log an issue or take a crack at fixing it yourself and submitting a PR! Have feature ideas? Log an issue!


Demo showing concept of Triggers, and the new flexible system

Demo using GPT-3.5 and GPT-4

If you are going to use this with remote APIs, consider environment variables for your API keys... make sure they exist wherever you launch, or directly embed them (just don't push that code anywhere)

(Original) Demo using Ollama

(in the video I mention rem, another project I'm working on)

Getting Started

Install ollama and make sure to run ollama pull openhermes2.5-mistral or swap it out in settings for something else.

Launch "plock"

Shortcuts:

Ctrl / Cmd + Shift + .: Replace the selected text with the output of the model.

Ctrl / Cmd + Shift + /: Feed whatever is on your clipboard as "context" and the replace the selected text with the output of the model.

(these two are customizable in settings.json)

Escape: Stop any streaming output

Mac will request access to keyboard accessibility.

Linux (untested), may require X11 libs for clipboard stuff and key simulation using enigo. Helpful instructions

Also system tray icons require some extras

Windows (untested), you'll need to swap out Ollama for something else, as it doesn't support windows yet.

[Settings]

There is a settings.json file which you can edit to change shortcuts, the model, prompts, whether to use shell scripts and what they are, and other settings.

After updating, click the tray icon and select "Load Settings" or restart it.

At any time you can click the tray icon and it will list the settings location. For what it's worth:

On mac, It's at ~/Library/Application Support/today.jason.plock/settings.json.

On linux, I think it's ~/$XDG_DATA_HOME/today.jason.plock/settings.json.

Windows, I think it's ~\AppData\Local\today.jason.plock\settings.json

But clicking the icon is the best way.

Correct me if any of these are wrong.

Using Settings

Take a look at the shortcut keys. A “trigger” can be started with a shortcut. That points to a process (by an 0-index) and a prompt (by a 0-index) to the lists defined in the processes and prompts fields.

a process is either “ollama” or a command (shell on mac). You can use that to call your script.

prompts can use one of the two built in variables $CLIPBOARD and $SELECTION, or any others you define using set_env_var trigger.

next_steps defines what happens to the output, which can be written to the screen (streaming or all at once ), saved to a variable, and/or kick off another trigger.

In the future i want to make it easy to trigger flexibly (cron, push), output wherever / however, easily chain things together.

very very open to feedback

Show Example ```json { "environment": { "PERPLEXITY_API": "", "OLLAMA_MODEL": "openhermes2.5-mistral", "OPENAI_API": "" }, "processes": [ { "name": "Use GPT", "command": [ "bash", "/Users/jason/workspace/plock/scripts/gpt.sh" ] }, { "name": "Execute text directly as script", "command": [] }, { "name": "Use perplexity", "command": [ "bash", "/Users/jason/workspace/plock/scripts/p.sh" ] }, { "name": "Use Dall-E", "command": [ "bash", "/Users/jason/workspace/plock/scripts/dalle.sh" ] }, "ollama" ], "prompts": [ { "name": "default basic", "prompt": "$SELECTION" }, { "name": "default with context", "prompt": "I will ask you to do something. Below is some extra context to help do what I ask. --------- $CLIPBOARD --------- Given the above context, please, $SELECTION. DO NOT OUTPUT ANYTHING ELSE." }, { "name": "step", "prompt": "$STEP" }, { "name": "say gpt", "prompt": "say \"$GPT\"" } ], "triggers": [ { "trigger_with_shortcut": "Command+Shift+,", "process": 1, "prompt": 0, "next_steps": [ { "store_as_env_var": "STEP" }, { "trigger": 4 } ], "selection_action": null }, { "trigger_with_shortcut": "Command+Shift+.", "process": 0, "prompt": 0, "next_steps": [ "stream_text_to_screen" ], "selection_action": "newline" }, { "trigger_with_shortcut": "Command+Shift+/", "process": 1, "prompt": 0, "next_steps": [ "write_final_text_to_screen" ], "selection_action": "newline" }, { "trigger_with_shortcut": "Command+Shift+'", "process": 3, "prompt": 0, "next_steps": [ "write_image_to_screen" ], "selection_action": null }, { "trigger_with_shortcut": null, "process": 0, "prompt": 2, "next_steps": [ "stream_text_to_screen", { "store_as_env_var": "GPT" }, { "trigger": 5 } ], "selection_action": null }, { "trigger_with_shortcut": null, "process": 0, "prompt": 3, "next_steps": [], "selection_action": null } ] } ```

Building Plock

If you don't want to blindly trust binaries (you shouldn't), here's how you can build it yourself!

Prerequisites

Installation Steps

Node.js

Download from: https://nodejs.org/

Rust

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source $HOME/.cargo/env

Bun NPM

Whattt?? Why? - well, windows doesn't support bun in github actions afaict. So, I'm using npm instead.

How to Install Node

Project Setup

git clone <repo_url>
cd path/to/project
npm install
npm run tauri dev

Build

npm run tauri build

Inspiration / Another Great Project

Another demo

Another demo where I use the perplexity shell script to generate an answer super fast. Not affiliated, was just replying to a thread lol

https://github.com/jasonjmcghee/plock/assets/1522149/6166af73-545f-4a8e-ad46-ea8aacd84969

Secrets

Curious folks might be wondering what ocr feature is. I took a crack at taking a screenshot, running OCR, and using that for context, instead of copying text manually. Long story short, rusty-tesseract really dissapointed me, which is awkward b/c it's core to xrem.

If someone wants to figure this out... this could be really cool, especially with multi-modal models.