Use an LLM (or anything else that can stream to stdout) directly from literally anywhere you can type. Outputs in real time.
Write a prompt, select it, and (by default) hit Cmd+Shift+.
. It will replace your prompt with the output in a
streaming fashion.
Also! You can first put something on your clipboard (as in copy some text) before writing / selecting your prompt, and
it (by default) Cmd+Shift+/
and it will use the copied text as context to answer your prompt.
For Linux, use Ctrl
instead of Cmd
.
100% Local by default. (If you want to use an API or something, you can call any shell script you want specified
in settings.json
)
I show an example settings.json
in Settings
I would love to keep this project alive and growing, but can't do it alone.
If you're at all interested in contributing, please feel free to reach out, start a discussion, open a PR, look at issues, look at roadmap below, etc.
Something not working properly? There's no telemtry or tracking, so I won't know! Please log an issue or take a crack at fixing it yourself and submitting a PR! Have feature ideas? Log an issue!
If you are going to use this with remote APIs, consider environment variables for your API keys... make sure they exist wherever you launch, or directly embed them (just don't push that code anywhere)
(in the video I mention rem, another project I'm working on)
Install ollama and make sure to run ollama pull openhermes2.5-mistral
or swap
it out in settings for something else.
Launch "plock"
Shortcuts:
Ctrl / Cmd + Shift + .
: Replace the selected text with the output of the model.
Ctrl / Cmd + Shift + /
: Feed whatever is on your clipboard as "context" and the replace the selected text with the
output of the model.
(these two are customizable in settings.json
)
Escape
: Stop any streaming output
Mac will request access to keyboard accessibility.
Linux (untested), may require X11 libs for clipboard stuff and key simulation using enigo. Helpful instructions
Also system tray icons require some extras
Windows (untested), you'll need to swap out Ollama for something else, as it doesn't support windows yet.
There is a settings.json
file which you can edit to change shortcuts, the model,
prompts, whether to use shell scripts and what they are, and other settings.
After updating, click the tray icon and select "Load Settings" or restart it.
At any time you can click the tray icon and it will list the settings location. For what it's worth:
On mac, It's at ~/Library/Application Support/today.jason.plock/settings.json
.
On linux, I think it's ~/$XDG_DATA_HOME/today.jason.plock/settings.json
.
Windows, I think it's ~\AppData\Local\today.jason.plock\settings.json
But clicking the icon is the best way.
Correct me if any of these are wrong.
Take a look at the shortcut keys. A “trigger” can be started with a shortcut. That points to a process (by an 0-index) and a prompt (by a 0-index) to the lists defined in the processes and prompts fields.
a process is either “ollama” or a command (shell on mac). You can use that to call your script.
prompts can use one of the two built in variables $CLIPBOARD and $SELECTION, or any others you define using set_env_var trigger.
next_steps defines what happens to the output, which can be written to the screen (streaming or all at once ), saved to a variable, and/or kick off another trigger.
In the future i want to make it easy to trigger flexibly (cron, push), output wherever / however, easily chain things together.
very very open to feedback
If you don't want to blindly trust binaries (you shouldn't), here's how you can build it yourself!
Download from: https://nodejs.org/
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source $HOME/.cargo/env
Whattt?? Why? - well, windows doesn't support bun
in github actions afaict. So, I'm using npm instead.
git clone <repo_url>
cd path/to/project
npm install
npm run tauri dev
npm run tauri build
Another demo where I use the perplexity shell script to generate an answer super fast. Not affiliated, was just replying to a thread lol
https://github.com/jasonjmcghee/plock/assets/1522149/6166af73-545f-4a8e-ad46-ea8aacd84969
Curious folks might be wondering what ocr
feature is. I took a crack at taking a screenshot,
running OCR, and using that for context, instead of copying text manually. Long story short,
rusty-tesseract really dissapointed me, which is awkward b/c it's core
to xrem.
If someone wants to figure this out... this could be really cool, especially with multi-modal models.