wolfmanstout / talon-gaze-ocr


talon-gaze-ocr

Talon scripts to enable advanced cursor control using eye tracking and text recognition (OCR). This is alpha functionality which uses experimental/unsupported APIs, so it could break at any time. See my blog post for an overview.

Installation

Prerequisite: community must be installed as a sibling of this repo, i.e. in the same parent directory.

git clone this repo into your Talon user scripts folder. If you are on Talon Beta, switch to the "beta" branch (add flag -b beta) to take advantage of beta-only features.
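As a rough sketch, the clone step might look like the following. The `TALON_USER` variable and its default path are assumptions for illustration (`~/.talon/user` is the usual location on Mac; on Windows it is typically `%APPDATA%\talon\user`), so adjust them to match your setup.

```shell
# Assumed default location of the Talon user folder (Mac).
# On Windows the usual location is %APPDATA%\talon\user; adjust as needed.
TALON_USER="${TALON_USER:-$HOME/.talon/user}"

# Clone next to your existing community checkout so the two are siblings.
git clone https://github.com/wolfmanstout/talon-gaze-ocr.git "$TALON_USER/talon-gaze-ocr"

# Talon Beta users: clone the "beta" branch instead of running the line above.
# git clone -b beta https://github.com/wolfmanstout/talon-gaze-ocr.git "$TALON_USER/talon-gaze-ocr"
```

After cloning, `talon-gaze-ocr` and `community` should sit side by side inside the user folder.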

Required permissions:

Required Python packages:

Note: This installation process is for Windows and Mac. Linux is not officially supported because Talon does not provide an OCR API on that platform. You may be able to get this working using Tesseract, but that configuration is not tested by the author.

Features:

Known limitations:

Dependencies

The .subtrees directory contains dependency packages needed by talon-gaze-ocr:

Changes in this repository are automatically pushed out to gaze-ocr and screen-ocr via a GitHub Action that uses git subtree.

Running without community

As noted in the installation instructions, community is highly recommended, but most functionality will still be available in degraded form without it (and you will see some warning logs). Missing functionality: