Anki 2.1 addon to generate OCR text from images inside of Anki notes/cards. Note that this is only designed for computer generated text, not handwritten.
The aim of this addon was to generate searchable text for image-heavy notes, it is not intended to produce high quality, perfectly ordered text!
Note that because this addon changes the note template, you will see a warning about changing the database and uploading to AnkiWeb. This is normal.
Open the card browser and select the note(s) you want to process. Use the search bar at the top, select tags, decks, etc.
On the toolbar at the top, select 'Cards', then 'AnkiOCR', and select 'Run AnkiOCR on selected notes', as shown below
title
html tag, viewable as a tooltip:If you wish to have the OCR data outputted to a separate 'OCR' field on the note, which will modify your note types in your deck, you can set the text_output_location
config option to new_field
If you want to add new languages, you need to download the appropriate language data from here.
AnkiOCR depends on the Tesseract OCR library.
If you're on Windows or Mac, tesseract is bundled with the addon.
If you're on Linux carefully follow the instructions here
AnkiOCR was built on Python 3.9.
It is highly recommended to to use inside the Anki application, by installing the addon from AnkiWeb If you want to run it externally to anki, see below:
git clone https://github.com/cfculhane/AnkiOCR
make install
make test